We provide IT Staff Augmentation Services!

Senior Architect - Big Data Analytics Resume

2.00/5 (Submit Your Rating)

Alpharetta, GeorgiA

PROFESSIONAL SUMMARY:

  • Has more than 21 years of progressive IT experience in large - scale complex end-to-end application software development, product development, technology operations, managing and liaising with multi-cultural, multi-location teams, Vendor Management, Budget Management, Project Management, Process Improvement, Change Management, Release Management.
  • Around 5+ years experience in ETL, Business Intelligence / Data Warehousing and reporting using - SAP BW/BI, SAP BO, Cognos, Informatica, Greenplum, SQL Server - SSRS, SSIS, SSAS, Teradata, Datastage, Tabaleau, RapidMiner. Working knowledge of SAP HANA integration to Hadoop using HANA smart data access connectivity tool
  • Oversight over database management, data migrations, specification of production monitoring requirements for new products and features, cost-tracking for existing applications and services, infrastructure enhancements/changes to improve existing applications/services and/or reduce internal costs
  • 5+ years of Big Data experience using HADOOP ecosystem tools and frameworks. Experience in Mongo DB, Cassandra, Oozie, HIVE, PIG, HBASE, Sqoop, Mapreduce, PySpark, Tez, Spark, HDFS, Shell scripting, MYSQL, PYTHON, Core-Java, RESTFul web-services, Kafka, Storm. Experience with data analytics, data mining computational methods to parse, link and develop patterns out of enterprise and consumer data, machine learning algorithms. Experience in statistical analysis software R, Mahout and SAS. Working experience in the cloud with AWS, Azure and Big Insights
  • Well versed in installing and managing distributons of Hadoop ( Cloudera, Hortonworks, Pivotal ). Knowledge in performance troubleshooting and tuning Hadoop Clusters. Good knowledge of Hadoop cluster connectivity and security. Linux system administration knowledge, understanding of Storage, Filesystem, Disks, Mounts, Network Elements etc. Experience in designing Data lakes. PoC and Use cases around data ingestion, metadata management, audit trails, governance and data security. Familiar with Optimized Row Columnar, Compression techniques, optimizing JOINs
  • Has both Big Data and relational database experience and deep knowledge of data modelling, MDM, Governance, Data Quality Assurance, Digital Marketing, Predictive and Prescriptive Analytics, data organization and storage technology, experience with high volumes and able to architect and implement multi-tier solutions using right technology in each tier, based on fit. Data modelling using ERWin tool.
  • Around 7 years of Telecommunication Domain experience in implementing AMDOCS-ENSEMBLE, Enabler, OSS/BSS, CRM, Ordering and Billing at Client Sites in Germany, Cyprus, Israel, USA, Australia, India and Canada. Working experience on Ericsson Billing Systems.
  • Thorough knowledge and experience with business requirements gathering & analysis, gap assessment, development of functional and technical specifications, data modeling, loads scheduling, testing, Reports development and user training and support.
  • Active team player and leader with excellent interpersonal skills, analytical skills, and effective written & oral communication and presentation skills.. Took many customer facing roles in building a dynamic work relation with client with continued focus on delivery and customer satisfaction.
  • Experience Software Version management and Quality Assurance work. Experience in designing, developing and executing test strategies, test plans and test cases.
  • Strong analytical skills with detail follow-through and problem solving skills in business process mapping and analysis.
  • 4 years experience in Agile environment. Experience in managing offshore and onsite team in Agile environment in creating and managing user-stories and tracking progress, identifying bottleneck and addressing them
  • Extensive experience working with business users as well as senior management in Retail, Telecom, Financial, Banking and Healthcare industry

TECHNICAL SKILLS:

Operating Systems : UNIX/LINUX, Sun Solaris, AIX, MVS, Windows 2000/NT/XP/98/95

RDBMS : Oracle 8i, 9i, 10g, 11g, MS SQL Server 2005, DB2

NoSQL : MongoDB, Cassendra, Couchbase

OLAP Tools : OBIEE 10g/11g, SAP BO, SAP Business Explorer (BEx) Analyzer, BEx Browser, Web reporting

ETL / ERP Tools : Informatica 7.1/8.x/9.x, Teradata, Cognos, SAP HANA, SAP ECC 6.0, SAP BI/BO

BI Tools : OBIEE,, Crystal Reports, SAP Business Explorer (BEx) Analyzer, BEx Browser, Web reporting

Languages : SQL, PL/SQL, JAVA, Shell Scripts, Python, R, C, C++, Pro* C, COBOL, Grunt Shell

Internet Programming: JavaScript, VBScript, HTML, and DHTML

Big Data Technology: Cloudera, Hortonworks Distributions,, Sqoop, Pig, Hive, HDFS, MapReduce, Java, Impala, Hcat, HUE, flume, Spark, Hbase,Oozie, Zookeeper, Kafka, Asterdata, Vertica, Accumulo

Other Tools : Amdocs Ensemble, Amdocs Document Designer, OSS/BSS, Online charging, Enabler, AES 7.0, Ericsson Billing, Amazon Web Service, S3, EMR, SQL*LOADER, TOAD, PL/SQL Developer, ClearCase/Quest, HP Quality Center, XML, MS Project, Visio, TOAD, TEST DIRECTOR, WIN RUNNER, Dimensions, ChangeSynergy, ServiceDesk, GIT, MAVEN, Harvest, Testbase, Relational ClearQuest, Relational Quality Management, working knowledge of UML, XML and JCL, Gradle, Maven, Jenkins, Control M

PROFESSIONAL EXPERIENCE:

Senior Architect - Big Data analytics

Confidential, Alpharetta, Georgia

Responsibilities:

  • Define and apply enterprise level standards and influence business strategy
  • Coordinate solutions from technical perspective and minimize technical risk
  • Clarify functional and non-functional requirements with BAs and Confidential Practitioners
  • TPCDS performance benchmarking of BIGSQL with 10x100 GB load in 6-node BigInsights cluster in Azure cloud and present the results with IT and Business stakeholders
  • Information Security to ensure compliance with secure development standards and negotiate security trade-offs
  • Use Restful APIs to trigger Rapid Miner workflows to ingest data into BIGSQL and SQL server from various data sources
  • Operations to take into account operational (non-functional) requirement
  • Transferred the analyzed data across relational database from HDFS using Sqoop enabling BI team to visualize analytics.
  • Real Time data ingestions into cold storage ( BIG SQL) and warm storage (SQL server ) using Kafka and Spark streaming
  • Develop workflows for ETL and Data Science using Rapid Miner to generate the risk indicators and risk scores. Calculate composite risk scores on daily basis and generate alerts for Insider Threat and Bankers Supervision Program.
  • ETL to Hive/BigSQL/SQLServer, data architecture, data modelling, data analysis, data profiling, metadata management
  • Design Logging and Audit Trails framework for RapidMiner ETL and Data Science jobs
  • Provide Multi-Tenant architecture solution for different client engagement using Ranger security features
  • Well-versed with the client's chosen mode of project delivery, SAFE Agile and the related tools (TFS).

Environment: Azure - Big Insights, Rapid Miner, Multi-Tenant architecture, Tabaleau, BIGSQL, SQL Server, Kafka, Spark, Python, JAVA, GIT,TFS, Hadoop, HDFS, Map Reduce, Linux, Hive, Big Data, GIT, Stash, Spark, Mongo DB, Big Data eco system tools, PERL,Scala, Python, Falcon, YARN, AngularJS, Ranger, Dot Net

Solutions Architect - Big Data analytics

Confidential, Bentonville, Arkansas

Responsibilities:

  • Design schemas and create HIVE tables for importing data from multiple data sources using sqoop using dynamic trigger mechanism
  • Designed solution for loading offers data created in Instant Savings in real time into Hadoop using RESTFUL APIs and oozie web services
  • Involved in processing the data in the Hive tables using HQL   high-performance, low-latency queries on Tez.
  • Transferred the analyzed data across relational database from HDFS using Sqoop enabling BI team to visualize analytics.
  • Processed hive tables using Spark hiveContext.
  • Created and populated bucketed tables in Hive to allow for faster mapside joins and for more efficient jobs and more efficient sampling. Also performed partitioning of data to optimize Hive queries
  • Design Offer Assignment scalability solution of Super node to process 60 million personalized membership data where we moved the OFFER ASSIGNMENT algorithm in Sams Personalize Engine (SPE) from hadoop cluster to Super node where we achieved 4X performance improvement
  • Design/ Architect/ Implement various solutions arising out of the large data processing (GB’s/ PB’s) environment
  • Real Time Load data from mainframe system to Oracle and Hadoop Backend using APIs, Web Services, SOAP and/or REST services and vice versa
  • Apply other HDFS formats and structure (Avro, Optimized Row Column, Parquet) to support fast retrieval of data, user analytics and analysis.
  • Implemented automation to reduce manual administrative tasks through use of jobs and scripts in Big Data migration areas ( Customer Knowledge Platform, Member Engagement and Campaign Management )
  • Handling data ingestion process using generic scripts / falcon / oozie jobs that obtains the metadata/parameters necessary to bring the data in from the landing area of the edge node.
  • Support business to complete Campaigns runs in production for Mass/ External campaigns and resolve production issues in timely manner.
  • Data Ingestion from Model Centralization, Governance, Data Quality Assurance, Digital Marketing, and Predictive analysis
  • Identified areas of improvement in the analytical models like Offer Assignment, Backfill and Offer Load
  • Designed implementation strategies for improving overall efficacy of the critical processes in Offer Assignment and Backfill Models.
  • Provided technical guidance to the team based on the application specific know-how and customer expectations.
  • Well-versed with the client's chosen mode of project delivery, SAFE Agile and the related tools (JIRA,Teamforge,Rally).

Environment: Pivotal Distribution and Hortonworks distribution of Hadoop. ORACLE, IBM-AIX/, Linux, Teradata, DataStage, Netezza, HIVE, Pig, HBASE, Scoop, Impala, Flume, Kafka, Storm, Python, JAVA , GIT,Jenkins, Maven, SVN, Hadoop , HDFS, Map Reduce, Linux , Hive , Big Data, GIT, Stash, Spark, Mongo DB, Big Data eco system tools, PERL,Scala, Python, Falcon, YARN, AngularJS, Ranger, JAVA

Solution Architect - Revenue Management/ EDW/ Big Data

Confidential

Responsibilities:

  • Instituted stability and reliability program across IT including planning, execution, reporting and discussing findings and remediation plan with SLT, on applications, service architecture
  • Improved billing SLA by one day and reduced system outages.
  • Developed and implemented a long-term survival roadmap for legacy applications enhancement and infrastructure projects according to business projections and customer experience. Contributed within various governance bodies to develop IT wide standards/policies and processes.
  • Managed relationship with all vendors in the environment.
  • Remediated PCI, SOX compliance, securities and vulnerabilities findings.
  • Attained improved application support and operational efficiencies by process enhancements
  • Successfully designed, developed and Implemented End to End Billing Optimization project to reduce the run time of billing maps by 40%
  • Monitoring batch jobs and provided solutions to systems performance issues
  • Investigated production failures and provided resolution to these issues. Diagnosed and debugged issues.
  • Solutioning and implementing with Amdocs for Rogers Long Time Survival projects for Server upgradation from UNIX to LINUX, ORACLE upgradation from 9i to 11g and application porting to the new Linux platform
  • Design and build scalable infrastructure and platform to collect, process and analyze very large amounts of data (structured and unstructured), including streaming real-time data
  • Resolved issues relating to performance, scalability, capacity and reliability
  • Harvested server logs and filtered error logs for processing in different stream.
  • Collaborate with development and strategy teams on component and 3rd party tool identification, recommendation, installation and management of MapReduce jobs.
  • architect and develop big data solutions streaming/batch using Hadoop technologies
  • Designed solution road map for Hadoop integration projects with Revenue Management applications
  • ETL Architecture & Design, Visualization/Reporting Architecture & Design
  • Review technical deliverables & provide feedback
  •  Interact with business users to gather detailed requirements, Design Audit & Security framework
  • Hands on experience in loading data from UNIX file system to HDFS. Also performed parallel transfer of data from landing zone to the HDFS file system using DistCp.
  • Worked on loading and transforming of large sets of structured and semi structured data from HDFS through Sqoop and placed in HDFS for further processing.

Environment: HP-UX (Itanium)/ ORACLE, IBM-AIX/, Linux,ORACLE, SQL Server, Cognos, Teradata, Erwin, DataStage, Netezza, Hortonworks distribution of Hadoop, HIVE, Pig, HBASE, Scoop, Flume, Kafka, Storm, Python, JAVA , GIT, JAVA, Core JAVA , Hadoop , HDFS, Map Reduce, Linux , Hive , Big Data, MSSQL,GIT, Mongo DB, Scala, Python,AWS. Hortonworks distribution of Hadoop

Solution Architect EDW and Big Data

Confidential, Texas

Responsibilities:

  • Participation in code reviews to assure that developed and tested code conforms with the design and architecture principles  
  • QA and testing of modules/applications/interfaces.
  • Preparation of documentation of data architecture, designs and implemented code
  • Provide input on operational efficiencies to streamline and optimize platform operations.
  • Diagnose, assess, troubleshoot and fix issues within the Open Source environment.
  • Documentation of all environmental settings and configurations.
  • Planning and upgrading of the environments both hardware and software .
  • Ensure platform decisions result in high availability and stability of the systems in production.
  • Did Linux Scripting, Pig Scripts, Hive queries.
  • Data transformation and loading using pig, Hive
  • Did design/deploy Hadoop Infrastructure solutions (high availability, big data clusters, elastic load tolerance)
  • Resolved issues relating to performance, scalability, capacity and reliability
  • Troubleshoot and debug Hadoop ecosystem run-time issues
  • Access suitability and quality of candidate data sets for the Data Lake and EDW
  • Apply other HDFS formats and structure (Avro, Parquet, etc. ) to support fast retrieval of data, user analytics and analysis
  • Cloudera installation, configuration and tuning
  • Importing and exporting data using Sqoop from HDFS to Relational Database systems/mainframe and vice-versa.
  • Implemented architectural concepts (Multi-tenancy, SOA, SCA ) and identified and incorporating various NFR’s (performance, scalability, monitoring etc)

Environment: HP-UX (Itanium)/ ORACLE, IBM-AIX/ORACLE, Cognos, SAS, WebFocus, ReportEase, Mapinfo, Essbase, Erwin, Netezza, Oracle, SQL Server, DataStage , Teradata, Cloudera distribution of Hadoop, Core JAVA , Hadoop , HDFS, Map Reduce, Linux , Hive , PIG, Big Data, MSSQL, GIT, Asterdata, Vertica,,Scala , Mongo DB

Billing Solutions Prime/ Designer

Confidential

Responsibilities:

  • Participated in the project from initial blue printing, requirements gathering, transform business functional requirements into technical data model and query specifications.
  • Configuration of the systems, testing and end user training, writing functional design specifications for extracting and reporting.
  • Write test schedule and test scenarios for integration testing, update technical documentation for processes and procedures
  • Working directly with business analysts, end users and project teams to develop new Business Intelligence functionality from specifications
  • Participating in user acceptance testing (including scenario development), and documenting test results
  • Data loading, creating aggregates, SQL queries performance tuning
  • Write Linux/UNIX shell scripts (BASH) to execute ETL processes that ingest, transform, and publish highly refined analytical data sets
  • Master data management
  • Converting complex requirements into an optimal design, BW Data Modelling
  • Developing reports using Tabaleau
  • Coordinating testing, and performing change management / training
  • Monitoring, communicating and troubleshooting issues with data integrity, data design, and functional and technical software issues
  • Preparing project-related documentation (object designs, business rules, technical specifications, etc.) throughout various stages of the project

SAP BW Functional Consultant

Confidential, NY

Responsibilities:

  • Performing analysis and configuration in SAP BW environment to facilitate various analytics solutions
  • Working directly with business analysts, end users and project teams to develop new Business Intelligence functionality from specifications
  • Converting complex requirements into an optimal design, BW Data Modelling
  • Developing reports using BEx toolset - BEx Query Designer and Web Application Designer
  • Coordinating testing, and performing change management / training
  • Monitoring, communicating and troubleshooting issues with data integrity, data design, and functional and technical software issues
  • Preparing project-related documentation (object designs, business rules, technical specifications, etc.) throughout various stages of the project
  • Supported ad-hoc report requirements from business for analysisAdhering to BW best practices and standardsParticipating in user acceptance testing (including scenario development), and documenting test results
  • architected data warehousing using Kimball methodology

Environment: UNIX/ Oracle, SQL Server, DataStage , Informatica, Core JAVA , ECC 6.0, DAP BW/BI, MSSQL, TSYS, Banking domain, Loan / Mortgages, Agile methodologies

Senior Solution Analyst

Confidential, Kansas

Responsibilities:

  • Involved all phases of a project including Analysis, Design, Development, and Testing, Development of solutions, as well as Documentation and end-user training
  • Spearheaded conversion requirement, cost estimation, development, testing, implementation and post-production support phases.
  • Executed end-to-end delivery of conversion activities for voice & data usages, customer management, financials, number management, interfaces and reporting.
  • Debugged issues/defects, prioritized and rectified them.
  • Co-ordinated in multi vendor environments with cross-functional teams to understand requirements, existing processes and practices in order to introduce right solution.
  • Managed and assigned work to onsite and offshore teams in direct and matrix reporting.
  • Investigate Remedy Tickets in the Production & Product Test environment , identify root cause and prepare estimates for resolution.

Team Leader

Confidential

Responsibilities:

  • Responsible for leading a development team of 3 people.
  • Design and development of bill hierarchy for cellular, Cable, Landline & Messaging customers and departmentalized bill for corporate customers.
  • Did coding of the paper and non-paper formats. Code Inspections, documentation of Impact Assessment , Specifications, Mapping documents.
  • Design test cases , create test environment for testing, validation. My role was also to handle interface issues with other subsystems - Customer Support Management, Message Processing System, Billing and Print Shops.

Programmer/ Systems Analyst

Confidential

Responsibilities:

  • Liaison between client, onsite and offshore teams.
  • Manage requirements, discuss solution, develop, implement and provide support.
  • Impart training to the user departments.
  • Successfully delivered several solutions in production.
  • Worked with several clients at the District and State and National level.
  • Received several recognitions from District administration for successfully implementing Public Distribution System in Jorhat district.

We'd love your feedback!