Manager, Application Master And Containers Resume
SUMMARY:
- Team Management § Troubleshooting and Problem Solving § Territory Management
- Infrastructure/Server maintenance/setup § BigData § DW/ETL/ELT Tool Administration and Application Development § Quality Process Management § Application/Service Oriented Architecture § Requirements analysis and design
- IBM Certified InfoSphere Warehouse 9.5 Designer and IBM Certified InfoSphere DataStage 8.5 Solution Developer. Enterprise data architect and solution developer with 13 years of overall IT experience (including 9 years onsite in the USA) spanning legacy and current technologies in distributed systems, Big Data, DWH and ETL, and client-server applications, with more than 2 years of hands-on experience with Big Data technologies at leading IT organizations (Confidential/TSYS/American Express/IBM/Ascential), leveraging technology to accomplish strategic and tactical business objectives. Recent experience involves helping organizations implement major expense-reducing, revenue-enhancing, competitive, and business-flexibility improvements in the roles of enterprise data architect, technical lead, and strategic IT planner. Innovative and visionary, with a background in business domains such as Airlines, Merchant, Payment, Banking, and Financial Solutions. Excellent communication, motivational, and managerial skills, with experience in the following areas:
- Project Management
- Enterprise Data Architecture
- Enterprise Technology Architecture
- Big Data Cloudera & HDP Distribution
- Data Management, Data Modeling
- Hadoop Administration & Architecture
- ETL Infrastructure
- Data Engineering/Science
- Proven history of building large-scale data processing systems and serving as an expert in data warehousing while working with a variety of database technologies. Experience architecting highly scalable, distributed systems using different open source tools, as well as designing and optimizing large, multi-terabyte data warehouses. Able to integrate state-of-the-art Big Data technologies into the overall architecture, working closely with business users to understand line-of-business requirements and design efficient and effective solutions on Hadoop. Responsible for planning and operationalizing Hadoop clusters, monitoring cluster heartbeats, providing infrastructure recommendations and capacity planning, and leading a team of developers through the construction, testing, and implementation phases.
- Consulted with business partners and made recommendations to improve the effectiveness of Big Data systems and descriptive/prescriptive analytics systems. Integrated new tools and developed technology frameworks and prototypes to accelerate the data integration process and empower the deployment of predictive analytics. Working knowledge of machine learning and predictive modeling.
- Experience designing, reviewing, implementing, and optimizing data transformation processes in the Hadoop ecosystem. Able to consolidate, validate, and cleanse data from a vast range of sources, from applications and databases to files and web services.
- Capable of extracting data from existing databases, web sources, or APIs. Experience designing and implementing fast and efficient data acquisition using Big Data processing techniques and tools.
- Good understanding of classic Hadoop and YARN architecture along with various Hadoop daemons such
TECHNICAL SKILLS:
BigData Technologies: HDFS, Hive, Pig, Hadoop Streaming, MapReduce, R, Sqoop, Flume
NoSQL: HBase, MongoDB
ETL TOOL: IBM DataStage/IIS 8.7, Ab Initio (GDE, EME & Dataprofiler), Informatica PC 9
LANGUAGES: C, C++, Java, Python, Unix Scripting, IPC, SQL, PL/SQL
Databases: DB2 UDB 7 & 8, Oracle 9i, Informix, Teradata, MySQL
OPERATING SYSTEMS: UNIX (AIX, HP-UX, SunOS, & Linux), MS Windows
VERSION CONTROL: Rational ClearCase, EME - Co-OS (Ab Initio), MS TFS, Subversion, CMVC, CVS
TOOLS: Tivoli, BMC Control-M, Avro, GitHub, Maven, Elastic, Solr, AutoSys, IBM Maestro, gdb, dbx, Hummingbird Exceed
PROFESSIONAL EXPERIENCE:
Confidential
Manager, Application Master and Containers
Responsibilities:
- Expert-level scripting with Pig scripts and Hive queries for processing and analyzing large volumes of data.
- Extensively worked on importing and exporting of data using Sqoop from HDFS to RDBMS and RDBMS to HDFS.
- Designed and developed Java classes for binary/Avro file formats.
- Expert understanding of data structures and algorithms for optimization.
- Experience with the NoSQL databases MongoDB and Cassandra.
- Hands-on experience with CDH3 and CDH4; expert-level subject and architectural knowledge of HDP 2.0 (Hortonworks).
- Experienced in collecting log data and JSON data into HDFS using Flume and processing the data using Hive/Pig.
- Provided technical direction for development, design, and systems integration for client management, from requirements analysis to implementation.
- Sound knowledge of text processing and analysis using various Unix tools (awk, Perl, sed, shell programs), Python, Ruby, JSON, and Hadoop stack tools such as Pig, Hive, HiveQL, and UDFs.
- Experience troubleshooting errors in HBase Shell/API, Pig, Hive, and MapReduce, and analyzing application errors, cluster logs, and daemon logs.
- Experienced in installing and running various Oozie workflows and automating parallel job executions.
- Experience in running shell scripts using Hadoop Streaming.
- Experience in HBase cluster configuration, deployment and troubleshooting.
- Expert-level programming and problem-solving skills. Ability to prioritize workload and consistently meet deadlines.
- Experience in Sqoop bulk loads to and from HDFS, providing tactical solutions for handling high-volume data loads as large as 10 GB in parallel.
- Experience in parsing and loading files/logs which are structured, semi-structured and un-structured.
- Experience installing and upgrading patches and new components from GitHub and Maven.
Confidential
Responsibilities:
- Provided solutions to build prototypes and POCs.
- Provided technical guidance with SDLC design for enterprise applications.
- Participated in gathering and analysis of business and technical requirements.
- Implemented Big Data solutions and analyzed virtual machine requirements.
- Reviewed administrator process and updated system configuration documentation.
- Formulated and executed design standards for data analytical systems.
- Involved in setting up the 64-node cluster and configuring the Hadoop platform.
- Migrated data from Oracle Exa and Teradata into HDFS using Sqoop and imported various formats of flat files into HDFS.
- Worked on Hive queries to categorize the data from different claims.
- Integrated the Hive warehouse with HBase for information sharing among subject areas.
- Designed and created Hive external tables using a shared metastore, with support for static and dynamic partitioning for faster data retrieval. Deployed and supported MapReduce programs running on the cluster.
- Developed HiveQL scripts to create, load, and query tables for extracting summarized information.
- Cleansed unstructured data using Pig scripts, generated analytical data, and stored the data in Hive tables.
- Developed MapReduce Java Classes to process and transform Net-Tracer files for baggage processing.
- Performed DEG subject-level DWH archivals from Oracle to HDFS using Sqoop.
- Scheduled and defined job flows and workflows using Oozie.
- Developed an internal framework to simplify and speed up the development phase.
- Contributed to the POC and provided subject-area statistical information to design an HBase warehouse on the Hadoop cluster.
- Designed and developed solutions using the Hadoop ecosystem: HDFS, MapReduce, Pig, Hive, Impala, HBase, and ZooKeeper.
- Successfully developed and deployed 3 robust, high-volume batch processing applications.
- Participated in design reviews to ensure highly modular, portable, and performance-optimized designs.
- Worked closely with engineers and other groups to design and architect solutions, and mentored team members.
Environment: Hadoop, Hive, HBase, BigData, HiveQL, MapReduce, Oozie, HDFS, Pig, Impala, ZooKeeper
Confidential
Responsibilities:
- Installed and configured Hadoop, MapReduce, and HDFS; developed multiple MapReduce jobs in Java for data cleaning and processing. Imported and exported data into HDFS and Hive using Sqoop. Managed cluster coordination services using ZooKeeper.
- Provided solutions using large-scale server-side systems with distributed processing algorithms and the Hadoop ecosystem: HDFS, MapReduce, Pig, Hive, HBase, and ZooKeeper. Wrote HiveQL scripts to analyze merchant data and determine merchant demographics.
- Actively involved in code review and bug fixing for improving the performance.
- Data migration from Informix/Oracle Exa to Hadoop for building advanced data analytics to achieve better performance.
- Developed MapReduce programs using Python/Ruby/Unix scripts.
- Supported and mentored the developer team and assisted in troubleshooting and optimizing MapReduce jobs and Pig Latin scripts. Provided hardware and architectural guidance, planned and estimated cluster capacity, and created roadmaps for Hadoop cluster deployment. Deep understanding of Hadoop design principles, cluster connectivity, security, and the factors that affect distributed system performance.
- Managed and scheduled jobs on the Hadoop cluster using Oozie. Performed end-to-end performance tuning of Hadoop clusters and MapReduce routines against very large datasets. Conducted functional and performance testing and engineering. Developed UDFs with Pig and HiveQL queries to preprocess data for analysis and file parsing.
- Imported and exported data into HDFS using Sqoop. Used Cloudera Manager to manage Hadoop operations.
- Involved in moving all log files generated from various sources to HDFS through Flume for further processing.
- Experience managing and reviewing Hadoop log files. Worked with Agile and Waterfall development methodologies.
- Good understanding of ETL tools and how they can be applied in Big Data environment.
- Worked extensively on several ETL assignments to extract, transform, and load data into tables as part of data warehouse development with highly complex relational, star, and snowflake schema data models.
Environment: Hadoop, ETL, BigData, Hive, HDFS, HBase, Pig, HiveQL, MapReduce, ZooKeeper, Agile, Snowflake Schema.
Confidential
Senior Application Programmer § Client: American Express
Responsibilities:
- Handled on-site coordination, project planning, monitoring, controlling, and project lead activities. Established detailed program specifications through discussions with clients. Generated and implemented possible solutions to predicted problems. Provided critical recommendations to senior management on performance improvements. Oversaw data analysis for the GCST Card & Travel application and provided application support, production ticket handling, and issue resolution. Planned and executed business development for venture funding of financial products. Managed the ETL/DW/BI development team using the Ab Initio tool and handled EME administration.
- Ensured smooth running of business products and applications; troubleshot technical issues and adhered to quality assurance standards to surpass customer expectations; acted as liaison with other departments with regard to technical issues and solution design. Administered client interfacing, SDLC, and maintenance for Ascential DataStage product development; handled project planning, system integration, UAT, and implementation for the South-east Asia regional project, and provided enhanced product functionality and design reviews.
- Led the offshore development team and drove core technical analysis for various initiatives for the PX-Interrogator module.
- Additional experience in computer programming, data warehousing, and application design, development, and support.
