Hadoop Developer Resume
Dearborn, USA
PROFESSIONAL SUMMARY:
- 10 years of experience in object-oriented analysis, design and development using Java/J2EE, and in supporting Data Warehouse and Business Intelligence solutions (Operational Data Store, Enterprise Data Warehouse, Data Marts, ETL, OLAP, client/server) using Informatica (ETL tool).
- 3 years of hands-on experience in Hadoop ecosystem technologies such as Pig, Hive, HBase, ZooKeeper, MapReduce, Oozie, Flume and Sqoop.
- Responsible for analyzing big data and providing technical expertise and recommendations to improve existing systems.
- Hands-on experience in capacity planning, installation, monitoring and performance tuning of Hadoop clusters.
- Hands-on experience in writing MapReduce programs in Java.
- Involved in finding, evaluating and deploying new Big Data technologies and tools.
- Proficient knowledge of Apache Spark and Apache Storm for processing real-time data.
- Wrote custom UDFs in Java for Hive and Pig, and SerDes for Hive.
- Implemented custom Sinks and Interceptors for Flume.
- Involved in building and evolving a reporting framework on top of the Hadoop cluster to facilitate data mining, predictive modeling, analytics and dashboarding.
- Supported a wide variety of ad hoc data needs.
- Strong ability to prepare and present data in a visually appealing and easy-to-understand manner.
- Built high-volume, real-time data processing applications on the Hadoop platform.
- Imported and exported data into HDFS, Hive and HBase using Sqoop.
- Hands-on experience with full life cycle implementation using HDP (Hortonworks Data Platform).
- Involved in the design and architecture of Big Data solutions using the Hadoop ecosystem.
- Proficient in designing efficient and robust ETL workflows.
- Extensively worked on Informatica PowerCenter 9.1/8.6 Designer, Repository Manager, Workflow Manager and Workflow Monitor, including administration and performance tuning of transformations, mappings, sessions and workflows
- Experience working with large-scale databases and data sources such as Oracle 11g, DB2, XML, MS Excel and flat files
- Strong background in data warehousing concepts and in implementing dimensional modeling.
- Proficient in interacting with business users by conducting client meetings during the requirements analysis phase
- Experience in all the phases of Data warehouse life cycle involving Requirement Analysis, Design, Coding, Testing, and Deployment
- Involved in Performance Tuning and Productivity Improvement activities
- Excellent knowledge of Unit Testing, Regression Testing, Integration Testing, User Acceptance Testing, Production implementation and Maintenance
- Demonstrated ability to communicate and gather requirements, and to partner with Enterprise Architects, Business Users, Analysts and development teams to deliver rapid iterations of complex solutions
TECHNICAL SKILLS:
Environments: Win 95/98, Win NT, Unix, Win XP
Languages: C/C++ and Java/J2EE
Hadoop related Big Data Technologies: Hadoop, MapReduce, HDFS, Pig, Hive, Oozie, Sqoop, Flume, Spark, Storm and ZooKeeper
ETL Tools: Informatica PowerCenter 9.1/8.6, Repository, Designer, Workflow Manager, Workflow Monitor
Databases: Oracle 11g/10g, IBM DB2, SQL Server 2008, MS Access
Version Control: PVCS, Accurev
Tools & Utilities: FileZilla, SPUFI, AbendAid, Endevor, Remedy, QMF and File-Aid
GUI: SQL Developer, SQL Server Management Studio, VB 6.0 and Developer 2000
Mainframe Tools: COBOL, JCL, DB2, CICS, IMS DB and VSAM
PROFESSIONAL EXPERIENCE
Confidential, Dearborn, USA
Hadoop Developer
Responsibilities:
- Handled importing of data from various data sources, performed transformations using Hive, MapReduce, and loaded data into HDFS.
- Participated in design reviews and daily project scrums.
- Worked closely with the business analysts to convert the Business Requirements into Technical Requirements and prepared low and high level documentation.
- Wrote MapReduce jobs for encryption and for converting text data into Avro format.
- Joined raw data with the reference data using Pig scripts.
- Wrote custom UDFs in Hive.
- Extracted data from different databases and copied it into HDFS using Sqoop.
- Wrote a Sqoop incremental import job to move new and updated records from the database to HDFS.
- Created an Oozie coordinator workflow to run the Sqoop incremental job daily.
- Used the Oozie workflow engine to run multiple Hive and Pig jobs.
- Exported the analyzed data into relational databases using Sqoop for visualization and report generation for the BI team.
- Involved in installing and configuring Hive, Pig, Sqoop, Flume and Oozie on the Hadoop cluster.
- Worked with application teams to install operating system and Hadoop updates, patches and version upgrades as required.
- Worked with clients on requirements based on their business needs
- Communicated deliverable status to users, stakeholders and the client, and drove periodic review meetings
- Completed tasks and the project on time and per quality goals
- Good knowledge of HBase.
Environment: Hadoop, HDFS, MapReduce, Hive, Pig, Sqoop, HBase, Oozie, MySQL, RSA editor, Putty, ZooKeeper, Ganglia, UNIX and Shell scripting
Confidential, NY, USA
Hadoop Developer
Responsibilities:
- Worked closely with the business analysts to convert the Business Requirements into Technical Requirements and prepared low and high level documentation.
- Wrote MapReduce jobs for encryption and for converting text data into Avro format.
- Collected log files and copied them into HDFS using Flume
- Wrote MapReduce code to convert unstructured data into structured data and to insert data into HBase from HDFS.
- Experience in integrating Hive with HBase.
- Defined job flows and wrote simple to complex MapReduce jobs as per the requirements.
- Involved in creating Hive tables, loading them with data and writing Hive queries, which run internally as MapReduce jobs.
- Implemented business logic by writing Pig and Hive UDFs for aggregation operations.
- Exported the results into relational databases using Sqoop for visualization and report generation for the BI team.
- Worked closely with the business analysts to convert the Business Requirements into Technical Requirements and to make sure that the correct source table attributes were identified as per Dimensional Data Modeling (Fact Table Attributes and Dimensional Table Attributes).
- Experienced with NoSQL databases and querying them.
- Monitored the health of MapReduce programs running on the cluster.
- Involved in loading data from the UNIX file system to HDFS.
- Installed and configured Hadoop MapReduce, HDFS, Hive, Pig, Sqoop, Flume and Oozie on the Hadoop cluster.
- Joined raw data with the reference data using Pig scripts.
- Handled importing of data from various data sources, performed transformations using Hive and MapReduce, and loaded data into HDFS.
- Wrote custom UDFs in Hive.
- Extracted data from different databases and copied it into HDFS using Sqoop.
- Wrote a Sqoop incremental import job to move new and updated records from the database to HDFS.
- Created an Oozie coordinator workflow to run the Sqoop incremental job daily.
Environment: Hadoop, HDFS, MapReduce, Hive, Pig, Sqoop, HBase, Oozie, DB2, Putty, ZooKeeper, UNIX and Shell scripting
Confidential, Michigan, USA
System Analyst
Responsibilities:
- Involved in source system analysis and business requirement gathering with users
- Worked closely with the team responsible for gathering the reporting needs, as well as ensuring that the sourced data is not already available in the existing data warehouse
- Worked closely with the data modelers to come up with the data model and ensure that it conforms to dimensional modeling reporting needs
- Worked on identifying Mapping Bottlenecks in Source, Target and Mappings to improve Performance
- Implemented various business rules using transformations such as Normalizer, Source Qualifier, Update Strategy, Lookup (connected/unconnected), Sequence Generator, Expression, Aggregator and Stored Procedure
- Involved in performance tuning the ETL processes, testing of stored procedures and functions, testing of Informatica sessions, batches and the target Data
- Coordinated with Business Users to understand business needs and implement the same into a functional Data warehouse design.
- Converted functional specifications into technical specifications (design of Informatica Mappings)
- Worked with source system teams to resolve data quality issues raised by end users.
- Involved in reviewing technical documents, Unit test plans and test cases for QA and System testing
- Requirement analysis and component design including detailed design and resolving design issues
- Providing direction and guidance to a team of developers, including allocation and management of workload by conducting status calls and code reviews
- Worked with clients on requirements based on their business needs
- Communicated deliverable status to users, stakeholders and the client, and drove periodic review meetings
- Completed tasks and the project on time and per quality goals
Environment: Informatica PowerCenter 9.1, Oracle, SQL, PL/SQL, Remedy, SQL Developer, Flat Files, FileZilla, Star/Snowflake Schema, COBOL, JCL, Shell scripting
Confidential
Technical Lead
Responsibilities:
- Involved in source system analysis and business requirement gathering with users
- Worked closely with the team responsible for gathering the reporting needs, as well as ensuring that the sourced data is not already available in the existing data warehouse
- Worked closely with the data modelers to come up with the data model and ensure that it conforms to dimensional modeling reporting needs
- Responsible for end-to-end verification of the gathered requirements and functional specifications, and for producing the technical design document and Source to Target mapping documents.
- Developed and reviewed mappings that extract data from flat file and Oracle sources into the Oracle database
- Served as ETL developer by gathering the requirements, developing the high level designs for the mappings and executing the development with an offshore team
- Closely worked wif the Enterprise business intelligence group to provide design suggestions and review the designs of other projects.
- Worked on identifying Mapping Bottlenecks in Source, Target and Mappings to improve Performance
- Implemented various business rules using transformations such as Normalizer, Source Qualifier, Update Strategy, Lookup (connected/unconnected), Sequence Generator, Expression, Aggregator and Stored Procedure
- Involved in performance tuning the ETL processes, testing of stored procedures and functions, testing of Informatica sessions, batches and the target Data
- Implemented Slowly Changing Dimensions - Type I & II in different mappings as per the requirements
- Coordinated with Business Users to understand business needs and implement the same into a functional Data warehouse design.
- Converted functional specifications into technical specifications (design of Informatica Mappings)
- Worked with source system teams to resolve data quality issues raised by end users.
- Involved in reviewing technical documents, Unit test plans and test cases for QA and System testing
- Requirement analysis and component design including detailed design and resolving design issues
- Providing direction and guidance to a team of developers, including allocation and management of workload by conducting status calls and code reviews
- Worked with clients on requirements based on their business needs
- Communicated deliverable status to users, stakeholders and the client, and drove periodic review meetings
- Completed tasks and the project on time and per quality goals
Environment: Informatica PowerCenter 9.1, Oracle, SQL, PL/SQL, Remedy, SQL Developer, Flat Files, FileZilla, Star/Snowflake Schema, COBOL, JCL, VSAM, DB2, Shell scripting
Confidential, Charlotte
Java Developer
Responsibilities:
- Interacting with the client to define business requirements and scope of the project.
- Creating GUI screens and implementing business logic for them.
- Coding with MVC Infra (proprietary framework) using Java, J2EE, XML and JavaScript
- Maintained, developed and fixed bugs for applications.
- Solid background in Object-Oriented analysis and design.
- Compiling and running the software.
- Writing test plans and test cases for the developed screens in Mercury Quality Center.
- Executing test cases and fixing bugs through unit testing.
- Integrating the module with other modules and deploying them on UNIX.
- Generating daily progress reports
- Coordinating with other programmers in the team to ensure that all the modules complement each other well.
Environment: Java, JSP, Servlets, XML, Rational Rose, Web Services, DB2, Windows XP, Red Hat Linux.
