Hadoop Developer Resume
Oakland, CA
SUMMARY
- Over 8 years of professional IT experience, including 2+ years with Big Data ecosystem technologies such as Hadoop, Pig, Hive, Sqoop, HBase and Cassandra, and 5 years in Oracle PL/SQL development.
- In-depth knowledge of Hadoop architecture and its components, such as HDFS, NameNode, DataNode, JobTracker, ApplicationMaster, ResourceManager and TaskTracker, and of the MapReduce programming paradigm.
- Experience in cluster planning, designing, deploying, performance tuning, administering and monitoring the Hadoop ecosystem.
- Experience importing and exporting data between HDFS and Relational Database Systems (RDBMS) using Sqoop.
- Experience in developing MapReduce jobs to process large data sets.
- Good understanding of cloud configuration in Amazon Web Services (AWS).
- Experience in database design. Used PL/SQL to write stored procedures, functions and triggers, with strong experience in writing complex queries for Oracle 10g/11g/12c.
- Proficient in writing SQL, PL/SQL stored procedures, functions, constraints, packages and triggers.
- Extensive experience working with Oracle, DB2, SQL Server and MySQL databases.
- Good knowledge of Hadoop cluster architecture and cluster monitoring.
- Experience using Hadoop shell commands, writing MapReduce programs and verifying Hadoop log files.
- Exposure to the query programming model of Hadoop.
- Analyzed the Hadoop stack and different big data analytic tools, including Pig, Hive, HBase and Sqoop, with exposure to MongoDB.
- Experience in Object Oriented Analysis and Design (OOAD) and development of software using UML methodology.
- Extensive experience working with Oracle database.
- Proficient in XHTML/HTML and CSS.
- Expertise in writing Packages, Stored Procedures, Functions, Views and Database Triggers using SQL and PL/SQL in Oracle.
- Hands-on experience in the design and development of end-user screens and reports using Oracle Developer/2000 (Forms, Reports), Forms and Reports 9i, Oracle Developer Suite 10g and other front-end tools.
- Worked with query tools like Toad, SQL*Plus, SQL Developer.
- Manipulated Stored Procedures, Triggers, Views, Functions and Packages using TOAD.
- Experience in Performance Tuning & Optimization of SQL statements.
- Good interpersonal and communication skills; an excellent team player and self-starter.
TECHNICAL SKILLS
Big Data Technologies: Hadoop (HDFS, MapReduce, Pig, Hive, HBase, Mahout, Falcon, Oozie, Accumulo, ZooKeeper).
Programming: Hadoop, MapReduce, Scala, Python, Spark, HDFS, Hive, Pig, Java, SQL, PL/SQL, Oracle Forms.
Middleware: Apache Tomcat, Maven, Hortonworks Data Platform.
Databases: Oracle 12c/11g/10g, MongoDB, HBase.
Querying/Reporting: PL/SQL, SQL, Forms
Oracle Tools: ANSI SQL, Oracle PL/SQL, SQL*Plus, TOAD, iSQL*Plus, SQL*Loader, Oracle Procedure Builder
Development Tools: Developer 11g/10g/9i, Forms 6i/9i/10g
Operating Systems: UNIX (Sun Solaris/HP-UX/IBM AIX), Red Hat Linux, Oracle Enterprise Linux and Windows
Tools: SQL TRACE, EXPLAIN PLAN, Eclipse, Toad, Forms D2K, HP ALM, CVS.
PROFESSIONAL EXPERIENCE
Confidential, Oakland, CA
Hadoop Developer
Responsibilities:
- Participated in requirements gathering and business analysis, and translated business requirements into technical designs for Hadoop and Big Data.
- Developed a data pipeline using Flume, Sqoop, Pig and Java MapReduce to ingest behavioral data into HDFS for analysis.
- Imported and exported data between HDFS and databases using Sqoop.
- Wrote Hive jobs to parse the logs and structure them in tabular format to facilitate effective querying of the log data.
- Developed workflows in Control-M to automate loading data into HDFS and preprocessing with Pig.
- Handled cluster coordination services through ZooKeeper.
- Used Maven extensively to build JAR files of MapReduce programs and deployed them to the cluster.
- Created a customized BI tool for the management team that performs query analytics using HiveQL.
- Created partitions and buckets based on state for further processing with bucket-based Hive joins.
- Developed a suite of unit test cases for Mapper, Reducer and Driver classes using the MR testing library.
- Used the Oozie workflow engine to manage interdependent Hadoop jobs and to automate several types of Hadoop jobs, such as Java MapReduce, Hive, Pig and Sqoop.
- Created a data pipeline of MapReduce programs using chained mappers.
- Implemented an optimized join of different data sets to get top claims by state using MapReduce.
- Implemented MapReduce programs in Java to perform map-side joins using the Distributed Cache.
- Modeled Hive partitions extensively for data separation and faster data processing, and followed Pig and Hive best practices for tuning.
Environment: RHEL, HDFS, MapReduce, Hive, Pig, Sqoop, Flume, Oozie, Mahout, HBase.
Confidential, NYC, NY
Hadoop Developer
Responsibilities:
- Responsible for building scalable distributed data solutions using Hadoop.
- Installed and configured Hive, Pig, Sqoop, Flume and Oozie on the Hadoop cluster.
- Developed simple to complex MapReduce jobs using Hive and Pig.
- Optimized MapReduce jobs to use HDFS efficiently by using various compression mechanisms.
- Imported data from various data sources, performed transformations using Hive and MapReduce, loaded data into HDFS, and extracted data from MySQL into HDFS using Sqoop.
- Analyzed the data by running Hive queries and Pig scripts to study customer behavior.
- Implemented business logic by writing UDFs in Java and used various UDFs from Piggybank and other sources.
- Continuously monitored and managed the Hadoop cluster using Cloudera Manager.
- Worked with application teams to install operating system and Hadoop updates, patches and version upgrades as required.
- Developed MapReduce programs in Java to process data stored in HDFS.
- Loaded business account and customer details into HDFS from an RDBMS using Sqoop.
- Involved in schema design for the application using Hive and HBase.
- Extracted claims for Life and Dental EDI data into XML.
- Processed XML into key-value pairs using metadata.
- Installed and configured Hadoop MapReduce and HDFS (non-production environment).
- Coordinated with other teams for each quarterly deployment and deployed new functionality to the production environment.
- Analyzed the Hadoop stack and different big data analytic tools, including Pig, Hive, HBase and Sqoop.
Environment: Hadoop, HDFS, Hive, Pig, Sqoop, HBase, Hue, Linux, MapReduce, Cloudera distribution of Hadoop 3 and Flume.
Confidential
Oracle PL/SQL Developer
Responsibilities:
- Involved in design and development phases of Software Development Life Cycle (SDLC) using Scrum methodology.
- Generated server side PL/SQL scripts for data manipulation and validation.
- Worked with the business to automate many daily reports using Unix shell scripts.
- Used Bulk Collections for better performance and easy retrieval of data, by reducing context switching between SQL and PL/SQL engines.
- Coded many generic routines (as functions), which could be called from other procedures.
- Worked on change requests for the user interface.
- Developed various reports for end users as per their requirements and created many reports to suit the company's preprinted formats.
- Created user-defined exceptions for exception handling.
- Wrote stored procedures, functions and database triggers using PL/SQL.
- Designed, developed and maintained data extraction and transformation processes and ensured that data was properly loaded into and extracted from our systems.
- Identified and implemented programming enhancements.
- Wrote unit test cases for the modules to verify functionality against the requirements.
Environment: Oracle 11g, SQL, PL/SQL, .NET Framework, Visual Studio, Toad, HP ALM
Confidential
Oracle PL/SQL Developer
Responsibilities:
- Participated in the SDLC, gathering requirements from end users.
- Developed views to facilitate easy interface implementation and enforce security on critical customer information.
- Participated in GUI design using Oracle Developer 10g (Forms 10g and Reports 10g).
- Developed stored procedures and triggers to facilitate consistent data entry into the database.
- Wrote stored procedures in PL/SQL, along with functions and procedures for common utilities.
- Participated in system analysis and data modeling, which included creating tables, views, indexes, synonyms, triggers, functions, procedures, cursors and packages.
- Wrote code using advanced concepts such as records, collections and dynamic SQL.
- Developed database triggers for audit and validation purposes.
- Used PL/SQL to validate data and to populate billing tables.
- Developed installation scripts for all the deliverables.
- Performed functional testing for different Oracle Forms application functionalities.
- Created and manipulated stored procedures, functions, packages and triggers using TOAD.
- Wrote complex stored procedures using dynamic SQL to populate temp tables from fact and dimension tables.
- Participated in migrating the database from Oracle 9i to 10g.
- Involved in developing screens and generating reports.
- Developed Forms and Reports.
Environment: Oracle 9i/10g, SQL, PL/SQL, Forms 9i, SQL*Loader, SQL Navigator, Toad, HP ALM.
Confidential
Oracle PL/SQL Developer
Responsibilities:
- Created and maintained PL/SQL scripts and stored procedures.
- Coded many generic routines (as functions), which could be called from other procedures.
- Developed user interface screens, master-detail relations and report screens.
- Developed various reports for end users as per their requirements and created many reports to suit the company's preprinted formats.
- Created user-defined exceptions for exception handling.
- Wrote stored procedures, functions and database triggers using PL/SQL.
- Conducted training sessions for the end-users.
- Designed, developed and maintained data extraction and transformation processes and ensured that data was properly loaded into and extracted from our systems.
- Identified and implemented programming enhancements.
Environment: Oracle 9i/10g, SQL*Plus, Forms D2K, SharePoint, CVS.
