Hadoop Consultant Resume
CA
SUMMARY
- 7 years of experience in designing and developing JAVA application using JAVA technologies, System Analysis, Technical Design, Implementation, Performance Tuning and Testing.
- Around 3 years of experience with Cloudera Hadoop Ecosystems including HDFS, Map Reduce (MRV1 and understanding of YARN) and Hadoop tools (Pig, Hive, HBase, Sqoop, Zookeeper, Oozie).
- Excellent Hands on Experience in developing Hadoop Architecture within the project in Windows and Linux platforms.
- Analyzing client’s Big Data requirement and transforming it in to the Hadoop eco system with account of performance bottlenecks and tunings on the existing Hadoop infrastructure.
- Proficient in Data migration from existing DBMS or RDBMS to Hadoop files system using Sqoop.
- Experience in handling variety of data sets from customer which includes structured as well as unstructured data using HDFS and HBase.
- Developed map programs to perform data transition and analysis using java, hive and pig.
- Excellent Scripting Skills in Pig and Hive Systems.
- Good Experience in Table Partitioning in Hive and Parameter passing in Pig.
- Built libraries, user defined functions, and frameworks around Hadoop Ecosystems.
- Excellent hands on experience in handling 10TB data/week and successfully implanted in Production Cluster running 100 nodes.
- Experience writing Java Map Reduce Jobs, HIVEQL for Data Architects, Data Scientists.
- Good Experience in data loading from Oracle and MYSQL databases to HDFS system using Sqoop (Structure Data) and Flume (Log Files & XML).
- Defining job flows in Hadoop environment using tools like Oozie by using capacity and fair scheduler.
- Understanding of Cluster coordination services through Zookeeper.
- Understanding of loading a streaming data directly in to HDFS using Flume.
- Good understanding of SPARK with python programming.
- Preparation of proof of concept about Hadoop and micro services.
- Working knowledge of database such as Oracle, MySql.
- Strong experience in database design, writing complex MySQL Queries and Stored Procedures
- Good understanding of OLTP and OLAP processing and data warehousing concepts
- Good understanding of XML methodologies (XML, XSL, XSD) including Web Services and SOAP
- Strong problem solving skills, good communication, interpersonal skills and a good team player
- Ability to work in a fast changing environment and learn new technologies effortlessly
TECHNICAL SKILLS
Hadoop Ecosystem: Hive, HBase, Sqoop, Oozie, Flume, PIG, MapReduce, Zookeeper, SPARK, Ambari, HUE
Big data Platform: Cloudera Hadoop CDH 3/4/5, MapReduce(MRV1,MRV2/YARN),Hortonworks sandbox
ETL/BI tools: MySQL, Oracle 11g, SQL sever
Programming skills: C, C++, Python, JAVA,PIG Latin
Java IDE: Eclipse IDE
Academic experience: OOCP, Data structures, Algorithm development
Parallel Programming: Message passing interface
Operating system: Windows 7,8.1,10, Linux(Ubuntu), Cent OS 6.2
PROFESSIONAL EXPERIENCE
Confidential, CA
Hadoop Consultant
Responsibilities:
- Developed data pipeline using Pig and Hive for data coming from EDW to HDFS.
- Effectively used Sqoop to transfer data between EDW and HDFS.
- Designed workflow by scheduling Hive processes for Log file data streamed into HDFS using Flume for real time access
- Designed Hive table layout, and loading and analyzing of data using HiveQL.
- Developed MapReduce programs to cleanse the data in HDFS obtained from heterogeneous data sources to make it suitable for ingestion into Hive schema for analysis.
- Developed hive scripts and UDF’s as per the Business logic.
- Developed shell scripts to perform automation of batch scheduling
- Experienced in supporting users with the cases using Hadoop cluster.
- Developed Map Reduce Input format to read Confidential specific data format.
- Performance tuning of Hive Queries written by data analysts.
- Used Sqoop to efficiently Transfer data from DB2 to HDFS and vice versa.
- Designed and Developed Oozie workflows, integration with Pig.
- Exported the analyzed data to HBase using Sqoop and to generate reports for the BI team.
- Monitoring and Debugging Hadoop jobs/Applications running in production using HUE as GUI.
- Worked on Hadoop CDH upgrade from CDH4.x to CDH5.x.
- Worked on Providing User support and application support on Hadoop Infrastructure.
- Worked on Evaluating, comparing different tools for test data management with Hadoop.
- Helping testing team to get up to speed on Hadoop Application testing.
- Worked on Installing 20 node UAT Hadoop cluster.
Environment: Cloudera Hadoop, CDH 4/5, Hive, Pig, MapReduce, Oozie, Sqoop, Flume, Eclipse IDE, EDW, JAVA, Shell, Linux.
Confidential, El Segundo, CA
Hadoop Consultant
Responsibilities:
- Coordinated with business customers to gather business requirements and migrated the existing data toHadoopfrom RDBMS (MySQL) using Sqoop for processing the data.
- Used Cloudera distribution of Hadoop CDH 3/4
- Ingestion of log data to HDFS using Flume.
- Analyzed click stream data using Hadoop components Hive and Pig by performing joins and filter.
- Involved in creating managed and external Hive Tables, loading data, generating partitions and buckets and User Defined Functions for optimizing the categorical distribution over ingested data.
- Design and implementation of MapReduce jobs to support distributed data processing to process large data sets utilizing Hadoop cluster.
- Developed MapReduce programs to cleanse and parse data in HDFS obtained from various data sources and to perform joins on the Map side.
- Created Hive tables and working on them using Hive QL.
- Created concurrent access for hive tables with shared and exclusive locking that can be enabled in hive with the help of Zookeeper implementation in the cluster.
- Exported the business required information to RDBMS from HDFS using Sqoop to make the data available for BI team to generate reports.
- Implemented daily workflow for extraction, processing and analysis of data with Oozie.
Environment: Java 1.6, Hadoop 2.0.0, MapReduce, HDFS, Sqoop 1.4.3,Hive 0.10.0, Pig 0.11.0, Linux, XML, Eclipse Juno service, Cloudera - CDH3/4 Distribution, Oracle 11g, MySQL, HBase 0.94.6, Oozie
Confidential, Phoenix, Arizona
Hadoop Developer
Responsibilities:
- Developed and Supported MapReduce Programs those are running on the cluster.
- Created Hive tables and working on them using Hive QL.
- Handled 2 TB of data volume and implemented the same in Production.
- Weekly meetings with technical collaborators and active participation in code review sessions with senior and junior developers.
- Responsible to manage heterogeneous data coming from different sources using Sqoop.
- Supported HBase Architecture Design with the Hadoop Architect team to develop a Database Design in HDFS.
- Involved in HDFS maintenance and loading of structured and unstructured data.
- Wrote Hive queries for data analysis to meet the business requirements.
- Installed and configured Pig and also written Pig Latin scripts.
- Developed Scripts and Batch Job to schedule various Hadoop Program.
- Upgrading the Hadoop Cluster from CDH3 to CDH4 and setup High availability Cluster Integrate the HIVE with existing applications.
- Continuous monitoring and managing the Hadoop cluster through Cloudera Manager.
- Installed Oozie workflow engine to run multiple Hive and Pig jobs.
- Developed Hive queries to process the data and generate the data cubes for visualizing
Environment: cloudera Hadoop, CDH 3/4, Hive, Pig, Map Reduce, Oozie, Sqoop, Eclipse, Hue, MySql, JAVA,Shell,Linux.
Confidential
Application Developer
Responsibilities:
- Responsible for gathering and analyzing requirements and converting them into technical specifications
- Used Rational Rose for creating sequence and class diagrams
- Developed presentation layer using Java, HTML and JavaScript
- Used Spring Core Annotations for Dependency Injection
- Performed Performance Tuning activities using SQL scripts. Involved in scripts preparation using SQL. Designed and developed Hibernate configuration and session-per-request design pattern for making database connectivity and accessing the session for database transactions respectively.
- Participated in the design and development of database schema and Entity-Relationship diagrams of the backend Oracle database tables for the application
- Implemented web services with Apache Axis
- Designed and Developed Stored Procedures, Triggers in Oracle to cater the needs for the entire application. Developed complex SQL queries for extracting data from the database
- Handling the all types of issues of PRIME and ONLINE application like Interest levied wrongly transaction related, installment plans, statement not generating, reward points not getting properly, EMI conversion, product change etc.
Environment: SQL server, Apache Axis, Rational Rose XDE, Spring2.5, Notepad++, Eclipse
Confidential
Java Developer
Responsibilities:
- Developed code for handling exceptions and converting them into Action Messages.
- Used JavaScript for validations and other checking functionality for the UI screens.
- Involved in Struts Based Validation
- Involved in Personal Information module.
- Followed agile and SCRUM methodology for project implementation.
- Designed and developed the user interface layer using Java Script, Ajax, HTML, CSS.
- Used HTML to control the display, position HTML elements and to handle events in the user interface.
- Used JavaScript objects to handle events on text boxes, forms to call business logic.
- Involved in resolving business technical issues.
- Written the JUNIT test cases for the functionalities.
- Developed and tuned the database SQL queries.
- Used Eclipse IDE web application server in development.
Environment: Java/J2EE,Spring 3, Oracle 10g, JavaScript, CSS, AJAX, JUnit, Log4j,SOAP Web Services, Restful Web Services, Eclipse IDE.
Confidential
Software engineer
Responsibilities:
- This project covered hospital functions, management activities and decision-making. It provides all-round and all-angle support for the modern hospital.
- Worked on outpatient registration module and emergency registration module
- Made all requests and processes controlled by the system.
- Used interface-oriented programming manner improving flexibility and expandability of the System
- Build Database and tables according to client requirements
- Debugged and fixed the problems that were found during the different phases of the project
- Maintenance of the Database and the Systems, also updated the System from time to time.
Environment: Java, JDBC, HTML, JavaScript, Eclipse IDE, Windows 2000/7, oracle database, Microsoft Excel.
