Hadoop Developer Resume

PROFESSIONAL SUMMARY:

7+ years experience inSoftware Development Life Cycle (SDLC) and Software Engineering including Requirement Gathering, Analyzing, Designing, Implementing, Testing, Support and Maintenance
Have experience with Hadoop Ecosystem including HDFS, MapReduce, PIG, HIVE and HBase.
Haveexperience in installing, configuring and administrating Hadoop cluster for major Hadoop distributions.
Have experience in working with Hadoop in stand - alone, pseudo and distributed modes.
Experience in importing and exporting data from different RDBMS like MySql, Oracle and SQL Server into HDFS and Hive using Sqoop.
Experience in developing custom Map-Reduce programs using Apache Hadoop to perform Data Transformation and analysis as per requirement.
Strong competency in HIVE Schema design, Data imports and Analysis.
Hands-on experience in writing Pig Latin scripts, working with grunt shell and job scheduling with Oozie.
Experience in Design & Development, tuning and maintenance of NoSQL databases.
Experience in working with NoSQL databases like HBase.
Strong understanding of Data Modeling and experience with Data Cleansing, Data Profiling and Data analysis.
Excellent experience in ETL analysis, designing, developing, testing and implementing ETL processes including performance tuning and query optimizing of databases.
Excellent experience in extracting source data from Sequential files, XML files, Excel files, transforming and loading it into the target data warehouse.
Expertise in Java/J2EE technologies such as Core Java, JDBC, HTML, JavaScript, JSON
Have knowledge of Python programming.
Proficiency in programming with different IDE's like Eclipse, NetBeans and Canopy.
Involved in database design, creating Tables, Views, Stored Procedures, Functions, Triggers and Indexes.
Have hands on experience on various DB platforms like Oracle, MySQL, DB2 and MS SQL Server.
Experience in deploying applications in heterogeneous application servers TOMCAT and WebLogic.
Good understanding of service oriented architecture (SOA) and web services like XML, XSD, XSDL, SOAP
Experience in object oriented analysis and design (OOAD), unified modeling language (UML) and Agile Methodologies.
Strong communication skills with professional attitude and can take the pressures to drive with enthusiasm to support with full potential.

TECHNICAL SKILLS:

Hadoop Ecosystem: HDFS, MapReduce, Hive, Pig, Zookeeper, Sqoop, Oozie, Flume

Languages: C, Java, PigLatin, Shell Scripting

Operating Systems: Windows Variants,UNIX, LINUX

Database: Oracle 11g/10g/9i, MySQL, Postgresql

IDE Tools: Eclipse, Net Beans, SQL Developer, MS Visual Studio

Version Control: Git

Software Tools: MS Office Suite(Word, Excel, Project),MS Visio, Rational Rose, Apache Maven

Web Technologies: HTML, CSS, XML, JSP

Monitoring Tools: Ganglia, Nagios and Cloudera Manager, Ops Center

Application Servers: Apache Tomcat

WORK EXPERIENCE:

Hadoop Developer

Confidential

Responsibilities:

Used Sqoop to transfer data between RDBMS and HDFS.
Involved in collecting and aggregating large amounts of streaming data into HDFS using Flume and defined channel selectors to multiplex data into different sinks.
Implemented complex map reduce programs to perform map side joins using distributed cache
Designed and implemented custom writable, custom input formats, custom partitions and custom comparators in Mapreduce.
Thoroughly tested Mapreduce programs using MRUnit and Junit testing frameworks.
Responsible for troubleshooting issues in the execution of Mapreduce jobs by inspecting and reviewing log files
Converted existing SQL queries into Hive QL queries.
Implemented UDFs, UDAFs, UDTFs in java for hive to process the data that can’t be performed usingHive inbuilt functions
Effectively used Oozie to develop automatic workflows of Sqoop, Mapreduce and Hive jobs.
Exported the analyzed data into relational databases using Sqoop for visualization and to generate reports for the BI team.
Gatheird the business requirements from the Business Partners and Subject Matter Experts.
Utilized Agile Scrum Methodology to help manage and organize a team of 4 developers with regular code review sessions.
Weekly meetings with technical collaborators and active participation in code review sessions with senior and junior developers.
Loaded and analyzed Omniture logs generated by different web applications.
Loaded and transformed large sets of structured, semi structured and unstructured data in various formats like text, zip, XML and JSON.
Refined the Website clickstream data from Omniture logs and moved it into Hive.
Written multiple MapReduce programs to power data for extraction, transformation and aggregation from multiple file formats including XML, JSON, CSV & other compressed file formats.
Defined job flows and developed simple to complex Map Reduce jobs as per the requirement.
Optimized Map/Reduce Jobs to use HDFS efficiently by using various compression mechanisms.
Developed PIG UDFs for manipulating the data according to Business Requirements and also worked on developing custom PIG Loaders.
Responsible for creating Hive tables based on business requirements
Implemented Partitioning, Dynamic Partitions and Buckets in HIVE for efficient data access.
Involved in NoSQL database design, integration and implementation.
Loaded data into NoSQL database HBase

BigData Engineer

Confidential, CA.

Responsibilities:

Analyzed the Big Data business requirements and transformed it into Hadoop centric technologies.
Built Hadoop cluster ensuring High availability for Namenode, mixed-workload management, performance optimization, health monitoring, backup and recoveryacross one or more nodes
Worked with Big data Analysts, designers and scientists in troubleshooting issues with Sqoop, Hive, Pig, Flume etc.
Used compression codecs to compress data in HDFSImported data into HDFS using Sqoop.
Experience in retrieving data from databases like MYSQL and Oracle into HDFS using Sqoop and ingesting them into HBase.
Developed Hive Queries to analyze the data in HDFS to identify issues and behavioral patterns.
Worked on shell scripting to automate jobs.
Used PigLatin to analyze datasets and perform transformation according to business requirements.
Configured Nagios for receiving alerts on critical failures in the cluster by integrating with custom Shell Scripts.
Configured the Ganglia monitoring tool to monitor both Hadoop and system specific metrics.
Worked on implementing Flume to import streaming data logs and aggregating the data to HDFS through Flume.
Implemented MapReduce programs to perform joins using secondary sorting and distributed cache.

Application Developer

Confidential

Responsibilities:

Key responsibilities included requirements gathering, designing and developing the applications.
Implemented design patterns and OO design concepts to build the code.
Participated in planning and development of UML diagrams like Use Case Diagrams, Object Diagrams, Class Diagrams and Sequence Diagrams to represent the detail design phase.
Identified and fixed transactional issues due to incorrect exceptional handling and concurrency issues due to unsynchronized block of code.
Created module for providing authentication to the users for using dis application and to synchronize handset with the Exchange server.
Performed unit testing, system testing and user acceptance test.

Confidential

Java Developer

Responsibilities:

Involved in design phase meetings for Business Analysis and Requirements gathering.
Worked with business functional lead to review and finalize requirements and data profiling analysis.
Extracted data from source systems SQL Server and Sequential files, Flat files.
Created DataStage jobs, batches and job sequences and tuned them for better performance.
Designed various Mappings (Source-to-Target) using DataStage to link between different source systems and Data warehouse for loading data into Warehouse.
Used Shared Containers for code reuse and implementing complex business logic

Confidential

Java Developer

Responsibilities:

Interacting with the system analysts & business users for design & requirement clarification.
Used J2EE Design Patterns like Session Façade pattern.
Used Struts Framework to deal with functionalities like credit issues, invoice details. Used Spring DAO Module along with Struts Framework to deal with Database.
Spring MVC Framework IOC(Inversion Of Control) design pattern is used to have relationships between application components.
Developed, and debugged the servlets and EJB with Jboss Application server
Developed JSPs according to requirement
Used Connection Pooling to get JDBC connection and access database procedures.
Wrote complex SQL queries to retrieve invoice details and other financial elements etc.
Used Log4j as a debugging tool.

We provide IT Staff Augmentation Services!

We'd love your feedback!

Resume Categories

Client Services

Job Seekers

Visa Sponsorship