We provide IT Staff Augmentation Services!

Hadoop Developer Resume

0/5 (Submit Your Rating)

PROFESSIONAL SUMMARY:

  • 7+ years experience inSoftware Development Life Cycle (SDLC) and Software Engineering including Requirement Gathering, Analyzing, Designing, Implementing, Testing, Support and Maintenance
  • Have experience with Hadoop Ecosystem including HDFS, MapReduce, PIG, HIVE and HBase.
  • Haveexperience in installing, configuring and administrating Hadoop cluster for major Hadoop distributions.
  • Have experience in working with Hadoop in stand - alone, pseudo and distributed modes.
  • Experience in importing and exporting data from different RDBMS like MySql, Oracle and SQL Server into HDFS and Hive using Sqoop.
  • Experience in developing custom Map-Reduce programs using Apache Hadoop to perform Data Transformation and analysis as per requirement.
  • Strong competency in HIVE Schema design, Data imports and Analysis.
  • Hands-on experience in writing Pig Latin scripts, working with grunt shell and job scheduling with Oozie.
  • Experience in Design & Development, tuning and maintenance of NoSQL databases.
  • Experience in working with NoSQL databases like HBase.
  • Strong understanding of Data Modeling and experience with Data Cleansing, Data Profiling and Data analysis.
  • Excellent experience in ETL analysis, designing, developing, testing and implementing ETL processes including performance tuning and query optimizing of databases.
  • Excellent experience in extracting source data from Sequential files, XML files, Excel files, transforming and loading it into the target data warehouse.
  • Expertise in Java/J2EE technologies such as Core Java, JDBC, HTML, JavaScript, JSON
  • Have knowledge of Python programming.
  • Proficiency in programming with different IDE's like Eclipse, NetBeans and Canopy.
  • Involved in database design, creating Tables, Views, Stored Procedures, Functions, Triggers and Indexes.
  • Have hands on experience on various DB platforms like Oracle, MySQL, DB2 and MS SQL Server.
  • Experience in deploying applications in heterogeneous application servers TOMCAT and WebLogic.
  • Good understanding of service oriented architecture (SOA) and web services like XML, XSD, XSDL, SOAP
  • Experience in object oriented analysis and design (OOAD), unified modeling language (UML) and Agile Methodologies.
  • Strong communication skills with professional attitude and can take the pressures to drive with enthusiasm to support with full potential.

TECHNICAL SKILLS:

Hadoop Ecosystem: HDFS, MapReduce, Hive, Pig, Zookeeper, Sqoop, Oozie, Flume

Languages: C, Java, PigLatin, Shell Scripting

Operating Systems: Windows Variants,UNIX, LINUX

Database: Oracle 11g/10g/9i, MySQL, Postgresql

IDE Tools: Eclipse, Net Beans, SQL Developer, MS Visual Studio

Version Control: Git

Software Tools: MS Office Suite(Word, Excel, Project),MS Visio, Rational Rose, Apache Maven

Web Technologies: HTML, CSS, XML, JSP

Monitoring Tools: Ganglia, Nagios and Cloudera Manager, Ops Center

Application Servers: Apache Tomcat

WORK EXPERIENCE:

Hadoop Developer

Confidential

Responsibilities:

  • Used Sqoop to transfer data between RDBMS and HDFS.
  • Involved in collecting and aggregating large amounts of streaming data into HDFS using Flume and defined channel selectors to multiplex data into different sinks.
  • Implemented complex map reduce programs to perform map side joins using distributed cache
  • Designed and implemented custom writable, custom input formats, custom partitions and custom comparators in Mapreduce.
  • Thoroughly tested Mapreduce programs using MRUnit and Junit testing frameworks.
  • Responsible for troubleshooting issues in the execution of Mapreduce jobs by inspecting and reviewing log files
  • Converted existing SQL queries into Hive QL queries.
  • Implemented UDFs, UDAFs, UDTFs in java for hive to process the data that can’t be performed usingHive inbuilt functions
  • Effectively used Oozie to develop automatic workflows of Sqoop, Mapreduce and Hive jobs.
  • Exported the analyzed data into relational databases using Sqoop for visualization and to generate reports for the BI team.
  • Gatheird the business requirements from the Business Partners and Subject Matter Experts.
  • Utilized Agile Scrum Methodology to help manage and organize a team of 4 developers with regular code review sessions.
  • Weekly meetings with technical collaborators and active participation in code review sessions with senior and junior developers.
  • Loaded and analyzed Omniture logs generated by different web applications.
  • Loaded and transformed large sets of structured, semi structured and unstructured data in various formats like text, zip, XML and JSON.
  • Refined the Website clickstream data from Omniture logs and moved it into Hive.
  • Written multiple MapReduce programs to power data for extraction, transformation and aggregation from multiple file formats including XML, JSON, CSV & other compressed file formats.
  • Defined job flows and developed simple to complex Map Reduce jobs as per the requirement.
  • Optimized Map/Reduce Jobs to use HDFS efficiently by using various compression mechanisms.
  • Developed PIG UDFs for manipulating the data according to Business Requirements and also worked on developing custom PIG Loaders.
  • Responsible for creating Hive tables based on business requirements
  • Implemented Partitioning, Dynamic Partitions and Buckets in HIVE for efficient data access.
  • Involved in NoSQL database design, integration and implementation.
  • Loaded data into NoSQL database HBase

BigData Engineer

Confidential, CA.

Responsibilities:

  • Analyzed the Big Data business requirements and transformed it into Hadoop centric technologies.
  • Built Hadoop cluster ensuring High availability for Namenode, mixed-workload management, performance optimization, health monitoring, backup and recoveryacross one or more nodes
  • Worked with Big data Analysts, designers and scientists in troubleshooting issues with Sqoop, Hive, Pig, Flume etc.
  • Used compression codecs to compress data in HDFSImported data into HDFS using Sqoop.
  • Experience in retrieving data from databases like MYSQL and Oracle into HDFS using Sqoop and ingesting them into HBase.
  • Developed Hive Queries to analyze the data in HDFS to identify issues and behavioral patterns.
  • Worked on shell scripting to automate jobs.
  • Used PigLatin to analyze datasets and perform transformation according to business requirements.
  • Configured Nagios for receiving alerts on critical failures in the cluster by integrating with custom Shell Scripts.
  • Configured the Ganglia monitoring tool to monitor both Hadoop and system specific metrics.
  • Worked on implementing Flume to import streaming data logs and aggregating the data to HDFS through Flume.
  • Implemented MapReduce programs to perform joins using secondary sorting and distributed cache.

Application Developer

Confidential

Responsibilities:

  • Key responsibilities included requirements gathering, designing and developing the applications.
  • Implemented design patterns and OO design concepts to build the code.
  • Participated in planning and development of UML diagrams like Use Case Diagrams, Object Diagrams, Class Diagrams and Sequence Diagrams to represent the detail design phase.
  • Identified and fixed transactional issues due to incorrect exceptional handling and concurrency issues due to unsynchronized block of code.
  • Created module for providing authentication to the users for using dis application and to synchronize handset with the Exchange server.
  • Performed unit testing, system testing and user acceptance test.

Confidential

Java Developer

Responsibilities:

  • Involved in design phase meetings for Business Analysis and Requirements gathering.
  • Worked with business functional lead to review and finalize requirements and data profiling analysis.
  • Extracted data from source systems SQL Server and Sequential files, Flat files.
  • Created DataStage jobs, batches and job sequences and tuned them for better performance.
  • Designed various Mappings (Source-to-Target) using DataStage to link between different source systems and Data warehouse for loading data into Warehouse.
  • Used Shared Containers for code reuse and implementing complex business logic

Confidential

Java Developer

Responsibilities:

  • Interacting with the system analysts & business users for design & requirement clarification.
  • Used J2EE Design Patterns like Session Façade pattern.
  • Used Struts Framework to deal with functionalities like credit issues, invoice details. Used Spring DAO Module along with Struts Framework to deal with Database.
  • Spring MVC Framework IOC(Inversion Of Control) design pattern is used to have relationships between application components.
  • Developed, and debugged the servlets and EJB with Jboss Application server
  • Developed JSPs according to requirement
  • Used Connection Pooling to get JDBC connection and access database procedures.
  • Wrote complex SQL queries to retrieve invoice details and other financial elements etc.
  • Used Log4j as a debugging tool.

We'd love your feedback!