Hadoop Developer Resume
PROFESSIONAL SUMMARY:
- 7+ years experience inSoftware Development Life Cycle (SDLC) and Software Engineering including Requirement Gathering, Analyzing, Designing, Implementing, Testing, Support and Maintenance
- Have experience with Hadoop Ecosystem including HDFS, MapReduce, PIG, HIVE and HBase.
- Haveexperience in installing, configuring and administrating Hadoop cluster for major Hadoop distributions.
- Have experience in working with Hadoop in stand - alone, pseudo and distributed modes.
- Experience in importing and exporting data from different RDBMS like MySql, Oracle and SQL Server into HDFS and Hive using Sqoop.
- Experience in developing custom Map-Reduce programs using Apache Hadoop to perform Data Transformation and analysis as per requirement.
- Strong competency in HIVE Schema design, Data imports and Analysis.
- Hands-on experience in writing Pig Latin scripts, working with grunt shell and job scheduling with Oozie.
- Experience in Design & Development, tuning and maintenance of NoSQL databases.
- Experience in working with NoSQL databases like HBase.
- Strong understanding of Data Modeling and experience with Data Cleansing, Data Profiling and Data analysis.
- Excellent experience in ETL analysis, designing, developing, testing and implementing ETL processes including performance tuning and query optimizing of databases.
- Excellent experience in extracting source data from Sequential files, XML files, Excel files, transforming and loading it into the target data warehouse.
- Expertise in Java/J2EE technologies such as Core Java, JDBC, HTML, JavaScript, JSON
- Have knowledge of Python programming.
- Proficiency in programming with different IDE's like Eclipse, NetBeans and Canopy.
- Involved in database design, creating Tables, Views, Stored Procedures, Functions, Triggers and Indexes.
- Have hands on experience on various DB platforms like Oracle, MySQL, DB2 and MS SQL Server.
- Experience in deploying applications in heterogeneous application servers TOMCAT and WebLogic.
- Good understanding of service oriented architecture (SOA) and web services like XML, XSD, XSDL, SOAP
- Experience in object oriented analysis and design (OOAD), unified modeling language (UML) and Agile Methodologies.
- Strong communication skills with professional attitude and can take the pressures to drive with enthusiasm to support with full potential.
TECHNICAL SKILLS:
Hadoop Ecosystem: HDFS, MapReduce, Hive, Pig, Zookeeper, Sqoop, Oozie, Flume
Languages: C, Java, PigLatin, Shell Scripting
Operating Systems: Windows Variants,UNIX, LINUX
Database: Oracle 11g/10g/9i, MySQL, Postgresql
IDE Tools: Eclipse, Net Beans, SQL Developer, MS Visual Studio
Version Control: Git
Software Tools: MS Office Suite(Word, Excel, Project),MS Visio, Rational Rose, Apache Maven
Web Technologies: HTML, CSS, XML, JSP
Monitoring Tools: Ganglia, Nagios and Cloudera Manager, Ops Center
Application Servers: Apache Tomcat
WORK EXPERIENCE:
Hadoop Developer
Confidential
Responsibilities:
- Used Sqoop to transfer data between RDBMS and HDFS.
- Involved in collecting and aggregating large amounts of streaming data into HDFS using Flume and defined channel selectors to multiplex data into different sinks.
- Implemented complex map reduce programs to perform map side joins using distributed cache
- Designed and implemented custom writable, custom input formats, custom partitions and custom comparators in Mapreduce.
- Thoroughly tested Mapreduce programs using MRUnit and Junit testing frameworks.
- Responsible for troubleshooting issues in the execution of Mapreduce jobs by inspecting and reviewing log files
- Converted existing SQL queries into Hive QL queries.
- Implemented UDFs, UDAFs, UDTFs in java for hive to process the data that can’t be performed usingHive inbuilt functions
- Effectively used Oozie to develop automatic workflows of Sqoop, Mapreduce and Hive jobs.
- Exported the analyzed data into relational databases using Sqoop for visualization and to generate reports for the BI team.
- Gatheird the business requirements from the Business Partners and Subject Matter Experts.
- Utilized Agile Scrum Methodology to help manage and organize a team of 4 developers with regular code review sessions.
- Weekly meetings with technical collaborators and active participation in code review sessions with senior and junior developers.
- Loaded and analyzed Omniture logs generated by different web applications.
- Loaded and transformed large sets of structured, semi structured and unstructured data in various formats like text, zip, XML and JSON.
- Refined the Website clickstream data from Omniture logs and moved it into Hive.
- Written multiple MapReduce programs to power data for extraction, transformation and aggregation from multiple file formats including XML, JSON, CSV & other compressed file formats.
- Defined job flows and developed simple to complex Map Reduce jobs as per the requirement.
- Optimized Map/Reduce Jobs to use HDFS efficiently by using various compression mechanisms.
- Developed PIG UDFs for manipulating the data according to Business Requirements and also worked on developing custom PIG Loaders.
- Responsible for creating Hive tables based on business requirements
- Implemented Partitioning, Dynamic Partitions and Buckets in HIVE for efficient data access.
- Involved in NoSQL database design, integration and implementation.
- Loaded data into NoSQL database HBase
BigData Engineer
Confidential, CA.
Responsibilities:
- Analyzed the Big Data business requirements and transformed it into Hadoop centric technologies.
- Built Hadoop cluster ensuring High availability for Namenode, mixed-workload management, performance optimization, health monitoring, backup and recoveryacross one or more nodes
- Worked with Big data Analysts, designers and scientists in troubleshooting issues with Sqoop, Hive, Pig, Flume etc.
- Used compression codecs to compress data in HDFSImported data into HDFS using Sqoop.
- Experience in retrieving data from databases like MYSQL and Oracle into HDFS using Sqoop and ingesting them into HBase.
- Developed Hive Queries to analyze the data in HDFS to identify issues and behavioral patterns.
- Worked on shell scripting to automate jobs.
- Used PigLatin to analyze datasets and perform transformation according to business requirements.
- Configured Nagios for receiving alerts on critical failures in the cluster by integrating with custom Shell Scripts.
- Configured the Ganglia monitoring tool to monitor both Hadoop and system specific metrics.
- Worked on implementing Flume to import streaming data logs and aggregating the data to HDFS through Flume.
- Implemented MapReduce programs to perform joins using secondary sorting and distributed cache.
Application Developer
Confidential
Responsibilities:
- Key responsibilities included requirements gathering, designing and developing the applications.
- Implemented design patterns and OO design concepts to build the code.
- Participated in planning and development of UML diagrams like Use Case Diagrams, Object Diagrams, Class Diagrams and Sequence Diagrams to represent the detail design phase.
- Identified and fixed transactional issues due to incorrect exceptional handling and concurrency issues due to unsynchronized block of code.
- Created module for providing authentication to the users for using dis application and to synchronize handset with the Exchange server.
- Performed unit testing, system testing and user acceptance test.
Confidential
Java Developer
Responsibilities:
- Involved in design phase meetings for Business Analysis and Requirements gathering.
- Worked with business functional lead to review and finalize requirements and data profiling analysis.
- Extracted data from source systems SQL Server and Sequential files, Flat files.
- Created DataStage jobs, batches and job sequences and tuned them for better performance.
- Designed various Mappings (Source-to-Target) using DataStage to link between different source systems and Data warehouse for loading data into Warehouse.
- Used Shared Containers for code reuse and implementing complex business logic
Confidential
Java Developer
Responsibilities:
- Interacting with the system analysts & business users for design & requirement clarification.
- Used J2EE Design Patterns like Session Façade pattern.
- Used Struts Framework to deal with functionalities like credit issues, invoice details. Used Spring DAO Module along with Struts Framework to deal with Database.
- Spring MVC Framework IOC(Inversion Of Control) design pattern is used to have relationships between application components.
- Developed, and debugged the servlets and EJB with Jboss Application server
- Developed JSPs according to requirement
- Used Connection Pooling to get JDBC connection and access database procedures.
- Wrote complex SQL queries to retrieve invoice details and other financial elements etc.
- Used Log4j as a debugging tool.
