We provide IT Staff Augmentation Services!

Hadoop Developer Resume

0/5 (Submit Your Rating)

AR

SUMMARY

  • Proactive IT developer with 7+ years of working experience in Designing and Building high performance and scalable systems using Java/ J2EE Technologies and Big Data Ecosystem on windows and Linux environments.
  • Highly dedicated, Self - motivated and result oriented Hadoop Developer with around 3 years of strong end-to-end experience on Hadoop Development with varying level of expertise around different BIGDATA HADOOP projects
  • Strong experience using Big Data frameworks and tools to crunch Petabytes of structured and unstructured data using HDFS, MapReduce, Hive, Pig, Sqoop, Oozie, Flume, Hue and NoSQL databases (Hbase, Cassandra)
  • Experience in working with Cloudera (CDH3 & CDH4) Hadoop clusters and exposure on Horton Works Distributions (HDP).
  • Good understanding of Hadoop 2.0 (MRv2) YARN architecture.
  • Excellent understanding of MapReduce framework and its design patterns.
  • Expert in writing MapReduce programs in Java to extract information from huge volume of files, load it into HBase.
  • Expertise in Importing and exporting data into HDFS and Hive using Sqoop and vice-versa.
  • Experience in using Flume to load the log data from multiple sources directly into HDFS.
  • Excellent hands on experience in analyzing data using Pig Latin, Hive QL, and HBase.
  • Expertise with optimizing query performance and data load times in PIG, Hive and MapReduce applications
  • Experience in using Sequence, AVRO, RC files and CSV file formats and using different compression Techniques.
  • Expert in writing HiveQL queries and Pig Latin scripts.
  • Experienced with improving data cleansing process using Pig Latin operations, transformations and join operations.
  • Experience in extending Hive and Pig core functionality by writing custom UDFs like UDAFs and UDTFs.
  • Hands on experience with in designing both time driven and data driven automated workflows using Oozie.
  • Hands on experience in performing real time analytics on big data using HBase and Cassandra.
  • Experience in supporting data analysis projects using Elastic Map Reduce on the Amazon Web Services (AWS) cloud. Exporting and importing data into Amazon S3 and Teradata.
  • Exposure in using Cloudera Manager for monitoring jobs running on cluster.
  • Experience in writing Test cases, test classes using MRUnit, JUnit and Mockito.
  • Exposure on customization of embedded Apache solr using spring MVC.
  • Explored on Spark, Kafka, Crunch and Storm along with other open source projects to create a POC.
  • Exposure on MapReduce programs using Python scripting.
  • Expertise in developing large scale e commerce applications using J2EE technologies.
  • Expertise in developing applications using Struts, Enterprise Java Beans (EJB), Servlets, JPA, JMS, SOAP, JSF, Hibernate, spring frameworks.
  • Working knowledge of Web/Application Servers like JBoss, Apache TomCat, IBM WebSphere and Oracle WebLogic.
  • Expertise in tools and utilities like Eclipse, TOAD for Oracle, Rational Rose (UML tool), WSAD, RAD, Ant, Maven.
  • Strong knowledge of Agile development methodologies, waterfall methodologies to minimize customer impact.
  • Good understanding of all aspects of Testing such as Unit, Regression, Agile, White-box, Black-box.
  • Ability to effectively manage deadlines .Self-motivated, highly organized and the ability to multi-task.

PROFESSIONAL EXPERIENCE

Hadoop Developer

Confidential, AR

Responsibilities:

  • Extensively worked with Teradata utilities like Fast Load, Multi Load to export and load data to/from different sources systems including flat files.
  • Extracted the data from Teradata/RDBMS into HDFS using Sqoop export/import.
  • Experienced on loading and transforming of large sets of structured, semi structured and unstructured data.
  • Optimized Map/Reduce Jobs to use HDFS efficiently by using various compression mechanisms.
  • Developed multiple Mapreduce jobs in java for data cleaning and preprocessing
  • Improved mapper and reducer code using Python iterators and generators.
  • Expertise in using Data organizational design patterns in Map Reduce to convert business data into custom format.
  • Experienced in implementing complex map reduce programs using map reduce design patterns.
  • Expertise in optimization of MapReduce algorithms using Combiners, Partitioners and Distributed Cache to deliver best results.
  • Experienced with handling different compression codec's like LZO, GZIP, and Snappy.
  • Experienced with handling data from different sources at a time to reducer using Object Writable in Mapreduce programs.
  • Developed Hive MapReduce streaming python modules for machine learning on AWS-housed Hadoop/Hive with applications to predictive analytics.
  • Analyzed the data by performing Hive queries and running Pig scripts.
  • Created and worked Sqoop jobs with incremental load to populate Hive External tables.
  • Developed Hive scripts for end user / analyst requirements to perform ad hoc analysis
  • Very good understanding of Partitions, Bucketing concepts in Hive and designed both Managed and External tables in Hive to optimize performance.
  • Experienced with handling different optimization techniques in Hive joins.
  • Solved performance issues in Hive and Pig scripts with understanding of Joins, Group and aggregation and how does it translate to MapReduce jobs.
  • Responsible for writing Hive Queries for analyzing data in using Hive Query Language (HQL).
  • Developed UDFs in Java as and when necessary to use in HIVE queries
  • Developed Oozie workflow for scheduling and orchestrating the ETL process.
  • Implemented authentication using Kerberos authentication using Apache Sentry.
  • Experience in using Sequence files, RCFile, AVRO and Key Value file formats.
  • Deployed an Apache Solr search engine server to help speed up the search of the government cultural asset.
  • Knowledge on visualization tools like Tableau.
  • Experienced with writing build jobs using Maven and integrate that with Jenkins.
  • Involved in all phases of SDLC in Big Data Environment.

Hadoop Developer

Confidential, Hartford, CT

Responsibilities:

  • Experience in developing Shell Scripts for system management and for automating routine tasks.
  • Worked on Hadoop Map Reduce tasks in Java to convert JSON format logs to text formats like CSV and TSV (Comma separated values, Tab separated values )
  • Developed Mapreduce programs in Java for parsing the raw data and populating staging tables
  • Experienced in writing test classes for Mapreduce programs using MRUnit and Mockito.
  • Experienced in loading data from UNIX local file system to HDFS.
  • Involved in creating Hive tables, loading with data and writing Hive queries
  • Worked on NoSQL database like HBase for storing unstructured image and signature data.
  • Analyzed the data in HBase to get real time analytics using Java API.
  • Imported bulk amount of data into HBase using Mapreduce Integration.
  • Implemented performance tuning by using Map Joins, resizing the Mappers/reducers etc.
  • Implemented Hive/Pig UDFs for common operations
  • Experienced with handling Automatic failover in HBase using Zoo Keeper and HMaster.
  • Performing real time analytics in Time Series data using HBase and Hadoop eco system.
  • Experienced with performing ETL operations using Pig Latin operations and scripts.
  • Parsed JSON and XML files in PIG using Pig Loader functions and extracted meaningful information from Pig Relations by providing a regex using the built-in functions in Pig.
  • Experienced with processing Avro data files using Avro tools and Mapreduce programs.
  • Used Flume to collect, aggregate, and store the web log data from different sources like web servers, mobile and network devices and pushed to HDFS.
  • Integrated spring schedulers with Oozie client to schedule nightly cron jobs.
  • Being a part of a POC effort to help build new Hadoop clusters.
  • Gained good Knowledge in Amazon AWS concepts like EMR and EC2 web services which provides fast and efficient processing of Big Data
  • Worked hands on NoSQL databases like Cassandra, for POC purpose in storing images and URIs.
  • Experienced in monitoring, configuring cluster using Ganglia.
  • Experienced with writing Low level and high level design documents according to business requirements using Visio.

Senior Java/J2EE Developer

Confidential, Atlanta, GA

Responsibilities:

  • Responsible for gathering business and functional requirements from the users
  • Analyzed Use Case Diagrams and created various UML Diagrams such as class and sequence diagrams
  • Implemented Business components using spring core and Navigation using Spring MVC.
  • Implemented Persistent layer using Hibernate core interfaces.
  • Implemented internationalization using Spring MVC interceptors.
  • Leveraged the homegrown framework to handle exceptions
  • Implemented Message driven beans to get log events from Queue.
  • Wrote Action classes, Request processor, Business Delegate, Business Objects, Service classes and JSP pages
  • Designed the presentation tier components by customizing the Struts framework components such as configuring web modules, request processors and error handling components
  • Developed JSP pages using Struts custom tags.
  • Developed the components for parsing XML documents using SAX and DOM parsers
  • Implemented design patterns such as DAO, Session Facade and Value Objects
  • Implemented the Web Services functionality in the application to allow access by external applications to the data
  • Utilized Apache Axis for the web service framework and created and deployed clients using SOAP and WSDL
  • Developed and implemented several test cases using JUnit and performed load testing
  • Used Hibernate as ORM tool and defined the mapping and relationship of each table in database
  • Coordinated with QA team to ensure the quality of the application

Java/J2EE Developer

Confidential

Responsibilities:

  • Involved in End to End Design and Development of UI Layer, Service Layer and Persistence Layer.
  • Implemented Spring MVC for designing and implementing the UI Layer for the application.
  • Implemented UI screens using JSF for defining and executing UI flow in the application for the Order Guide module.
  • Have used AJAX to retrieve data from server synchronously in the background without interfering with the display and existing page in an interactive way.
  • Have Used DWR (Direct Web Remoting) generated script to make AJAX calls to JAVA.
  • Involved in writing JavaScript for dynamic manipulation of the elements on the screen and to validate the input.
  • Have used pair programming model for the development.
  • Involved in writing Spring Validator Classes for validating the input data.
  • Have set up Acegi security for the application using the spring framework.
  • Have used JAXB to marshal and unmarshal java objects to Communicate with the backend mainframe system.
  • Involved in writing complex PL/SQL and SQL blocks for the application
  • Worked on persistence layer using O/R Mapping tool Hibernate with Oracle 10g Database.
  • Provided expertise for performance optimizations on the end-to-end solution, implemented performance enhancements to database interaction objects.
  • Used Log4j package for the debugging, info and error tracings.

Java/J2EE Developer

Confidential

Responsibilities:

  • Developed Sequence, Use Case Diagrams and Process Flow Diagrams using Rational Rose.
  • Implemented design patterns like Session Façade, Singleton, Factory, Service Locator and DAO.
  • Involved in Installation and configuration of JBoss Server.
  • Extensively involved in writing Stored Procedures for data retrieval and data storage and updates in Oracle database using JDBC.
  • Implemented Business components using Struts Action class.
  • Implemented Pl/SQL stored procedures, functions, triggers for persistence layer.
  • Extensively used Log4j for logging throughout the application.
  • Produced web service using WSDL/SOAP standard.
  • Used SVN for source code versioning and code repository.
  • Developed Stateless Session EJB Beans to for server side processing.
  • Involved in design and implementation of front end controller using Struts Framework.
  • Implemented validation utilities using struts validation framework.
  • Used JDBC for database access
  • Experienced with writing Ant scripts for build process.
  • Experienced in creating UI screens using JSP, JavaScript, HTML, and CSS.
  • Experienced with writing design documents using Rational rose.

We'd love your feedback!