Hadoop Developer Resume
AR
SUMMARY
- Proactive IT developer with 7+ years of experience designing and building high-performance, scalable systems using Java/J2EE technologies and the Big Data ecosystem on Windows and Linux environments.
- Highly dedicated, self-motivated, and results-oriented Hadoop Developer with around 3 years of strong end-to-end experience in Hadoop development, with varying levels of expertise across different Big Data Hadoop projects.
- Strong experience using Big Data frameworks and tools to process petabytes of structured and unstructured data using HDFS, MapReduce, Hive, Pig, Sqoop, Oozie, Flume, Hue, and NoSQL databases (HBase, Cassandra).
- Experience working with Cloudera (CDH3 & CDH4) Hadoop clusters and exposure to Hortonworks distributions (HDP).
- Good understanding of Hadoop 2.0 (MRv2) YARN architecture.
- Excellent understanding of MapReduce framework and its design patterns.
- Expert in writing MapReduce programs in Java to extract information from large volumes of files and load it into HBase.
- Expertise in importing data into and exporting data from HDFS and Hive using Sqoop.
- Experience in using Flume to load the log data from multiple sources directly into HDFS.
- Excellent hands-on experience analyzing data using Pig Latin, HiveQL, and HBase.
- Expertise in optimizing query performance and data load times in Pig, Hive, and MapReduce applications.
- Experience using SequenceFile, Avro, RCFile, and CSV file formats and different compression techniques.
- Expert in writing HiveQL queries and Pig Latin scripts.
- Experienced in improving data cleansing processes using Pig Latin operations, transformations, and joins.
- Experience extending Hive and Pig core functionality by writing custom functions such as UDFs, UDAFs, and UDTFs.
- Hands-on experience designing both time-driven and data-driven automated workflows using Oozie.
- Hands-on experience performing real-time analytics on big data using HBase and Cassandra.
- Experience supporting data analysis projects using Elastic MapReduce (EMR) on the Amazon Web Services (AWS) cloud, including exporting and importing data to and from Amazon S3 and Teradata.
- Exposure to Cloudera Manager for monitoring jobs running on the cluster.
- Experience writing test cases and test classes using MRUnit, JUnit, and Mockito.
- Exposure to customizing embedded Apache Solr using Spring MVC.
- Explored Spark, Kafka, Crunch, and Storm, along with other open-source projects, to create a POC.
- Exposure to writing MapReduce programs using Python scripting.
- Expertise in developing large-scale e-commerce applications using J2EE technologies.
- Expertise in developing applications using Struts, Enterprise JavaBeans (EJB), Servlets, JPA, JMS, SOAP, JSF, Hibernate, and Spring frameworks.
- Working knowledge of web/application servers such as JBoss, Apache Tomcat, IBM WebSphere, and Oracle WebLogic.
- Expertise in tools and utilities like Eclipse, TOAD for Oracle, Rational Rose (UML tool), WSAD, RAD, Ant, Maven.
- Strong knowledge of Agile and waterfall development methodologies to minimize customer impact.
- Good understanding of testing approaches such as unit, regression, Agile, white-box, and black-box testing.
- Ability to manage deadlines effectively; self-motivated, highly organized, and able to multitask.
PROFESSIONAL EXPERIENCE
Hadoop Developer
Confidential, AR
Responsibilities:
- Worked extensively with Teradata utilities such as FastLoad and MultiLoad to export and load data to and from different source systems, including flat files.
- Extracted data from Teradata/RDBMS into HDFS using Sqoop import/export.
- Loaded and transformed large sets of structured, semi-structured, and unstructured data.
- Optimized MapReduce jobs to use HDFS efficiently by applying various compression mechanisms.
- Developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
- Improved mapper and reducer code using Python iterators and generators.
- Applied data organization design patterns in MapReduce to convert business data into custom formats.
- Implemented complex MapReduce programs using MapReduce design patterns.
- Optimized MapReduce algorithms using Combiners, Partitioners, and the Distributed Cache for better performance.
- Handled different compression codecs such as LZO, GZIP, and Snappy.
- Passed data from multiple sources to the reducer at the same time using ObjectWritable in MapReduce programs.
- Developed Hive MapReduce streaming Python modules for machine learning on AWS-hosted Hadoop/Hive, with applications to predictive analytics.
- Analyzed data by running Hive queries and Pig scripts.
- Created and ran Sqoop jobs with incremental load to populate Hive external tables.
- Developed Hive scripts for end-user and analyst requirements to perform ad hoc analysis.
- Applied partitioning and bucketing concepts in Hive and designed both managed and external tables to optimize performance.
- Applied different optimization techniques to Hive joins.
- Solved performance issues in Hive and Pig scripts by understanding how joins, grouping, and aggregation translate to MapReduce jobs.
- Wrote queries in Hive Query Language (HQL) to analyze data.
- Developed UDFs in Java as needed for use in Hive queries.
- Developed Oozie workflow for scheduling and orchestrating the ETL process.
- Implemented authentication using Kerberos and authorization using Apache Sentry.
- Used SequenceFile, RCFile, Avro, and key-value file formats.
- Deployed an Apache Solr search server to speed up searches of government cultural assets.
- Knowledge of visualization tools such as Tableau.
- Wrote build jobs using Maven and integrated them with Jenkins.
- Involved in all phases of the SDLC in a Big Data environment.
Hadoop Developer
Confidential, Hartford, CT
Responsibilities:
- Developed shell scripts for system management and for automating routine tasks.
- Worked on Hadoop MapReduce tasks in Java to convert JSON-format logs to text formats such as CSV (comma-separated values) and TSV (tab-separated values).
- Developed MapReduce programs in Java for parsing the raw data and populating staging tables.
- Wrote test classes for MapReduce programs using MRUnit and Mockito.
- Loaded data from the UNIX local file system into HDFS.
- Created Hive tables, loaded them with data, and wrote Hive queries.
- Worked with NoSQL databases such as HBase to store unstructured image and signature data.
- Analyzed data in HBase using the Java API for real-time analytics.
- Imported bulk data into HBase using MapReduce integration.
- Tuned performance using map joins and by adjusting the number of mappers and reducers.
- Implemented Hive/Pig UDFs for common operations.
- Handled automatic failover in HBase using ZooKeeper and HMaster.
- Performed real-time analytics on time-series data using HBase and the Hadoop ecosystem.
- Performed ETL operations using Pig Latin operations and scripts.
- Parsed JSON and XML files in Pig using Pig loader functions and extracted meaningful information from Pig relations by applying regular expressions with Pig's built-in functions.
- Processed Avro data files using Avro tools and MapReduce programs.
- Used Flume to collect, aggregate, and store web log data from different sources such as web servers, mobile devices, and network devices, and pushed it to HDFS.
- Integrated Spring schedulers with the Oozie client to schedule nightly cron jobs.
- Took part in a POC effort to help build new Hadoop clusters.
- Gained good knowledge of Amazon AWS concepts such as EMR and EC2 web services, which provide fast and efficient processing of Big Data.
- Worked hands-on with NoSQL databases such as Cassandra for a POC storing images and URIs.
- Monitored and configured the cluster using Ganglia.
- Wrote low-level and high-level design documents according to business requirements using Visio.
Senior Java/J2EE Developer
Confidential, Atlanta, GA
Responsibilities:
- Responsible for gathering business and functional requirements from the users.
- Analyzed use case diagrams and created various UML diagrams such as class and sequence diagrams.
- Implemented business components using Spring Core and navigation using Spring MVC.
- Implemented the persistence layer using Hibernate core interfaces.
- Implemented internationalization using Spring MVC interceptors.
- Leveraged the homegrown framework to handle exceptions.
- Implemented message-driven beans to get log events from a queue.
- Wrote Action classes, Request Processor, Business Delegate, Business Objects, Service classes, and JSP pages.
- Designed the presentation tier components by customizing Struts framework components, including configuring web modules, request processors, and error handling components.
- Developed JSP pages using Struts custom tags.
- Developed components for parsing XML documents using SAX and DOM parsers.
- Implemented design patterns such as DAO, Session Facade, and Value Object.
- Implemented web services functionality in the application to give external applications access to the data.
- Used Apache Axis as the web service framework and created and deployed clients using SOAP and WSDL.
- Developed and implemented several test cases using JUnit and performed load testing.
- Used Hibernate as the ORM tool and defined the mapping and relationships of each table in the database.
- Coordinated with the QA team to ensure the quality of the application.
Java/J2EE Developer
Confidential
Responsibilities:
- Involved in end-to-end design and development of the UI, service, and persistence layers.
- Used Spring MVC to design and implement the UI layer of the application.
- Implemented UI screens using JSF for defining and executing UI flow in the application for the Order Guide module.
- Used AJAX to retrieve data from the server asynchronously in the background without interfering with the display of the existing page.
- Used DWR (Direct Web Remoting) generated scripts to make AJAX calls to Java.
- Involved in writing JavaScript for dynamic manipulation of the elements on the screen and to validate the input.
- Used the pair programming model for development.
- Involved in writing Spring Validator Classes for validating the input data.
- Set up Acegi Security for the application using the Spring Framework.
- Used JAXB to marshal and unmarshal Java objects to communicate with the backend mainframe system.
- Involved in writing complex PL/SQL and SQL blocks for the application.
- Worked on the persistence layer using the O/R mapping tool Hibernate with an Oracle 10g database.
- Provided expertise on performance optimization for the end-to-end solution and implemented performance enhancements to database interaction objects.
- Used the Log4j package for debug, info, and error tracing.
Java/J2EE Developer
Confidential
Responsibilities:
- Developed Sequence, Use Case Diagrams and Process Flow Diagrams using Rational Rose.
- Implemented design patterns like Session Façade, Singleton, Factory, Service Locator and DAO.
- Involved in installation and configuration of the JBoss server.
- Extensively involved in writing Stored Procedures for data retrieval and data storage and updates in Oracle database using JDBC.
- Implemented Business components using Struts Action class.
- Implemented PL/SQL stored procedures, functions, and triggers for the persistence layer.
- Extensively used Log4j for logging throughout the application.
- Produced web services using the WSDL/SOAP standards.
- Used SVN for source code versioning and code repository.
- Developed stateless session EJBs for server-side processing.
- Involved in design and implementation of the front-end controller using the Struts framework.
- Implemented validation utilities using the Struts validation framework.
- Used JDBC for database access.
- Wrote Ant scripts for the build process.
- Created UI screens using JSP, JavaScript, HTML, and CSS.
- Wrote design documents using Rational Rose.
