We provide IT Staff Augmentation Services!

Hadoop Developer Resume

4.00/5 (Submit Your Rating)

NC

SUMMARY:

  • 9 plus years of overall experience in IT industry which includes hands on experience in Software development using Java and 3+ years of comprehensive experience on Big Data.Good Understanding of the software development lifecycle (SDLC). Exposed to Agile method of software development (SCRUM). Hands on Experience with core Java and good at communicating with the client requirement gathering.
  • Good hands on knowledge in Hadoop ecosystem and its components such as Map Reduce & HDFS.
  • Good understanding on various daemon processes like Job Tracker, Task Tracker, Name Node and Data Node.
  • Good experience working with Horton works Distribution and Cloudera Distribution.
  • Experienced in working with Hadoop and BigData over Amazon web services AWS Cloud and in Microsoft using SSH and Putty.
  • Worked on installing, configuring, and administrating Hadoop cluster for distributions like Cloudera and Horton works.
  • Good understanding of NOSQL database HBase, MongoDB.
  • Worked on REST full web service with spring and JSON files.
  • Very well experienced in designing and developing both server side and client side applications.
  • In - depth knowledge on Hadoop Ecosystem components like PIG, YARN, Hive, Sqoop, HBase, Oozie, Zookeeper, Hue, Cloudera Manager, Flume, Spark, Strom and Scala and Kafka
  • Hands-on experience writing MapReduce Jobs
  • Experienced in managing and reviewing Hadoop logs.
  • Hands on experience in using Sqoop for importing and exporting data from HDFS to Relational Database Systems.
  • Hands-on work experience in RDBMS and creating complex Linux shell scripts
  • Experience in extending Hive and Pig core functionality by writing custom UDFs
  • Experienced in analyzing data by using Hive query language, Pig and Map Reduce.
  • Knowledge and experience in job work-flow scheduling and monitoring tools like Oozie and Zookeeper.
  • Knowledge of data warehousing and ETL tools like Informatica.
  • Used tools such as Eclipse and net beans for Development.
  • Explored Spark, Kafka, and Storm with other open source projects to create a real-time analytics framework.
  • Experienced in various RDBMS like MS SQL Server, MySQL, and Oracle.
  • Experienced in using MVC architecture, StrutsHibernate for developing web applications using Java, JSPs, JavaScript, HTML, JQuery, AJAX, XML and JSON.
  • Expertise in Java development using J2EE, Spring, J2SE, Servlets, JSP
  • Experienced in Core Java and object oriented design with strong understanding of Collections, Multithreading and Exception handling.
  • Experienced in Agile software methodology (scrum).
  • Knowledgeable in Database concepts and writing finely tuned queries and performance tuning.
  • Strong knowledge in writing advanced Shell Scripts in Linux/Unix
  • Skilled in establishing strong relations among project's teams and team members.
  • Ability to work in high-pressure environments delivering to and managing stakeholder expectations
  • Application of structured methods to: Project Scoping and Planning, risks, issues, schedules and deliverables.
  • Strong analytical and Problem solving skills.
  • Good Inter personnel skills and ability to work as part of a team. Exceptional ability to learn and master new technologies and to deliver outputs in short deadlines

TECHNICAL SKILLS:

Hadoop Ecosystem: MapReduce, HDFS, Hive, Pig, HBase, Zookeeper, Sqoop, Oozie, Flume, Spark, Kafka, Storm

Java Technologies: Java, J2EE, Servlets, JSP, XML, AJAX, SOAP, WSDL

SDLC Methodologies: Agile, UML, Design Patterns (Core Java and J2EE)

Enterprise Frameworks: Ajax, MVC, Struts 2/1, Hibernate 3, Spring 3/2.5/2

Version and Source Control: CVS,SVN, GIT, Synergy

Programming Languages: C,C++,Java, XML, Unix Shell scripting, SQL and PLSQL

Web Technologies: HTML, DHTML, XML, XSLT, JavaScript, CSS

Modeling Tools: UML on Rational Rose 4.0 IDE Tools: Eclipse, Net beans.

WORK EXPERIENCE:

Hadoop Developer

Confidential, NC

Responsibilities:

  • Involved in installing and configuring BigInsights Hadoop platform including ecosystem environment on the server.
  • To configure and manage Hadoop Components such as Pig, Hive and Sqoop.
  • Used Flume to load unstructured and semi structured data from various sources such as websites and streaming data to cluster
  • Implemented UDFs for providing custom Pig and hive capabilities
  • Worked on designing NoSQL Schemas on HBase
  • Worked on configuring and managing disaster recovery and backup on Cassandra Data.
  • Performed file system management and monitoring on Hadoop log files.
  • Utilized Oozie workflow to run Pig and Hive jobs
  • Developed customized classes for serialization and Deserialization in Hadoop
  • Performed optimization of MapReduce for effective usage of HDFS by compression techniques.
  • Developed Shell, Perl and Python scripts to automate and provide Control flow to Pig scripts.
  • Analyzed large amounts of data sets to determine optimal way to aggregate and report on it.
  • Responsible for managing data coming from different sources.
  • Worked on Data Serialization formats for converting Complex objects into sequence bits by using AVRO, PARQUET, JSON, CSV formats.
  • Successfully converted the AVRO data into PARQUET format in IMPALA for faster query processing.
  • Involved in migration of data from existing RDBMS (oracle and SQL server) to Hadoop using Sqoop for processing data.
  • Performed analysis of data using Hive queries and Pig scripts.

Environment: Hadoop Framework, BigInsights, MapReduce, Hive, Sqoop, Pig, HBase, Flume, Oozie, Java(JDK1.6), UNIX Shell Scripting, Oracle 11g/12g, Windows NT, IBM Data stage 8.1, TOAD 9.6, Teradata.

Hadoop Developer

Confidential, Dallas, TX

Responsibilities:

  • Worked on a Hadoop environment with MapReduce, KAFKA, Sqoop, Oozie, Flume, Hbase, Pig, Hive and Impala on a multi node cloud environment
  • To configure Hadoop environment in cloud through Amazon Web Services(AWS) and to provide a scalable distributed data solution
  • Worked on installation of KAFKA on Hadoop cluster and to use it for streaming & cleansing of raw data and have extracted useful information using Hive and stored the results in Hbase and have enabled the clients to review the results using Tableauby connecting it through the IP address provided by AWS.
  • Developed producers for Kafka which compress, and bind many small files into a larger Avro and Sequence files before writing to HDFS to make best use of a Hadoop block size.
  • Worked on implementing MapReduce Jobs to parse raw weblogs into delimited records and also in handling files in various formats such as JSON, XML, Text formats.
  • Improved performance on MapReduce Jobs by creating combiners, Partitioning and Distributed Cache.
  • Exposure in spark iterative processing.
  • Created partitioned tables in Hivefor best performance and faster querying.
  • Utilized Sqoop to import data from various database sources into Hbase using Sqoop scripts by incremental data loading on transactions of customer's data by date
  • Utilized Flume in moving log files generated from various sources into Amazon S3 for processing of data.
  • Performed extensive data analysis using Hive and Pig.
  • Created Simple as well as complex results using Hive and have improved performance and reduced query time by creating partitioned tables.
  • Created workflow in Oozie for Automating tasks of loading data into Amazon S3 and to preprocess using Pig, utilized Oozie for data scrubbing and processing
  • Developed scripts and deployed them to pre-process the data before moving to HDFS.
  • Performed extensive analysis on data with Hive and Pig.
  • Worked on proof of concept on IMPALA.
  • Used Synergy for Version control and Clear Quest for creating and recording logs on defects and tasks assigned to me.

Environment: Hive, MapReduce, Pig, Impala, Tableau, HDFS, Oozie, and AWS.

Hadoop Developer

Confidential, Dallas, TX

Responsibilities:

  • Worked on analyzing Hadoop cluster using different big data analytic tools including Pig, Hive and MapReduce
  • Collecting and aggregating large amounts of log data using Apache Flume and staging data in HDFS for further analysis
  • Worked on debugging, performance tuning of Hive & Pig Jobs
  • Created Hbase tables to store various data formats of PII data coming from different portfolios
  • Implemented test scripts to support test driven development and continuous integration
  • Worked on tuning the performance Pig queries
  • Involved in loading data from LINUX file system to HDFS
  • Importing and exporting data into HDFS and Hive using Sqoop
  • Experience working on processing unstructured data using Pig and Hive
  • Implemented Partitioning, Dynamic Partitions, Buckets in Hive.
  • Experienced in running Hadoop streaming jobs to process terabytes of xml format data.
  • Supported MapReduce Programs those are running on the cluster
  • Gained experience in managing and reviewing Hadoop log files
  • Involved in scheduling Oozie workflow engine to run multiple Hive and pig jobs
  • Developed Pig Latin scripts to extract data from the web server output files to load into HDFS
  • Extensively used Pig for data cleansing.
  • Created and maintained Technical documentation for launching HADOOP Clusters and for executing Hive queries and Pig Scripts
  • Strong experience on Apache server configuration
  • Exported the result set from HIVE to MySQL using Shell scripts
  • Implemented SQL, PL/SQL Stored Procedures
  • Actively involved in code review and bug fixing for improving the performance.
  • Developed screens using JSP, DHTML, CSS, AJAX, JavaScript, Struts, spring, Java and XML

Environment: Hadoop, HDFS, Pig, Hive, MapReduce, Sqoop, LINUX, Cloudera, Big Data, Java APIs, Java collection, SQL, AJAX.

Hadoop Developer

Confidential, GA

Responsibilities:

  • Worked on the proof-of-concept for Apache Hadoop1.20.2 framework initiation
  • Installed and configured Hadoop clusters and eco-system
  • Developed automated scripts to install Hadoop clusters
  • Monitored Hadoop cluster job performance and capacity planning
  • Hands-on experience with Hadoop technology stack (HDFS, MapReduce, Hive, Hbase, Pig, Flume, Sqoop)
  • Had experience in high scale or distributed RDBMS
  • Created and Implemented highly scalable and reliable distributed data using NoSQL/Cassandra technology
  • Had experience in Hadoop framework, HDFS, MapReduce processing implementation
  • Tuning Hadoop performance with high availability and involved in recovery of Hadoop clusters
  • Responsible for coding JavaBatch, Restfull Service, Map Reduce program, Hive query’s, testing, debugging, Peer code review, troubleshooting and maintain status report.
  • Involved in identifying possible ways to improve the efficiency of the system
  • Requirements Study, Software Development Specification, Development and Unit Testing use of MRUnit and Junit
  • Expertise in writing Shell scripts to monitor Hadoop jobs
  • Used AVRO serialization technique to serialize the data
  • Prepare daily and weekly project status report and share it with the client.
  • Interact with the client periodically to discuss project status
  • Requirements Study, Software Development Specification, Development and Unit Testing use of Junit
  • Generate code coverage report use of Emma and PMD
  • Ensuring quality (Metrics, cohesion, coupling and etc) is maintained in the project deliverables
  • Responsible for troubleshooting and resolving all issues
  • Ensuring compliance to time sheets
  • Active co-ordination and communication with onsite counterparts

Environment: Java, Java Batch, Restfull Service, JAXB, Hadoop, Map Reduce, Junit, MRUnit and Oracle/Informix

Confidential, Mooresville, NC

Java Developer

Responsibilities:

  • Worked on a project which helps collect data of applicant's information from various portals and data is processed by authorized users and the decisions are made to the appropriate office and results are acknowledged to applicants of VDSS (Virginia-Department social service).
  • Created an eligibility system usingMVC framework, Bridge-Tier, IBM's multiple layers SOA framework withservices of EJB Server's which includestransaction, messaging, security, and connector.
  • Enabled multiple screen capability by implementing EJB and BO class on Bridge Tier framework.
  • Implementation of validations for data collection modules with web services with JavaScript and EJB beans
  • Using Collections and Exceptions in Java Core APIs for developing business logic layers.
  • Fetched required data from Oracle database using SQL queries using DOA classes.
  • Implemented a web service client for fetching information from distributed system and SOAP UI
  • Designed the front end using JSP, HTML, CSS, JSTL, Java Script, AJAX and jQuery.
  • Worked on bug fixes on production releases and also in QAT, UAT and deployment support as required.
  • Exposure to Clear Case and Sub Version tool for managing source code versioning and control.
  • Participated in code Reviews and provided valuable suggestions
  • Used Tier Framework for implementing Exception mechanism and logged the entries using Log4j.

Environment: Java, J2EE, Servlets, EJB, JNDI, JMS, Oracle 11g, SQL, JavaScript, AJAX, jQuery, XML, Soap, Junit, Bridge-Tier Framework, WebSphere 7.1, RAD 8.1, JSP, JSTL, HTML, IBM Clear Quest and Clear Case, SVN, Agile, TDD

Confidential, Mooresville, NC

Java Developer

Responsibilities:

  • Developed all the User Interfaces using JSP and Struts framework.
  • Writing Client Side validations using JavaScript.
  • Extensively used JQuery for developing interactive web pages.
  • Developed the DAO layer using the hibernate and for real time performance used the caching system for hibernate.
  • Experience in developing web services for production systems using SOAP and WSDL.
  • Developed the user interface presentation screens using HTML, XML, and CSS.
  • Experience in working with spring using AOP, IOC and JDBC template.
  • Developed the Shell scripts to trigger the Java Batch job, Sending summary email for the batch job status and processing summary.
  • Co-ordinate with the QA lead for development of test plan, test cases, test code and actual testing responsible for defects allocation and those defects are resolved.
  • Involved in testing and deployment of the application on Web logic Application Server during integration and QA testing phase.
  • Maintained the existing code base developed in the Struts, spring and Hibernate framework by incorporating new features and doing bug fixes.
  • Used ANT tool to build and deploy applications.
  • Involved in configuring web.xml and struts.xml for workflow.
  • Wrote SQL queries and created DDL scripts for interacting with the Oracle database.
  • Was part of solving the issues that came in production.
  • Documentation of common problems prior to go-live and while actively in a Production Support role.

Environment: J2SE, J2EE, Eclipse 3.2, Spring 2.5, Hibernate 3.0, Struts 1.2, JSP, XML, Junit, Weblogic 10.3, JavaScript, Oracle 10g, HTML, AJAX, JQuery CSS.

Confidential, Jacksonville, FL

Java Developer

Responsibilities:

  • Responsible for requirement gathering and analysis through interaction with end users.
  • Involved in designing use-case diagrams, class diagram, interaction using UML model with Rational Rose.
  • Designed and developed the application using various design patterns, such as session facade, business delegate and service locator.
  • Worked on Maven build tool.
  • Involved in developing JSP pages using Struts custom tags, JQuery and Tiles Framework.
  • Used JavaScript to perform client side validations and Struts-Validator Framework for server-side validation.
  • Good experience in Mule development.
  • Developed Web applications with Rich Internet applications using Java applets, Silverlight, JavaFX.
  • Involved in creating Database SQL and PL/SQL queries and stored Procedures.
  • Implemented Singleton classes for property loading and static data from DB.
  • Debugged and developed applications using Rational Application Developer (RAD).
  • Developed a Web service to communicate with the database using SOAP.
  • Developed DAO (data access objects) using Spring Framework 3.
  • Deployed the components in to WebSphere Application server 7.
  • Actively involved in backend tuning SQL queries/DB script.
  • Worked in writing commands using UNIX Shell scripting.
  • Involved in developing other subsystems’ server-side components.
  • Production supporting using IBM clear quest for fixing bugs.

Environment: Java EE 6, IBM WebSphere Application Server 7, Apache-Struts 2.0, EJB 3, Spring 3.2, JSP 2.0, WebServices, JQuery 1.7, Servlet 3.0, Struts-Validator, Struts-Tiles, Tag Libraries, ANT 1.5, JDBC, Oracle 11g/SQL, JUNIT 3.8, CVS 1.2, Rational clear case,Eclipse 4.2,JSTL,DHTML

We'd love your feedback!