Hadoop Developer Resume
Mount Laurel, NJ
SUMMARY
- Software Developer, having 6 years of extensive experience in delivering challenging technology solutions, working with geographically distributed teams.
- Extensive experience in dealing with Apache Hadoop components like Big Data, HDFS, MapReduce, Hive, Pig, Sqoop, Spark, Oozie, and Big Data Analytics.
- Solid experience in designing, implementing, and improving analytic solutions for Big Data on Apache Hadoop. Experience in Map Reduce programming model and Hadoop Distributed File Systems.
- Highly skilled in integrating with Spark streaming for high speed data processing.
- Skilled on Spark, used for data transformation for larger data sets.
- Good knowledge in writing Spark application using Python and Java.
- Deep understanding of data import & export from relational database into Hadoop cluster.
- Good Knowledge in NoSQL databases like MongoDB and Cassandra.
- Experience in writing UDFS in java for hive and pig. Experience in using HCatalog for Hive, Pig, and HBase.
- Extensive knowledge on Flume technologies. Experience in working with flume to load the log data from multiple sources directly into HDFS.
- Implemented Map Reduce jobs in Hive by querying the available data.
- Created Hive internal and external tables defined with appropriate static and dynamic partitions.
- Creating the Hive tables and partitioned tables using Hive Index and bucket to make ease data analytics.
- Experienced in developing MapReduce programs using Apache Hadoop for working with Big Data.
- Good working knowledge on Linux based system.
- Hands on experience in Import/Export of data using Hadoop Data Management tool Sqoop.
- Experience in importing and exporting the data using Sqoop from HDFS to Relational Database systems.
- Provided technical assistance for configuration, administration and monitoring of Hadoop clusters.
- Experienced in job workflow scheduling and monitoring tools like Oozie and Zookeeper.
- Experience in test environment setup and test infrastructure development in both manual and automation.
- Hands on development experience with RDBMS, including writing complex SQL queries, stored procedure, and triggers.
- Experience with Database testing using various complex set of queries and UDF’s.
- Experience in all phases of Software Development Life Cycle for maintaining and supporting the Java, and J2EE applications.
- Quick learner and self - starter with effective communication, motivation and organizational skills combined with attention to details and business process improvements.
- Excellent team player as well as an individual with strong analytical, communication and interpersonal skills.
TECHNICAL SKILLS
Hadoop Ecosystems: HDFS, MapReduce, Hive, Pig, Sqoop, Spark, Oozie, Zookeeper, Flume, Cascading.
Scripting Languages: PL/SQL, Shell Scripting, Python.
Java Technologies: Java, J2EE, Servlets, Java Beans, JSP, JDBC.
Database: MYSQL, Oracle 12c, Mongo DB, Cassandra, Teradata.
Tools: Eclipse, NETBEANS, RAD, Maven, IntelliJ, Visual Studio, Log4J, UML.
Testing Tools: Selenium IDE and Web Driver, HP Quick test professional (QTP).
Servers: Apache Tomcat, IBM WebSphere, Oracle Weblogic server, SQL Server.
Operating Systems: DOS, Windows, Linux, Ubuntu, Fedora, MAC OSX.
PROFESSIONAL EXPERIENCE
Confidential, Mount Laurel, NJ
Hadoop Developer
Responsibilities:
- Involved in ETL, Data Integration and Migration. Imported data using Sqoop to load data from Oracle to HDFS on regular basis.
- Responsible for implementation and ongoing administration of Hadoop infrastructure.
- Installed and configured Hadoop Mapreduce, HDFS, Developed multiple MapReduce jobs in java for data cleaning and preprocessing.
- Worked with data delivery teams to setup new Hadoop users.
- Integrate Apache Spark with Hadoop components.
- Involved in creating dynamic Oozie workflow using Spark.
- Worked on setting up Pig and Hive on multiple nodes.
- Developed SQL Queries to retrieve or create test data from different databases.
- Used Pig as ETL tool to do transformations, event joins, filter bot traffic and some pre-aggregations before storing the data onto HDFS.
- Written Hive queries for data analysis to meet the business requirements.
- Created Hive tables and working on them using Hive QL. Importing and exporting data into HDFS from Oracle Database and vice versa using Sqoop.
- Responsible to manage data coming from different sources.
- Experience in managing and reviewing Hadoop log files.
- Developed Scripts and Batch Job to schedule various Hadoop Program.
- Managing and scheduling Jobs on a Hadoop cluster.
- Installed and configured Pig and written Pig Latin scripts.
- Wrote MapReduce jobs using Pig Latin.
- Involved in analyzing system design specifications and developed test plans, test scenarios and test cases to cover overall quality testing.
- Implemented test scripts to support test driven development and continuous integration.
Environment: Hadoop, Hive, Linux, MapReduce, HDFS, Pig, Sqoop, Spark, Shell Scripting, Java (JDK 1.6), Java 6, Eclipse, Oracle 10g, PL/SQL, SQL*PLUS, Toad 9.6, Linux, QTP.
Confidential, Columbus, OH
Hadoop Developer
Responsibilities:
- Involved in building a real-time analytics platform based on web-crawled data stored in Enterprise Data Warehouse (EDW).
- Installed raw Hadoop and NoSQL applications and develop programs for sorting and analyzing data.
- Responsible for Data in the warehouse is processed and transformed using Map Reduce and loaded into Hadoop Distributed File System (HDFS).
- Analyzed data using Hive that exposes data in Hadoop File Systems in a distributed Query enabled platform. Sqoop is used to extract data from internal structured data stores and load into HDFS.
- Understand customer analytic needs and translate them into pluggable Hadoop Hive UDFs/UDAFs. Involved in writing UDF functions for Hive.
- Create and update Hive schema to support a highly-available data warehouse for BI teams.
- Controlled Data volumes ranging in Petabytes, which makes it challenge for regular analytics platform.
- Installed and configured Pig.
- Used MongoDB extensively to filter required data for data scientists.
- Used Flume to transport logs to HDFS.
- Involved in loading and maintenance of NoSQL database.
- Setup Hive with MySQL as a Remote Metastore.
- Modified required UNIX scripts and other SQL validation scripts, and writing scripts in SQL to validate the outputs.
- Automated several jobs for pulling data from FTP server and load data into Hive tables, using Oozie workflows.
- Developed business requirements by reaching the customer directly.
- Collected specifications and requirements and developed Test Plan accordingly.
- Involved in modifying existing Test Cases as per new requirements.
Environment: MapReduce, HDFS, Hive, Pig, Hue, Oozie, Core Java, Eclipse, Hbase, Flume, Cloudera Manager, NOSQL, Mongo DB, PL/SQL, SQL*PLUS, Toad, Putty, UNIX Shell Scripting.
Confidential, Dayton, OH
Java Developer
Responsibilities:
- Involved in various phases of Software Development Life Cycle (SDLC) as design development and unit testing.
- Developed and deployed UI layer logics of sites using JSP, XML, JavaScript, HTML/DHTML, and Ajax.cc
- Designed different design specifications for application development that includes front-end, back-end using design patterns.
- Developed proto-type test screens in HTML and JavaScript.
- Involved in developing JSP for client data presentation and, data validation on the client side with in the forms.
- Developed the application by using the Spring MVC framework.
- Collection framework used to transfer objects between the different layers of the application.
- Developed data mapping to create a communication bridge between various application interfaces using XML, and XSL.
- Spring IOC being used to inject the parameter values for the Dynamic parameters.
- Developed Junit testing framework for Unit level testing.
- Actively involved in code review and bug fixing for improving the performance.
- Documented application for its functionality and its enhanced features.
- Created connection through JDBC and used JDBC statements to call stored procedures.
- Created UML diagrams like use cases, class diagrams, interaction diagrams, and activity diagrams.
- Extensively worked on User Interface for few modules using JSPs, JavaScript and Ajax.
- Created Business Logic using Servlets, POJO's and deployed them on Web logic server.
- Wrote complex SQL queries and stored procedures.
- Developed the XML Schema and Web services for the data maintenance and structures.
- Implemented the Web Service client for the login authentication, credit reports and applicant information using Apache Axis 2 Web Service.
- Designed the logical and physical data model, generated DDL scripts, and wrote DML scripts for Oracle 10g database.
- Used Hibernate ORM framework with spring framework for data persistence and transaction management.
- Used struts validation framework for form level validation.
- Wrote test cases in JUnit for unit testing of classes.
- Involved in creating templates and screens in HTML and JavaScript.
- Involved in integrating Web Services using SOAP.
Environment: Spring MVC, Oracle 11g J2EE, Java, JDBC, Servlets, JSP, XML, CSS, HTML, JavaScript 1.2, Junit, Apache Tomcat, My SQL Server 2008.
Confidential
Java/ J2EE Developer
Responsibilities:
- Developed E-commerce product allowing websites to accept online payments and establish credit relationships.
- Developed application based on SDLC (Software Development Life Cycle). Designed application with UML (Unified Modeling Language).
- Developed complete Web tier of the application with spring framework.
- Developed complete Business tier with Stateless Session Beans. Used Hibernate for Persistence layer. DAO wrapped all the Hibernate functions.
- Developed the automatic build scripts using Ant for the application to deploy and test.
- Used Clear Case for version control.
- Created JUNIT test cases.
Environment: Java EE 5, Struts, spring, Unix, EJB, Http, HTML, XML, XSL, CSS, Servlets, JavaScript, Dojo js, GWT, AJAX, EXT JS Multicast, Hibernate, Cloud computing, JUNIT, JSP, Web services, UML.
Confidential
Analyst
Responsibilities:
- Involved in design, development and testing phases of the application.
- Implemented the application using Java object oriented principles.
- Involved in Administering and Configuring of BEA WebLogic Server v8.1 that runs application Servlets, JSPs, EJBs, etc. Used XSLT to transform test-specific XML scripts into HTML.
- Supported the application by resolving tickets and reviewing the patrol monitoring reports.
Environment: Java, Servlet, JSP, JavaScript, JUnit, HTML, XSLT, XML, BEA WebLogic Server v8.1, Eclipse, Putty.
Confidential
Java Programmer
Responsibilities:
- Worked with requirement analysis team to gather software requirements for application development.
- Designed UML and entity relational diagrams for the process flow and database design.
- Developed Java programs to implement the computational logic for the web applications.
- Implemented model view controller architecture with the help of HTML5, JSPs, Servlets and Java.
- Designed and implemented the database server as backend for web application
- Provided support and maintenance after deploying the web application.
- Designed static web user interface with html and CSS.
- Administered entire end to end flow of the web application.
- Developed custom packages to connect to standard data sources and retrieve data efficiently eliminating the need for each team to rewrite the same set of code multiple times.
- Worked on JavaScript, jQuery for data validation on client side.
- Worked on product deployment, documentation and support.
- Involved in structuring Wiki and Forums for product documentation.
- Involved in R&D, set up and designing Mediawiki, PHP and Joomla content management systems.
- Worked on incorporating LDAP service and Single sign on for the CMS web portal.
- Maintained the customer support portal.
- Worked with Java Messaging Service (JMS).
Environment: Java, Servlets, JSP, JavaScript, HTML, PHP, CSS, Eclipse, CMS.
