Hadoop Developer Resume
Charlotte, NC
SUMMARY
- An information technology professional having 8+ years of Industry Experience as a Big Data Technical Consultant
- In depth understanding/knowledge of Hadoop Architecture and its components such as HDFS, Job Tracker, Task Tracker, Name Node, Data Node and MapReduce.
- Experienced in Waterfall & Agile development methodology.
- Expertise in writing HadoopJobs for analyzing data using MapReduce, Hive and Pig
- Experience in importing and exporting data using Sqoop from HDFS to Relational Database Systems and vice - versa
- Experienced in extending Hive and Pig core functionality by writing custom UDFs using Java.
- Experience with developing large-scale distributed applications.
- Experience in developing solutions to analyze large data sets efficiently
- Experience in Data Warehousing and ETLprocesses.
- Knowledge of Star Schema Modeling, and Snowflake modeling, FACT and Dimensions tables, physical and logical modeling.
- Strong database, SQL, ETL and data analysis skills.
- Good understanding of Data Mining and Machine Learning techniques
- Experienced in NoSQL databases such as HBase, and MongoDB
- Experienced in job workflow scheduling and monitoring tools like Oozie and Zookeeper
- Knowledge of administrative tasks such as installing Hadoop and its ecosystem components such as Hive and Pig
- Experienced in developing applications using all Java/J2EE technologies like Servlets, JSP, EJB, JDBC, JNDI, JMS etc.
- Experienced in developing applicationsusing HIBERNATE (Object/Relational mapping framework).
- Experienced in developing Web Services using JAX-RPC, JAXP, SOAP and WSDL. Also knowledgeable in using WSIF (Web Services Invocation Framework) API.
- Thorough knowledge and experience of XML technologies (DOM, SAX parsers), and extensive experience with XPath, XML schema, DTD’s, XSLT, XML SPY,MAPForce editor.
- Experience in Message based systems using JMS,TIBCO & MQSeries.
- Experience in writing database objects like Stored Procedures, Triggers, SQL, PL/SQL packages and Cursors for Oracle, SQL Server, DB2 and Sybase.
- Proficient in writing build scripts using Ant & Maven.
- Experienced in using CVS, SVN and Sharepoint as version manager.
- Proficient in unit testing the application using Junit, MRUnitandlogging the application using Log4J.
- Ability to learn and adapt quickly and to correctly apply new tools and technology.
- Strong communication and analytical skills with very good experience in programming &problem solving.
TECHNICAL SKILLS
- Hortonworks Hadoop
- Big Data
- Ambari
- HDFS
- Map Reduce
- Hive
- HCatalog Pig
- Sqoop
- Spark
- Flume
- Oozie
- Kafka
- ZooKeeper
- HBase
- Cassandra
- Teradata
- MS SQL Server
- Oracle
- Informix
- Sybase
- Informatica
- Datastage
- JAVA
- J2EE
- Spring
- Hibernate
- EJB
- Webservices
- JAX-RPC
- JAXP
- JAXM
- JMS
- JNDI
- Servlets
- JSP
- Jakarta Struts
- BEA Web Logic
- IBM Websphere
- JBoss
- Tomcat
- UML
- OOAD
- HTML
- AJAX
- CSS
- XHTML
- XML
- XSL
- XSLT
- WSDL
- SOAP
- CVS
- SVN
- SharePoint
- Clear Case
- Clear Quest
- Win CVS
- Junit
- TDD
- MRUnit
- Ant
- Maven
- Log4j
- FrontPage
- Eclipse
- NetBeans
- Linux
- UNIX
- Windows
PROFESSIONAL EXPERIENCE
Confidential - Charlotte, NC
Hadoop Developer
Responsibilities:
- Developed simple and complex MapReduce programs in Java for Data Analysis on different data formats.
- Developed MapReduce programs that filter bad and un-necessary claim records and find out unique records based on account type.
- Processed semi, unstructured data using Map Reduce programs.
- Implemented Daily Cron jobs that automate parallel tasks of loading the data into HDFS and pre-processing with Pig using Oozie coordinator jobs.
- Implemented custom DataTypes, InputFormat, RecordReader, OutputFormatand RecordWriter for MapReduce computations.
- Successfully migrated Legacy application to Big Data application using Hive/Pig in Production level.
- Developed MapReduce pipeline for feature extraction and tested the modules using MRUnit
- Optimized MapReduce jobs to use HDFS efficiently by using various compression mechanisms
- Creating Hive tables, loading with data and writing Hive queries which will run internally in MapReduce way.
- Responsible for performing extensive data validation using Hive.
- Implemented Partitioning, Dynamic Partitions and Bucketing in Hive for efficient data access.
- Worked on different set of tables like External Tables and Managed Tables.
- Used Oozie workflow engine to run multiple Hive and Pig jobs.
- Used DML statements to perform different operations on Hive Tables.
- Developed Hive queries for creating foundation tables from stage data.
- Used Pig as ETL tool to do transformations, event joins, filter and some pre-aggregations.
- Analyzed the data by performing Hive queries and running Pig scripts to study customer behavior.
- Loaded home mortgage data from the existing DWH tables (SQL Server) to HDFS using Scoop.
- Wrote Hive Queries to have a consolidated view of the mortgage and retail data.
- Data is loaded back to the Teradata for the BASEL reporting and for the business users to analyze and visualize the data using Datameer.
- Orchestrated hundreds of sqoop scripts, pig scripts, hive queries using oozie workflows and sub-workflows.
- Loaded the load ready files from mainframes to Hadoop and files were converted to ASCII format.
- Implemented business logic by writing Pig UDFs in Java and used various UDFs from Piggybanks and other sources.
- Worked with Sqoop to export data from Teradata, Netezza environment into HDFS.
- Proof of concept using Spark and Scala fordataanalysis.
- Responsible for implementation of one of thedatasource transformations in spark.
- Involved in Agile methodology and used JIRA for issue & project tracking.
Environment: Hortonworks Hadoop, Ambari, HDFS, MapReduce, Spark, Java (jdk1.6), Linux, Sqoop, Apache Hive, HCatalog, Apache Pig, Oozie.
Confidential, San Jose, CA
Hadoop Developer
Responsibilities:
- Importing and exporting data into HDFS and Hive from different RDBMS using Sqoop.
- Experienced in defining job flows to run multiple MapReduce and Pig jobs using Oozie
- Importing log files using Flume into HDFS and load into Hive tables to query data
- Used HBase-Hive integration, written multiple Hive UDFs for complex queries
- Involved in writing APIs to Read HBase tables, cleanse data and write to another HBase table
- Created multiple Hive tables, implemented Partitioning, Dynamic Partitioning and Buckets in Hive for efficient data access.
- Written multiple MapReduce programs in Java for data extraction, transformation and aggregation from multiple file formats including XML, JSON, CSV and other compressed file formats.
- Experienced in running batch processes using Pig Scripts and developed Pig UDFs for data manipulation according to Business Requirements
- Experienced in writing programs using HBase Client API
- Involved in loading data into HBase using HBase Shell, HBase Client API, Pig and Sqoop
- Experienced in design, development, tuning and maintenance of NoSQL database
- Developed unit test cases for HadoopMapReduce jobs with MRUnit
- Excellent experience in ETL analysis, designing, developing, testing and implementing ETL processes including performance tuning and query optimizing of database
Environment: Hortonworks Hadoop, Hive, Pig, HBase, Sqoop, Flume, Java, Linux, MySQL Server5.155, MS SQL Server 2012, SQL, PL/SQL, SQL Server Data Tools.
Confidential, Chicago, IL
Java Developrer
Responsibilities:
- Involved in design process using UML & RUP (Rational Unified Process).
- Developed different Components and Adapters of the integration framework using Stateless Session EJB.
- Developed various interfaces to integrate the SAP and various legacy systems such as PMWeb, CCWeb, Etime, EEDS,JDEdwardsetc.
- Developed different interfaces using EJB Session Beans (Stateless) and Message Driven Beans for both synchronous and asynchronous communication.
- Developed various interfaces such as FILE-RFC, RFC-FILE, JMS-RFC, RFC-Database using EJB, JMS, JDBC, JNDI, JCO, Java-Proxy and Webservices.
- JDBC Adapter was developed using Hibernate for persisting and retrieving the data in the database.
- Developed Webservices using SOAP, WSDL, JAXP and AXIS to integrate with the external systems.
- Internal data representation was made in XML format.
- Business validation and mapping rules were applied on the XML data and transformed to the target XML structure.
- DOM parser was largely used in performing these operations.
- Implemented J2EE Design patterns such as Session Façade, MVC, Business Delegate, Value Object, Data Access Object etc.
- Extensively interacted with SAPfunctional and technicalteams in resolving technical and functional issues.
- Effectively performed coderefactoring to modularize the code and improve error handling and fault tolerance.
- Provided second level and third level of production support in resolving issues relating to the interfaces.
- Developed the interfaces using Eclipse. Deployed the application in SAPWeb Application Server.
- Actively involved in configuration management tool CVS in managing the code.
- Followed Test Driven Developement
- Worked on Unit and Integration testing of the interfaces.
- Involved in designing test plans, test cases and overall Unit and Integration testing of system.
Environment: EJB, JSP, Struts, Webservices, JMS, JNDI, JDBC, SAP Webapplication Server, Eclipse, Hibernate, SAP XI, SQL, Sybase, XML, XSD, WSDL, SOAP, CVS, Win 2003 Server.
Confidential, Chicago, IL
Java Developer
Responsibilities:
- Analyzing the business requirements and doing the GAP analysis then transforming them to detailed design specifications.
- Involved in design process using UML & RUP (Rational Unified Process).
- Performed Code Reviews and responsible for Design, Code and Test signoff.
- Assisting the team in development, clarifying on design issues and fixing the issues.
- Involved in designing test plans, test cases and overall Unit and Integration testing of system.
- Development of the logic for the Business tier using Session Beans (Stateful and Stateless).
- Developed Web Services using JAX-RPC, JAXP, WSDL, SOAP, XML to provide facility to obtain quote, receive updates to the quote, customer information, status updates and confirmations.
- Responsible for Design and development of web services to test the security aspects of Web Services enabled CICSTransaction Gateway.
- Extensivelyused SQL queries, PL/SQL stored procedures & triggers in data retrieval and updating of information in the Oracle database using JDBC.
- Expert in writing, configuring and maintaining the Hibernate configuration files and writing and updating Hibernate mapping files for each Java object to be persisted.
- Expert in writing Hibernate Query Language (HQL) and Tuning the hibernate queries for better performance.
- Used the design patterns such as Session Façade, Command, Adapter, Business Delegate, Data Access Object, Value Object and Transfer Object.
- Deployed the application in Weblogic and used Weblogic Workshop for development and testing.
- Involved in application performance tuning (code refractory).
- Writing test cases using JUNIT, doing test first development.
- Used Rational Clear Case & PVCS for source control. Also used Clear Quest for defect management.
- Writing build files using ANT. Used Maven in conjunction with ANT to manage build files.
- Running the nightly builds to deploy the application on different servers.
Environment: EJB, Webservices, Hibernate, Struts, JSP, JMS, JNDI, JDBC, Weblogic, SQL, PL/SQL, Oracle, Sybase, XML, XSLT, WSDL, SOAP, UML, Rational Rose, Weblogic Workshop, OptimizeIt, Ant, JUnit, ClearCase, PVCS, ClearQuest, Win XP, Linux.
Confidential, Palo Alto, CA
Sr. Java Developer
Responsibilities:
- Involved in preparation of Functional document, Technical Document related to the application.
- Software methodology used is Agile.
- Used Struts based Application framework for design of the business layer of the application.
- Involved in Interacting with the business analysts, infrastructure team, database team, testing team for understanding requirements, environment setup, interfacing with database, defect fixing respectively.
- Involved in implementing user interface using JSP, HTML and JavaScript.
- Developed DAO Layer for portal side.
- Developed Action classes for displaying Graphs for Statistics data.
- Involved in Developing Web service classes for portal to interact with Data Center.
- Involved in Developing Web Service classes and Stub classes.
- Writing JUnit test cases for each scenario of project.
- JAXB was used to Marsherlling and Un-Marsherlling.
- Configured Web Logic server for Data Source and Creating Domain.
- Actively involved in tuning SQL queries, Stored Procedures, Triggers.
- Client and Server side validations using JavaScript and Struts Validation Framework.
- Communicating with testers, giving clarifications and resolving the problems.
- Attending Daily calls from Client and Stand up (Status) meeting every day.
Environment: Java 6.0, JSP, Struts 1.2, EJB, XML, JDBC, HTML, JavaScript, Oracle 10g, Web Logic Server, Maven, Web service (Axis), SVN, Linux OS and Junit.
Confidential
Analyst Programmer
Responsibilities:
- Involved in gathering and analyzing system requirements.
- Used Multi-threading and exceptional handling in the development of applications.
- The application is based on the MVC-II Architecture using Apache Struts framework.
- Involved in all phases of SDLC including requirements Collection,Design and analysis of thecustomer specifications from business analyst.
- Designed and developed user interface screens using HTML, DHTML and JSP.
- Developed and maintained the data layer using the EJB’s.
- Created and maintained the configuration of the Application Framework.
- Eclipse used as Java IDE tool for creating Action classes and XML files.
- Involved in maintenance of the application.
- Used Bugzilla as a Bug Reporting Tool.
Environment: Unix Shell scripting, Core Java, Struts, EJB, J2EE, JBoss Application server and Oracle, JSP, HTML, JavaScript, JDBC, JSP, Servlet, Unified Modeling Language, Xml, JMS (Persistent storage using Queues and Topics) using SQL statements.
