Hadoop Developer Resume
Atlanta, GA
SUMMARY
- Overall 7 years of experience in analysis, design, development, testing, maintenance and user training of software applications, including over 2 years in Big Data, Hadoop, Spark and HDFS environments and around 4 years in Java and J2EE
- 2+ years of experience with the Hadoop ecosystem, including Java MapReduce, Hive, HBase, ZooKeeper, Pig, Sqoop, Storm, Kafka, Oozie and Flume
- 1 year of experience in Spark and Scala
- 4+ years of software development and maintenance of object-oriented, client/server applications with Java and J2EE technologies
- Worked on structured and semi-structured data processing using Hadoop
- Developed analytical components using Scala, Spark and Spark Streaming
- Hands-on experience importing and exporting data between relational databases and HDFS, Hive and HBase using Sqoop
- Analyzed large data sets by writing Pig scripts and Hive queries
- Experienced in writing MapReduce programs and UDFs for both Hive and Pig in Java
- Used Flume to channel data from different sources to HDFS
- Supported MapReduce programs running on the cluster and wrote custom MapReduce scripts for data processing in Java
- Worked with the Oozie workflow engine to schedule time-based jobs that perform multiple actions
- Expertise in relational database systems such as Oracle, SQL Server 2005/2008, MS Access, Teradata and DB2; highly proficient in writing, testing and implementing triggers, stored procedures, functions, packages and cursors using PL/SQL
- Knowledge of ETL and data warehousing
- Experience in UNIX Shell Scripting
- Knowledge of Amazon Web Services (AWS)
- Strong exposure to object-oriented design principles, design patterns and data structures
- Knowledge of build and project management tools such as Ant and Maven
- Experienced in implementing web-based, enterprise-level applications in Java using J2EE frameworks such as Hibernate and Spring
- Experienced in implementing RESTful and SOAP web services in workflow and enterprise applications
- Knowledge of Software Development Life Cycle (SDLC) methodologies such as Agile and Waterfall
- Good knowledge of the full software development life cycle for Java applications, including analysis, requirements gathering, design, development, implementation, maintenance and enhancements
- Knowledge of UML for visualizing and documenting artifacts using use cases, use case diagrams, activity diagrams, class diagrams, data flow diagrams, business flow diagrams and sequence diagrams in MS Visio
- Excellent interpersonal skills, experience interacting with clients, and strong teamwork and problem-solving skills
TECHNICAL SKILLS
Hadoop/Big Data Technologies: HDFS, MapReduce, YARN, Hive, HBase, Pig, ZooKeeper, Sqoop, Flume, Spark, Storm, Kafka
Java Technologies: J2SE, J2EE - JSP, Servlets, JDBC, JUnit, RMI, JMS
Web Technologies: AJAX, JavaScript, jQuery, HTML5, CSS3, XML
Data Warehousing ETL: Informatica Power Center, Power Analyzer, Power Exchange, Power Connect, Teradata Load utilities
Framework: Spring, Struts, Hibernate
Scheduler: Oozie
Databases: MySQL, Oracle 11g/10g, SQL Server 2008/2005/2000, Teradata 13/12, MS Access 2000
Operating Systems: UNIX, Windows, Linux, VMware
Unix Tools: Apache, Yum
Web/Application Servers: WebLogic, Tomcat
Web Services: SOAP, REST
Development Tools: Maven, Ant, JUnit, Log4J
Tools: Eclipse, Subversion (SVN), TortoiseSVN, SVN Eclipse plugin
Methodology: Waterfall, Agile
Environment: Sun Solaris 2.6/2.7, MS DOS 6.22, Linux, Windows 2000/2003/X
PROFESSIONAL EXPERIENCE
Confidential, Atlanta, GA
Hadoop Developer
Responsibilities:
- Worked with Hortonworks Data Platform. Used the Invenio application to subscribe to data published on Data Router and deliver it to NFS and HDFS
- Ingested data using an ingestion framework, which involved creating Hive tables with partitioning and bucketing
- Implemented advanced procedures such as text analytics and processing using the in-memory computing capabilities of Apache Spark, written in Scala
- Worked on reading multiple data formats on HDFS using Spark
- Involved in converting Hive/SQL queries into Spark transformations using Spark RDDs, Python and Scala
- Imported data from various sources, performed transformations using Hive, MapReduce and Spark, and loaded the data into HDFS
- Implemented preprocessing of data using MapReduce programming for business transformations on HDFS
- Developed simple to complex MapReduce jobs using Hive and Pig
- Transformed raw XML files into a Hive-storable format and protected sensitive data by performing encryption and decryption in Java
- Used HBase data to meet real-time requirements for call records
- Used Pig scripts for data extraction and cleansed data after the required retention period using Hive and shell scripts
- Brought Apache Kafka streaming data into the server, processed it in real time using Apache Storm, and stored the data in HBase
- Successfully loaded files to Hive and HDFS from Oracle using Sqoop
- Optimized Hive queries using partitioning and bucketing techniques to control data distribution
- Created Hive generic UDFs to process business logic that varies by policy
- Worked on custom Pig loaders and storage classes to handle a variety of data formats such as JSON and XML
- Used the Oozie workflow engine to manage interdependent Hadoop jobs and to automate several types of Hadoop jobs such as MapReduce, Hive, Pig and Sqoop
- Experienced in running Hadoop streaming jobs to process terabytes of XML data
- As a subject matter expert on data, helped the data analysts and instrumented the data sourcing process by auditing and managing jobs in Oracle tables and monitoring health status via job logs and the JobTracker URL
- Experienced in managing and reviewing Hadoop log files
Environment: Red Hat 6.5, Java 1.7, Scala 2.11, Hadoop 2.2, HDFS 2.6, YARN, Spark 1.5.2, Hive 0.14, Pig, HBase 0.98, Kafka 0.8, Storm 0.9, SVN, Eclipse, SQL Developer 4.1, Tortoise 1.6, Oracle, Sqoop 0.9, Maven, XML, JSON
Confidential, Kansas City, MO
Hadoop Developer
Responsibilities:
- Installed and configured Hadoop MapReduce and HDFS; developed multiple MapReduce jobs in Java for data cleaning and preprocessing
- Imported and exported data into HDFS and Hive using Sqoop
- Experience in defining job flows using Oozie
- Involved in managing and reviewing Hadoop log files
- Extracted data from RDBMS using Sqoop and stored it in HDFS
- Involved in running Hadoop streaming jobs to process terabytes of XML data
- Good experience with NoSQL databases
- Involved in creating Hive tables, loading data and writing Hive queries
- Replaced default Derby metadata storage system of Hive with MySQL system
- Developed Pig Latin scripts to extract data from web server output files and load it into HDFS
- Developed Pig UDFs to preprocess the data for analysis
- Developed Hive queries for the analysts
- Involved in loading data from Linux and Unix file systems to HDFS
- Involved in loading and transforming large sets of structured, semi-structured and unstructured data
- Worked with various Hadoop file formats, including TextFile and SequenceFile
- Involved in setting up the QA environment and updating configurations for implementing scripts with Pig
Environment: Hadoop, Hive, HBase, MapReduce, HDFS, Pig, Java (JDK 1.6), Cloudera Distribution of Hadoop, Oracle 11g/10g, PL/SQL, SQL*Plus, Toad 9.6, Unix Shell Scripting
Confidential, Miami, FL
Java/ J2EE Developer
Responsibilities:
- Involved in analyzing, testing, debugging, documenting and implementing moderately complex software applications
- Implemented projects by using Spring Web MVC module
- Devised and modified procedures to solve complex problems considering computer equipment capacity and limitations, operating time, and form of desired results
- Implemented, integrated and maintained client-server and web-related applications
- Implemented projects using Ajax, JavaScript and HTML as UI components
- Implemented projects using Web Services to communicate with other systems
- Used the Spring framework for the business layer, with Spring Core, DAO, Spring ORM and Spring Web MVC modules
- Implemented projects using the Hibernate framework to communicate with the database
- Used SVN for version control to maintain the code repository
- Developed test cases using the JUnit testing framework and used Log4J for logging and audit trails
Environment: Java 1.6, Servlets, JSP, Spring MVC, Hibernate, XML, JAXB, HTML, Ajax, JavaScript, JNDI, Web Services, WSDL, SOAP, SQL, PL/SQL, Oracle 11g, Eclipse, SQL Developer, Log4j, JUnit, Maven, SVN
Confidential
Java/ J2EE Developer
Responsibilities:
- Analyzed the specifications provided by the client, was actively involved in data flow and worked on report generation
- Designed and developed the UI using Struts view component, JSP, HTML and JavaScript
- Worked with web development technologies such as HTML and XML and used the Log4J framework for logging messages
- Created SOAP and RESTful web services to interface with various applications
- Developed database interaction code using the JDBC API, making extensive use of SQL
- Performed various database programming tasks such as creating SQL queries, analyzing stored procedures, persisting objects and creating the Data Access Object layer using the Hibernate API
- Wrote SQL queries and invoked them through JDBC to access related data from the database
- Developed Servlets as controllers to perform requisite functions; built the application using Maven
- Involved in development under both waterfall and agile software development life cycle methodologies
- Interacted with the clients to understand change requests (CRs) for modules
- Involved in debugging the code and resolving the issues found in Production
- Finalized the XML structure for the data store and the template XML tasks
- Deployed the system and fixed bugs in server-side functionality
- Developed Java code for the Struts implementation, JSPs and an automation tool for report distribution
Environment: Java, J2EE, JSP, Servlets, HTML, JavaScript, Tomcat, Eclipse, Java Beans, JDBC, XML, UNIX, Red Hat Linux
