We provide IT Staff Augmentation Services!

Sr. Hadoop Developer Resume

2.00/5 (Submit Your Rating)

Cupertino, CaliforniA

PROFESSIONAL SUMMARY:

  • 8+ years Experience in Developing teh Software Lifecycle core areas such as Analysis, Design, Implementation and Deployment of Object Oriented Distributed and Enterprise Applications wif Java/J2EE (7,8 version )technologies.
  • Big Data implementation wif strong experience on major components of Hadoop Ecosystem like Hadoop Map Reduce, HDFS, HIVE, PIG, HBase, Zookeeper, Sqoop, Oozie, Flume, Spark and Storm.
  • Strong experience on Hadoop distributions like Cloudera and MapR.
  • Hands on experience using Sqoop to import data into HDFS from Oracle and vice - versa.
  • Experience in analyzing data using Hive, Pig Latin and custom Map Reduce programs in Java 7,8.
  • Experience in deploying and managing teh Hadoop cluster using Cloudera Manager.
  • Good understanding in processing of real-time data using Spark.
  • Expertise in implementing Scala application using higher order functions for both batch and interactive analysis requirement.
  • Hands on work experience in writing applications on No SQL databases like Cassandra, HBase.
  • Good understanding of Zookeeper and Kafka for monitoring and managing Hadoop jobs.
  • Good knowledge in using job scheduling and monitoring tools like Oozie and Zookeeper.
  • Experience in analyzing data using Hive Query, Pig Latin.
  • Experience in Extraction, Transformation and Loading (ETL) of data from multiple sources like Flat files, XML files, and Databases
  • Used Informatica power Center for ETL processing based on business.
  • Expertise in developing applications using Core Java concepts like OOPS, Multithreading, Garbage Collection.
  • Strong working experience wif Spring Framework, which includes usage of IoC/Dependency.
  • Experience in developing REST/SOAP based web services and API development.
  • Experienced in Web Services approach for Service Oriented Architecture (SOA).
  • Hands on experience on various DB platforms like Oracle and SQL.
  • Experienced wif Agile SCRUM methodology, involved in design discussions and work estimations, takes initiatives, very proactive in solving problems and providing solutions.
  • Good Knowledge on Hadoop Cluster architecture and monitoring teh cluster.
  • In-depth understanding of Data Structure and Algorithms.
  • Experience in managing and troubleshooting Hadoop related issues.
  • Expertise in setting up standards and processes for Hadoop based application design and implementation.
  • Experience in importing and exporting data using Sqoop from Relational Database Systems to HDFS and vice-versa.
  • Experience in managing Hadoop clusters using Cloudera Manager.
  • Hands on experience in VPN, Putty, wisp, Unviewed, etc.

TECHNICAL SKILLS:

Big Data: HDFS, MapReduce, Hive, Pig, Sqoop, Oozie, Zookeeper, Spark, Kafka.

NoSQL Database: HBase, Cassandra.

Programming Languages: C, Core JAVA 7, 8, Scala and Python.

Web technologies: Core Java, JSP, JDBC, Servlets.

Frame works: Strut, Spring, and Hibernate.

Operating system: Linux, Unix, Mac Os, Windows 7/8/9.

PROFESSIONAL EXPERIENCE

Confidential, Cupertino, California

Sr. Hadoop Developer

Responsibilities:

  • Translated teh ETL job to MapReduce job by using teh Informatica.
  • Installed Hadoop, MapReduce, and HDFS and developed multiple MapReduce jobs in PIG and Hive for data cleaning and pre-processing.
  • Worked extensively in creating MapReduce jobs to power data for search and aggregation.
  • Designed a data warehouse using Hive and Importing and exporting data into HDFS and Hive using Sqoop.
  • Used Spark Streaming on Scala to construct learner data model from sensor data using MLib.
  • Developed multiple MapReduce jobs in Core java7 for data cleaning and preprocessing.
  • Worked wif business teams and created Hive queries for ad hoc access.
  • Worked in converting Hive/SQL queries into Spark transformations using Spark RDDs, Core java8.
  • Used Spark to hold teh intermediate results in memory rather TEMPthan writing them to disk while working on teh same dataset multiple times.
  • Implemented Storm topologies to pre-process data before move into HDFS system.
  • Configured Spark streaming to receive real time data from teh Kafka and store teh stream data to HDFS using Scala.
  • Developed PIG scripts for teh analysis of semi structured data.
  • Developed a data pipeline using Kafka and Storm to store data into HDFS.
  • Worked on teh backend using Core Java7 and Spark to perform several aggregation logics.
  • Maintain and develop ETL code written in Core Java7 which pulls data from disparate internal and external sources.
  • Loaded data from Unix File System into HDFS and Worked on Agile methodology for developing teh project.

Environment: Core java7,8, Hadoop, HDFS, Map Reduce, Hive, PIG, Sqoop, Strom, Spark, Kafka, Scala, Unix, SQL, ETL (Informatica).

Confidential, Birmingham, AL

Hadoop Developer

Responsibilities:

  • Designed and developed a components of big data processing using HDFS, MapReduce, PIG, and Hive.
  • Exported data from Oracle using Sqoop and Analyzed data using Hadoop components like Hive and Pig.
  • Wrote MapReduce jobs using Core Java7 to load teh data from system generated log file to oracle database and Configured SQL Database to store Hive Teradata.
  • Developed Pig scripts in teh areas where extensive coding needs to be reduced.
  • Optimized HIVE analytics SQL queries and achieve job performance.
  • To analyze migrated data used Hive data warehouse and developed Hive queries.
  • Developed a data pipeline using Kafka and Storm to store data into HDFS.
  • Design technical solution for real-time analytics using Spark and Hbase for faster testing and processing of data.
  • Monitor teh ETL (Informatica power center) process job and validate teh data loaded in HDFS.
  • Use Spark to analyze point-of-sale data and coupon usage and worked wif ETL (Informatica) tool to filter data based on end requirements.
  • Extensively used Pig for data cleansing and writtenspark programs in Scala and ran spark jobs on yarn.
  • Used Zookeeper to co-ordinate cluster services. Installed Oozie workflow engine to run multiple Hive and Pig jobs.
  • Developed Pig Latin scripts to extract teh data from teh web server output files to load into HDFS.
  • Working knowledge wif ETL (Informatica) tool to filter data based on end requirements.
  • Developed Spark code using Scala and Spark-SQL/Streaming for faster testing and processing of data.
  • Experienced in writing Pig scripts to transform raw data from several data sources into forming baseline data.
  • Involved in loading data from LINUX file system to HDFS using Sqoop and exported teh analyzed data to teh relational databases system.

Environment: Core java7, Hadoop, Map Reduce, HDFS, Hive, Pig, HBase, Sqoop, Zookeeper, Spark, Scala, ETL, Oracle, informatica, Linux.

Confidential, Columbus, OH

Hadoop Developer

Responsibilities:

  • Involved in installing, configuring and managing Hadoop Ecosystem components like HDFS, Hive, Pig, Sqoop and Flume.
  • Worked on Linux shell scripts for business processes and wif loading teh data from different systems to teh HDFS.
  • Used Pig as ETL tool to do Transformations and some pre-aggregations before storing teh data onto HDFS.
  • Developed scripts to automate teh creation Sqoop jobs for various workflows.
  • Involved in generating analytics data using MapReduce programs written in core java.
  • Used Hive data warehouse tool to analyze teh data in HDFS and developed Hive queries.
  • Configured and designed Pig Latin scripts to process teh data into a universal data model.
  • Involved in creating Hive internal and external tables, loaded them wif data and writing hive queries which requires multiple join scenarios.
  • Created partitioned and bucketed tables in Hive based on teh hierarchy of teh dataset.
  • Used Kafka for Log aggregation to collect physical log files from servers and puts them in teh HDFS for further processing.
  • Configured deployed and maintained multi-node Dev and Test Hadoop Clusters.
  • To analyze data migrated to HDFS, used Hive data warehouse tool and developed Hive queries.
  • Implemented Spark using Scala and SparkSQL for faster testing and processing of data.
  • Designed and developed MapReduce programs for data lineage.
  • Migrated teh existing data to Hadoop from RDBMS (SQL Server and Oracle) using Sqoop for processing teh data.
  • Developed workflow in Oozie to automate teh tasks of loading teh data into HDFS and pre-processing wif Pig.
  • Created ETL (Informatica)jobs to generate and distribute reports from MySQL database
  • Responsible for troubleshooting MapReduce jobs by reviewing teh log files.
  • Experienced in loading data from UNIXfile system to HDFS.

Environment: Hadoop, Map Reduce, Hive, PIG, Sqoop, Kafka, Spark, Core java, Oracle, ETL, Linux, UNIX, Shell Scripting.

Confidential

Java/J2EE Developer

Responsibilities:

  • Responsible for development of Business logic in Core Java.
  • Worked wif core java technologies like Multi-Threading and Synchronization.
  • Created RESTful Web service for updating customer data from sent from external systems.
  • Provided Hibernate mapping files for mapping java objects wif database tables
  • Used Spring Framework and XML Bean to build Query service.
  • Used JDBC to invoke Stored Procedures and database connectivity to MYSQL.
  • Developed teh Database interaction classes using JDBC, Core java and Implemented server side tasks using Servlets and XML.
  • Implemented Service and DAO layers in between Struts and Hibernate.
  • Designed and developed teh screens in HTML wif client side validations in JavaScript.
  • Responsible for coding MySQL Statements and Stored procedures for back end communication using JDBC.
  • Developed Restful web services including JSON formats for supporting client requests.
  • Involved in coding using Java Servlets, created web pages using JSP's for generating pages dynamically.
  • Worked on teh JAVA CollectionsAPI for handling teh data objects between teh business layers and teh front end.
  • Used JPA, hibernate combination to access data from ORACLE database using POJOs for coding simplicity.
  • Implemented various Soap and REST services as a part of teh application.
  • Extensive use of Struts Framework for Controller components and view components.
  • Maven was used for building and Jenkins to run teh periodic builds and tests of teh application.

Environment: Java, Apache Tomcat, JSF, J2EE, Eclipse, JDBC, Java Script, XML, Oracle, SQL/PLSQL, spring, Hibernate, Soap, Rest, Struct, Json.

Confidential

Jr Java Developer

Responsibilities:

  • Installation, Configuration & Upgrade of Solaris and Linux operating system.
  • Actively participated in requirements gathering, analysis, design, and testing phases
  • Designed use case diagrams, class diagrams, and sequence diagrams as a part of Design Phase
  • Developed teh entire application implementing MVC Architecture integrating JSF wif Hibernate and spring frameworks.
  • UsedPythonand Django creating for XML processing, data exchange and business logic implementation.
  • Developed teh Enterprise Java Beans (Stateless Session beans) to handle different transactions such as online funds transfer, bill payments to teh service providers.
  • Implemented Service Oriented Architecture (SOA) using JMS for sending and receiving messages while creating web services
  • Developed XML documents and generated XSL files for Payment Transaction and Reserve Transaction systems.
  • Developed SQL queries and stored procedures.
  • Developed Web Services for data transfer from client to server and vice versa using Apache Axis, SOAP and WSDL.
  • Used JUnit Framework for teh unit testing of all teh java classes.
  • Implemented various J2EE Design patterns like Singleton, Service Locator, DAO, and SOA.
  • Worked on AJAX to develop an interactive Web Application and JavaScript for Data Validations.
  • Developed teh application under JEE architecture, developed Designed dynamic and browser compatible user interfaces using JSP, Custom Tags, HTML, CSS, and JavaScript.
  • Deployed & maintained teh JSP, Servlets components on Web logic 8.0
  • Developed Application Servers persistence layer using, JDBC, SQL, Hibernate.
  • Used JDBC to connect teh web applications to Data Bases.
  • Implemented Test First unit testing framework driven using Junit.
  • Developed and utilized J2EE Services and JMS components for messaging communication in Web Logic.
  • Configured development environment using Web logic application server for developer’s integration testing.

Environment:Java/J2EE, SQL, Oracle 10g, JSP 2.0, EJB, AJAX, Java Script, Web Logic 8.0, HTML, JDBC 3.0, XML, JMS, log4j, Junit, Servlets, MVC, My Eclipse

We'd love your feedback!