Sr. Hadoop Developer Resume Cupertino, California - Hire IT People

SUMMARY

8+ years Experience in Developing the Software Lifecycle core areas such as Analysis, Design, Implementation and Deployment of Object Oriented Distributed and Enterprise Applications with Java/J2EE (7,8 version )technologies.
Big Data implementation with strong experience on major components of Hadoop Ecosystem like Hadoop Map Reduce, HDFS, HIVE, PIG, HBase, Zookeeper, Sqoop, Oozie, Flume, Spark and Storm.
Strong experience on Hadoop distributions like Cloudera and MapR.
Hands on experience using Sqoop to import data into HDFS from Oracle and vice - versa.
Experience in analyzing data using Hive, Pig Latin and custom Map Reduce programs in Java 7,8.
Experience in deploying and managing the Hadoop cluster using Cloudera Manager.
Good understanding in processing of real-time data using Spark.
Expertise in implementing Scala application using higher order functions for both batch and interactive analysis requirement.
Hands on work experience in writing applications on No SQL databases like Cassandra, HBase.
Good understanding of Zookeeper and Kafka for monitoring and managing Hadoop jobs.
Good knowledge in using job scheduling and monitoring tools like Oozie and Zookeeper.
Experience in analyzing data using Hive Query, Pig Latin.
Experience in Extraction, Transformation and Loading (ETL) of data from multiple sources like Flat files, XML files, and Databases
Used Informatica power Center for ETL processing based on business.
Expertise in developing applications using Core Java concepts like OOPS, Multithreading, Garbage Collection.
Strong working experience with Spring Framework, which includes usage of IoC/Dependency.
Experience in developing REST/SOAP based web services and API development.
Experienced in Web Services approach for Service Oriented Architecture (SOA).
Hands on experience on various DB platforms like Oracle and SQL.
Experienced with Agile SCRUM methodology, involved in design discussions and work estimations, takes initiatives, very proactive in solving problems and providing solutions.
Good Knowledge on Hadoop Cluster architecture and monitoring the cluster.
In-depth understanding of Data Structure and Algorithms.
Experience in managing and troubleshooting Hadoop related issues.
Expertise in setting up standards and processes for Hadoop based application design and implementation.
Experience in importing and exporting data using Sqoop from Relational Database Systems to HDFS and vice-versa.
Experience in managing Hadoop clusters using Cloudera Manager.
Hands on experience in VPN, Putty, wisp, Unviewed, etc.

TECHNICAL SKILLS

Big Data: HDFS, MapReduce, Hive, Pig, Sqoop, Oozie, Zookeeper, Spark, Kafka.

NoSQL Database: HBase, Cassandra.

Programming Languages: C, Core JAVA 7, 8, Scala and Python.

Web technologies: Core Java, JSP, JDBC, Servlets.

Frame works: Strut, Spring, and Hibernate.

Operating system: Linux, Unix, Mac Os, Windows 7/8/9.

PROFESSIONAL EXPERIENCE

Confidential, Cupertino, California

Sr. Hadoop Developer

Responsibilities:

Translated the ETL job to MapReduce job by using the Informatica.
Installed Hadoop, MapReduce, and HDFS and developed multiple MapReduce jobs in PIG and Hive for data cleaning and pre-processing.
Worked extensively in creating MapReduce jobs to power data for search and aggregation.
Designed a data warehouse using Hive and Importing and exporting data into HDFS and Hive using Sqoop.
Used Spark Streaming on Scala to construct learner data model from sensor data using MLib.
Developed multiple MapReduce jobs in Core java7 for data cleaning and preprocessing.
Worked with business teams and created Hive queries for ad hoc access.
Worked in converting Hive/SQL queries into Spark transformations using Spark RDDs, Core java8.
Used Spark to hold the intermediate results in memory rather than writing them to disk while working on the same dataset multiple times.
Implemented Storm topologies to pre-process data before move into HDFS system.
Configured Spark streaming to receive real time data from the Kafka and store the stream data to HDFS using Scala.
Developed PIG scripts for the analysis of semi structured data.
Developed a data pipeline using Kafka and Storm to store data into HDFS.
Worked on the backend using Core Java7 and Spark to perform several aggregation logics.
Maintain and develop ETL code written in Core Java7 which pulls data from disparate internal and external sources.
Loaded data from Unix File System into HDFS and Worked on Agile methodology for developing the project.

Environment: Core java7,8, Hadoop, HDFS, Map Reduce, Hive, PIG, Sqoop, Strom, Spark, Kafka, Scala, Unix, SQL, ETL (Informatica).

Confidential, Birmingham, AL

Hadoop Developer

Responsibilities:

Designed and developed a components of big data processing using HDFS, MapReduce, PIG, and Hive.
Exported data from Oracle using Sqoop and Analyzed data using Hadoop components like Hive and Pig.
Wrote MapReduce jobs using Core Java7 to load the data from system generated log file to oracle database and Configured SQL Database to store Hive Teradata.
Developed Pig scripts in the areas where extensive coding needs to be reduced.
Optimized HIVE analytics SQL queries and achieve job performance.
To analyze migrated data used Hive data warehouse and developed Hive queries.
Developed a data pipeline using Kafka and Storm to store data into HDFS.
Design technical solution for real-time analytics using Spark and Hbase for faster testing and processing of data.
Monitor the ETL (Informatica power center) process job and validate the data loaded in HDFS.
Use Spark to analyze point-of-sale data and coupon usage and worked with ETL (Informatica) tool to filter data based on end requirements.
Extensively used Pig for data cleansing and writtenspark programs in Scala and ran spark jobs on yarn.
Used Zookeeper to co-ordinate cluster services. Installed Oozie workflow engine to run multiple Hive and Pig jobs.
Developed Pig Latin scripts to extract the data from the web server output files to load into HDFS.
Working knowledge with ETL (Informatica) tool to filter data based on end requirements.
Developed Spark code using Scala and Spark-SQL/Streaming for faster testing and processing of data.
Experienced in writing Pig scripts to transform raw data from several data sources into forming baseline data.
Involved in loading data from LINUX file system to HDFS using Sqoop and exported the analyzed data to the relational databases system.

Environment: Core java7, Hadoop, Map Reduce, HDFS, Hive, Pig, HBase, Sqoop, Zookeeper, Spark, Scala, ETL, Oracle, informatica, Linux.

Confidential, Columbus, OH

Hadoop Developer

Responsibilities:

Involved in installing, configuring and managing Hadoop Ecosystem components like HDFS, Hive, Pig, Sqoop and Flume.
Worked on Linux shell scripts for business processes and with loading the data from different systems to the HDFS.
Used Pig as ETL tool to do Transformations and some pre-aggregations before storing the data onto HDFS.
Developed scripts to automate the creation Sqoop jobs for various workflows.
Involved in generating analytics data using MapReduce programs written in core java.
Used Hive data warehouse tool to analyze the data in HDFS and developed Hive queries.
Configured and designed Pig Latin scripts to process the data into a universal data model.
Involved in creating Hive internal and external tables, loaded them with data and writing hive queries which requires multiple join scenarios.
Created partitioned and bucketed tables in Hive based on the hierarchy of the dataset.
Used Kafka for Log aggregation to collect physical log files from servers and puts them in the HDFS for further processing.
Configured deployed and maintained multi-node Dev and Test Hadoop Clusters.
To analyze data migrated to HDFS, used Hive data warehouse tool and developed Hive queries.
Implemented Spark using Scala and SparkSQL for faster testing and processing of data.
Designed and developed MapReduce programs for data lineage.
Migrated the existing data to Hadoop from RDBMS (SQL Server and Oracle) using Sqoop for processing the data.
Developed workflow in Oozie to automate the tasks of loading the data into HDFS and pre-processing with Pig.
Created ETL (Informatica)jobs to generate and distribute reports from MySQL database
Responsible for troubleshooting MapReduce jobs by reviewing the log files.
Experienced in loading data from UNIXfile system to HDFS.

Environment: Hadoop, Map Reduce, Hive, PIG, Sqoop, Kafka, Spark, Core java, Oracle, ETL, Linux, UNIX, Shell Scripting.

Confidential

Java/J2EE Developer

Responsibilities:

Responsible for development of Business logic in Core Java.
Worked with core java technologies like Multi-Threading and Synchronization.
Created RESTful Web service for updating customer data from sent from external systems.
Provided Hibernate mapping files for mapping java objects with database tables
Used Spring Framework and XML Bean to build Query service.
Used JDBC to invoke Stored Procedures and database connectivity to MYSQL.
Developed the Database interaction classes using JDBC, Core java and Implemented server side tasks using Servlets and XML.
Implemented Service and DAO layers in between Struts and Hibernate.
Designed and developed the screens in HTML with client side validations in JavaScript.
Responsible for coding MySQL Statements and Stored procedures for back end communication using JDBC.
Developed Restful web services including JSON formats for supporting client requests.
Involved in coding using Java Servlets, created web pages using JSP's for generating pages dynamically.
Worked on the JAVA CollectionsAPI for handling the data objects between the business layers and the front end.
Used JPA, hibernate combination to access data from ORACLE database using POJOs for coding simplicity.
Implemented various Soap and REST services as a part of the application.
Extensive use of Struts Framework for Controller components and view components.
Maven was used for building and Jenkins to run the periodic builds and tests of the application.

Environment: Java, Apache Tomcat, JSF, J2EE, Eclipse, JDBC, Java Script, XML, Oracle, SQL/PLSQL, spring, Hibernate, Soap, Rest, Struct, Json.

Confidential

Jr Java Developer

Responsibilities:

Installation, Configuration & Upgrade of Solaris and Linux operating system.
Actively participated in requirements gathering, analysis, design, and testing phases
Designed use case diagrams, class diagrams, and sequence diagrams as a part of Design Phase
Developed the entire application implementing MVC Architecture integrating JSF with Hibernate and spring frameworks.
UsedPythonand Django creating for XML processing, data exchange and business logic implementation.
Developed the Enterprise Java Beans (Stateless Session beans) to handle different transactions such as online funds transfer, bill payments to the service providers.
Implemented Service Oriented Architecture (SOA) using JMS for sending and receiving messages while creating web services
Developed XML documents and generated XSL files for Payment Transaction and Reserve Transaction systems.
Developed SQL queries and stored procedures.
Developed Web Services for data transfer from client to server and vice versa using Apache Axis, SOAP and WSDL.
Used JUnit Framework for the unit testing of all the java classes.
Implemented various J2EE Design patterns like Singleton, Service Locator, DAO, and SOA.
Worked on AJAX to develop an interactive Web Application and JavaScript for Data Validations.
Developed the application under JEE architecture, developed Designed dynamic and browser compatible user interfaces using JSP, Custom Tags, HTML, CSS, and JavaScript.
Deployed & maintained the JSP, Servlets components on Web logic 8.0
Developed Application Servers persistence layer using, JDBC, SQL, Hibernate.
Used JDBC to connect the web applications to Data Bases.
Implemented Test First unit testing framework driven using Junit.
Developed and utilized J2EE Services and JMS components for messaging communication in Web Logic.
Configured development environment using Web logic application server for developer’s integration testing.

Environment: Java/J2EE, SQL, Oracle 10g, JSP 2.0, EJB, AJAX, Java Script, Web Logic 8.0, HTML, JDBC 3.0, XML, JMS, log4j, Junit, Servlets, MVC, My Eclipse

We provide IT Staff Augmentation Services!

Sr. Hadoop Developer Resume

Cupertino, CaliforniA

We'd love your feedback!

Resume Categories

Client Services

Job Seekers

Visa Sponsorship