We provide IT Staff Augmentation Services!

Hadoop Developer Resume

2.00/5 (Submit Your Rating)

Palo Alto, CA

SUMMARY

  • 7 Years of experience in IT which includes 4 years of experience in Big Data ecosystem related technologies.
  • Proficient experience in Installation, configuration, supporting and managing of Big Data and underlying infrastructure of Hadoop Cluster.
  • Hands on exposure on major components in Hadoop Ecosystem like Hadoop Map Reduce, HDFS, HIVE, PIG, Pentaho, HBase, Zookeeper, Sqoop, Oozie, Cassandra, Flume and Avro.
  • Good experience with Oozie Workflow Engine in running workflow jobs with actions that run Hadoop Map/Reduce and Pig jobs.
  • Experience in understanding the security requirements for Hadoop and integrate with Kerberos authentication and authorization infrastructure.
  • Expert in importing and exporting the data using Sqoop from HDFS to Relational Database systems/mainframe.
  • Good experience in Apache Flume for efficiently collecting, aggregating, and moving large amounts of log data.
  • Experience and in - depth understanding of analyzing data using HIVEQL, PIG Latin.
  • Experience in managing Hadoop clusters using Cloudera Manager Tool.
  • Hands on experience in Sequence files, Combiners, Counters, Dynamic Partitions, Bucketing.
  • Hands-on experience in performing analytics using Apache Spark.
  • Good Knowledge on Map Reduce design patterns.
  • Responsible for setting up processes for Hadoop based application design and implementation.
  • Developed GUI applications by using HTML, CSS, AJAX, JSP and JavaScript libraries to implement user interface screens.
  • Experience in Core Java technologies (JDBC, Multi-Threading, Networking, Generics, Oops concepts, collections, exception handling).
  • Strong experience in Design, Development, Testing, Deploying of Object Oriented, and Web based Enterprise Applications using Java/J2EE technologies.
  • Experienced in developing enterprise applications using MVC frameworks such as Struts, Spring MVC.
  • Proficiency in multiple databases likeMongoDB, Cassandra, MYSQL, Oracle 9i, 10g, 11g and MS SQL Server.
  • Experienced in developing applications using application/Web servers such as Web Sphere, Web logic and Tomcat.
  • Experienced by using Hibernate interfaces, annotations, JDBC and SQL in implementing DAO layers.
  • Extensive knowledge of Agile and Scrum methodologies to develop best practices for software development and implementation.
  • Worked and learned a great deal from (AWS) Cloud services like EC2, S3, EBS, RDS and VPC.
  • Worked on various Tools and IDEs like Eclipse, IBM Rational, Apache Ant-Build Tool, MS-Office, PL SQL Developer, SQL*Plus.
  • Expertise in using Java API and Sqoop to export data into DataStaxCassandracluster from RDBMS.

TECHNICAL SKILLS

Hadoop Ecosystem: HDFS, Map Reduce Hive, Pig, Pentaho, HBase, Storm, Zookeeper, Sqoop, Oozie, Flume, Avro and YARN

Web Technologies: Core Java, J2EE, Servlet, JSP, JDBC, XML, AJAX, SOAP, WSDL, REST.

Methodologies: Agile, UML, Water-Fall and Test Driven

Frameworks: Hibernate 2.x/3.x, Spring 2.x/3.x,Struts 1.x/2.x and JPA

Programming Languages: Java, Unix Shell scripting, HTML, C, C++,SQL, PL/SQL

Data Bases: Oracle 11g, DB2, MS-SQL Server, MYSQL, MS-Access, Mongo DB, Cassandra.

Application Servers: Web Logic, Web Sphere, Apache Tomcat.

Web Design Tools: HTML, DHTML, AJAX, JavaScript, JQuery and CSS, Angular Js, Ext.JS and JSON

Operating Systems: Windows, Linux, Unix

Development / Build Tools: TOAD, SQL Developer, SOAP UI, Eclipse, Ant, Maven, IntelliJ, JUNIT and log4J.

ETL Tools: Talend, Informatica, Pentaho

PROFESSIONAL EXPERIENCE

Confidential, Palo Alto, CA

Hadoop Developer

Responsibilities:
  • Installed and configured Hadoop, Map Reduce, HDFS, developed multiple maps reduce jobs in Java for data cleaning and pre-processing.
  • Involved in creating Hive tables, loading the data and writing hive queries that will run internally in aMap Reduceway.
  • Designed and implementedMap Reducebased large-scale parallel relation-learning system.
  • Analysed Hadoop cluster and Big Data analytic tools including Pig, Hive, HBase and SQOOP.
  • DevelopedSparkscripts by using Scala Shell commands as per the requirement.
  • Involved in converting Hive/SQL queries intoSparktransformations usingSparkRDDs and Scala.
  • Used Spark Streaming to divide streaming data into batches as an input to Spark engine for batch processing.
  • Handled data from RDBMS and streaming sources used Apache Spark for data processing.
  • Responsible for developing Kafka as per the software requirement specifications.
  • Involved in monitoring data and filtering data for high speed data handling usingKafka.
  • Worked in Spark streaming to get ongoing information from theKafkaand store the stream information to HDFS.
  • Implemented UNIX shell scripts to perform cluster admin operations and used Zookeeper for Cluster coordination services.
  • Written Storm topology to accept the events from Kafka producer and emit into Cassandra DB.
  • Imported data from various resources to theCassandracluster using Java APIs.
  • Configured, Documented and Demonstrated inter node communication between Cassandra nodes and client using SSL encryption.
  • Worked on Cassandra in creating tables to load large sets of semi structured data coming from various sources.
  • Analyzed the data to export Cassandra using Sqoop and to generate reports for the BI team.
  • Ingested the data from Relational Databases to HDFS using SQOOP.
  • Configured the servers for Auto scaling and elastic load balancing in AWS EC2 and usedAWSservices like EC2 and S3 for small data sets.
  • Developed Pig Latin scripts for replacing the existing legacy process to theHadoopand the data is fed toAWSS3.
  • Strong experience in working with ELASTIC MAPREDUCE and setting up environments on Amazon AWS EC2 instances.
  • Created S3 buckets and also managing policies for S3 buckets and Utilized S3 bucket and Glacier for storage and backup on AWS.
  • Developer Columnar Data warehouse using the Red shift for analyzing the 4TB of customer data.
  • Used Pig as ETL tool to do joins and some pre-aggregations before storing the data onto HDFS.
  • Analyzed the data by performing Hive queries and running Pig scripts as per customer requirements.
  • Used Spark SQL to load tables into HDFS to run select queries on top.
  • Involved in loading data from UNIX file system to HDFS.
  • Experienced in handling Avro data files inMap Reduceprograms using Avro data serialization system.

Environment: HDFS, Hadoop, AWS, Elastic Map Reduce, Spark, Pig, HBase, SQL, UNIX, Hive, Sqoop, Kafka, Spark Streaming, Linux shell scripting, ETL, Zoo Keeper, Eclipse and Cassandra.

Confidential, New Jersey, NY

Hadoop Developer

Responsibilities:

  • Involved in HBase Java API to populate operational HBase table with Key value.
  • Installed and configured Hive, Pig and Sqoop on theHDP2.0 cluster.
  • Performed data analysis with HBase using Hive External tables.
  • Used Map Reduce to move bulk amount of data into HBase to Integrate with NoSQL database like HBase.
  • Performed real time analytics on HBase using Java API and Fetched data to/from HBase by writing Map Reduce job.
  • Installed and configuredHadoopMapReduce, HDFS, Developed multipleMapReducejobs in java for data cleaning and Processing.
  • Developed Map-Reduce jobs on Yarn and Hadoopclusters to produce daily and monthly reports.
  • SupportedMapReducePrograms those are running on the cluster.
  • Implemented intermediate functionalities like events or records count from the Flume sinks.
  • Designed workflows by scheduling Hive processes for Log file data, which is streamed into HDFS using Flume.
  • Imported data using Sqoop to load data from MySQL to HDFS on regular basis.
  • Extracted the data from MySQL into HDFS using Sqoop.
  • Used Sqoop to load the CDRs from relational DB and other sources toHadoop cluster by using Flume.
  • Automating and scheduling the Sqoop jobs in a timely manner using Unix Shell Scripts.
  • Configured and Installed Hive on the Hadoop cluster.
  • Handled importing of data from various data sources, performed transformations using Hive, Map Reduce, and loaded data into HDFS.
  • Analyzed the data by performing Hive queries (HiveQL) and running Pig Latin scripts to study customer behaviour.
  • Wrote Java code to format XML documents; upload them toSolrserver for indexing.
  • Using Hive join queries to join multiple tables of a source system and load them toElasticsearch tables.
  • Written various key queries inElasticsearchfor retrieval of data effectively.
  • Extensively worked onElasticsearchquerying and indexing to retrieve the documents in high speeds.
  • WrittenElasticsearchtemplate for the index patterns.
  • Loaded two different datasets sources like Oracle, MySQL to HDFS and Hive respectively on daily basis.
  • Optimized Hive queries to extract the customer information from HDFS or HBase.
  • Developed Pig Latin scripts to aggregate the log files of the business clients.
  • Designed and developed Pig Latin Scripts to process data in a batch to perform trend analysis.
  • Using Hive and Pig to develop complex Map reduce streaming jobs using Java language.
  • Loaded and transformed large sets of structured, semi structured data using Pig Scripts.
  • Involved in writing shell scripts in scheduling and automation of tasks.
  • Managed and reviewed Hadoop log files to identify issues when Job fails.
  • Used Teradata Data Mover to copy data and objects such as tables and statistics from one system to another.
  • Worked on Teradata Multi-Load, Teradata Fast-Load utility to load data from Oracle and SQL Server to Teradata

Environment: HDP2.0, Hadoop, HDFS, Map Reduce, Pig, Hive, Sqoop, Java, Flume, Oracle, MySQL, HBase, Elastic Search, Linux Shell Scripting and Big Data.

Confidential, Houston, TX

Hadoop Developer

Responsibilities:

  • Installed and configuredHadoopMap Reduce, HDFS, developed multipleMap Reducejobs in java for data cleaning and pre-processing.
  • Continuously monitored and managed the Hadoop Cluster using Cloudera Manager.
  • Used Clouderamanager to pull metrics on various cluster features like JVM, Running Map and reduce tasks.
  • Utilized Apache Hadoop environment by using Cloudera.
  • Developed a workflow using Oozie to automate the tasks of loading the data into HDFS from analyzing the data.
  • Integrated Oozie with the rest of the Hadoop stack supporting several types of Hadoop jobs out of the box and analyzed the weblog data using the Hiveql.
  • Used Oozie work flow for Scheduling and managing jobs on a Hadoop cluster.
  • Developed complex queries using Hive and Impala.
  • Involved in working withImpalafor data retrieval process.
  • Experienced in migrating HiveQL intoImpalato minimize query response time.
  • Performed transformations using Map Reduce, Hive to load data into HDFS and worked on importing data from various sources.
  • Developed several newMap Reduceprograms to analyze and transform the data to uncover insights into the customer usage patterns.
  • Developed simple and complexMap Reduceprograms in Java for Data Analysis.
  • Written various Hive and Pig scripts and responsible for building scalable distributed data solutions using Hadoop.
  • Developed Hive UDFs to validate against business rules before data move to hive tables.
  • Used Pig scripts to join different data sets to perform queries.
  • Experienced in writing Pig UDF’s to perform analytics with Pig Latin operations.
  • Effectively usedSqoopto transfer data from databases (MySQL, Oracle) to HDFS, Hive.
  • Analyzed relational databases using Sqoop to export data and generate reports.
  • Worked extensively withSqoopfor importing metadata from Oracle.
  • Used MFT to work on the Ingestion of Files into HDFS from remote systems.
  • Worked on Avro Data files using Avro Serialization system.
  • Experienced with different scripting language like Python and shell scripts.
  • Implemented Frameworks using Java and python to automate the ingestion flow.
  • Designed & Implemented database Cloning using Python and Built backend support for Applications using Shell scripts

Environment: HDFS, Map Reduce, Pig, Hive, Sqoop, Python, Maven, Avro, Impala, Oozie, Avro, Eclipse and Shell Scripting.

Confidential

Java Developer

Responsibilities:

  • Implemented business rules by using Java & J2EE technology to modify the components.
  • Involved in Agile methodology includes daily SCRUM meetings, iteration planning, etc.
  • Designed different use cases by using UML, Class and Activity diagrams.
  • Developed GUI application by using HTML, Java Script and JSP to view pages in desktop portal.
  • Implemented Core Java in the application by using (Multi-Threading, Collections, OOPS, exception handling, JDBC).
  • Developed Spring ORM/JDBC layers to integrate Hibernate for regular JDBC calls.
  • Implemented navigation layer using Spring MVC components like dispatcher Servlet, controllers and view resolver components.
  • Implemented business components as Spring Beans and configured using Dependency Injection and annotations.
  • Consumed Web Services (REST) from third party and implemented those using JAX-RS.
  • Used Hibernate in persistence layer and developed POJO's, Data Access Object (DAO) to handle all database operations.
  • Packaging and Deployment of builds through Maven scripts.
  • Deployed the Application on Web Sphere application Server and used Message Driven Beans with MQ Series.
  • Implemented logger for debugging using Log4j and used GIT as version management tool.
  • Performed Code review and JIRA is used to track the information.
  • Performed unit testing using JUNIT framework to check the code review.
  • Created and modified Stored Procedures, Functions, Triggers and Complex SQL Commands using Oracle 11g as a database.

Environment: Java SE/ J2EE, Core Java, JSP,JSTL, XML,HTML, Query, Spring MVC/ DI /ORM /JDBC, Log4J, JUNIT, JavaScript, JSF, Ajax, Hibernate, Web Sphere, Eclipse, Unix, Ant, JUNIT, REST, GIT, Maven, MQ, JIRA, Oracle 11g.

Confidential

Java Developer

Responsibilities:

  • Coordinated in Object Oriented Analysis & Design of the project.
  • Developed HTML navigation menu that is derived from the database in the form of XML.
  • Performed client side validation for the application using Java-Script and J-Query.
  • Extensively involved in developing Web interface using JSP, JSP Standard Tag Libraries (JSTL) using Struts Framework.
  • The Struts Framework is used in the application which is based on MVC design pattern.
  • Developed client-side AJAX application that uses XSLT, XPath, JavaScript, OOP, and retrieve them via JNDI interface.
  • The Communication with databases is used by using JDBC and worked on Stored Procedures.
  • Used web services as a REST Services (JAX-RS) for XML response type.
  • Implemented Log4J for Logging Errors, debugging and tracking.
  • Performed unit testing using JUNIT framework to check the code review.
  • Developed Ant scripts to build and deploy the application onto Web Logic Application Server and ranUNIXshell scripts and implemented auto deployment process.
  • Developed and tested the application using Eclipse.

Environment: Java, XML, JSP, HTML, CSS, JavaScript, Struts MVC, MYSQL, JSTL, Servlet, JAX-RS, REST, Eclipse, JDBC, Log4J, JUNIT, Web Logic, Ant, JDBC.

We'd love your feedback!