Sr Hadoop Developer Resume

Bellevue, WA

PROFESSIONAL SUMMARY:

  • 8+ years of experience with emphasis on Big Data technologies and the design and development of Java-based enterprise applications across all phases of the software development life cycle.
  • Cloudera certified CCA Spark and Hadoop Developer with hands-on experience with major components of the Hadoop ecosystem such as Hadoop MapReduce, HDFS, Hive, Impala, Pig, HBase, Zookeeper, Oozie, Sqoop, Flume, Spark and PySpark.
  • Expert in importing and exporting data using Sqoop from HDFS to relational database systems and vice versa.
  • Expert level of scripting using Pig scripts and Hive queries for processing and analyzing large volume of data.
  • Experience with the Oozie workflow engine in designing workflows and scheduling jobs with actions that run Hadoop MapReduce and Pig jobs.
  • Good experience in developing and implementing big data solutions and data mining applications on Hadoop using Hive, Pig, HBase, Hue and Oozie workflows, and in designing and implementing Java and Python MapReduce programs.
  • Knowledge of installing, configuring, and using Hadoop ecosystem components such as Hadoop MapReduce, HDFS, MapR, HBase, Oozie, Hive, Sqoop, Pig, Flume, Apache Spark, Zookeeper and Kafka.
  • Experience in managing and reviewing Hadoop log files.
  • Hands-on experience with Hadoop administration, configuration management, monitoring, debugging, and performance tuning.
  • Hands-on experience in converting Hive/SQL queries into Spark transformations using Scala.
  • Good experience with Flume for data ingestion from various data producers (web servers) into Hadoop.
  • Good knowledge of the NoSQL databases HBase and MongoDB.
  • Sound knowledge of relational database concepts; worked extensively with Oracle, MySQL and SQL Server.
  • Good experience with databases, writing complex queries and stored procedures using SQL and PL/SQL.
  • Experience in using sequence file, RC file and Avro file formats.
  • Good understanding of classic Hadoop and YARN architecture along with various Hadoop daemons such as JobTracker, TaskTracker, NameNode, DataNode, Secondary NameNode, ResourceManager and NodeManager, as well as the ApplicationMaster and containers.
  • Very good experience with both MapReduce 1 (Job Tracker) and MapReduce 2 (YARN) setups.
  • Expert in Java MapReduce jobs and user-defined functions for Pig and Hive.
  • Knowledge of handling messaging services using Apache Kafka.
  • Familiarity with real-time streaming data using Spark for fast, large-scale, in-memory processing.
  • Experience with Business Intelligence tools such as Tableau for visually analyzing data.
  • Experience in building and maintaining multiple Hadoop clusters of different sizes and configurations, and in setting up the rack topology for large clusters.
  • Developed machine learning algorithms using Mahout for clustering and data mining.
  • Involved in all phases of the SDLC: project proposal, planning, analysis, development, testing, deployment and support.
  • Experience in developing and implementing web applications using Java, JSP, CSS, HTML, HTML5, XHTML, JavaScript, JSON, XML and JDBC.
  • Experience working in 24x7 support; accustomed to meeting deadlines and adapting to ever-changing priorities.
  • Proven ability to work with senior technical managers and staff to provide expert-level support for the installation, maintenance, upgrading, and administration of full-featured database management systems.
  • Excellent interpersonal and communication skills; creative, research-minded, technically competent and result-oriented, with strong problem-solving skills and the ability to work well with people and maintain good relationships within the organization.

TECHNICAL SKILLS:

Technology Tools: Big Data and Hadoop, Apache/Cloudera HDFS 1.X/2.X, MapReduce, YARN, Sqoop, Flume, Spark, Scala, Hive, HBase, Pig, Oozie, Zookeeper, Kafka

Operating Systems: MS Windows, Linux, Ubuntu, CentOS

Programming Languages: C, C++, Java, SQL, PL/SQL, Python, Unix Shell Scripting

Database: Oracle, MySQL, Microsoft SQL Server; NoSQL databases: HBase, MongoDB

IDE Tools: Eclipse, NetBeans

Other Skills: Tableau, HTML5, JavaScript, JSON, CSS, XML, Apache Tomcat

PROFESSIONAL EXPERIENCE:

Confidential, Bellevue, WA

Sr Hadoop Developer

Responsibilities:

  • Responsible for building scalable distributed data solutions using Hadoop.
  • Installed and configured Hive, Pig, Sqoop, Flume and Oozie on the Hadoop cluster.
  • Set up and benchmarked Hadoop/HBase clusters for internal use.
  • Developed simple to complex MapReduce jobs using Hive and Pig.
  • Optimized MapReduce jobs to use HDFS efficiently by using various compression mechanisms.
  • Imported data from various data sources, performed transformations using Hive and MapReduce, loaded the data into HDFS, and extracted data from MySQL into HDFS using Sqoop.
  • Analyzed the data by performing Hive queries and running Pig scripts to study customer behavior. Used UDFs to implement business logic in Hadoop.
  • Worked on the Hortonworks platform to perform Hadoop operations.
  • Implemented business logic by writing UDFs in Java.
  • Continuously monitored and managed the Hadoop cluster using Cloudera Manager.
  • Worked with application teams to install operating system, Hadoop updates, patches, version upgrades as required.
  • Monitored Hadoop cluster job performance and capacity planning.
  • Involved in loading data from UNIX/LINUX file system to HDFS.
  • Deployed Hadoop (HDFS, MapReduce and HBase) clusters and handled their configuration, administration, maintenance, performance tuning, monitoring and troubleshooting.
  • Responsible for cluster maintenance, adding and removing cluster nodes, cluster monitoring and troubleshooting, and managing and reviewing data backups and Hadoop log files.
  • Handled importing of data from various data sources, performed transformations using Hive, MapReduce, and loaded data into HDFS.
  • Analyzed the data by performing Hive queries and running Pig scripts to know user behavior.
  • Installed the Oozie workflow engine to run multiple Hive jobs.
  • Developed Hive queries to process the data and generate data cubes for visualization.
  • Participated in 24x7 support for Big Data/Hadoop.

Environment: Java (JDK 1.6), Eclipse, Cloudera CDH, Hortonworks, MapR, Windows Azure, Subversion, Hadoop, HDFS, MapReduce, YARN, Hive, HBase, Pig, Sqoop, Flume, Oozie, Ubuntu, CentOS, Oracle 11g/10g, PL/SQL, SQL*Plus, Linux/UNIX shell scripting, NoSQL, MongoDB, XML, JSON, Python, JavaScript, JUnit.

Confidential, Minneapolis, MN

Hadoop Developer

Responsibilities:

  • Installed and configured Apache Hadoop clusters for application development, along with Hadoop tools such as Hive, Pig, Oozie, Zookeeper, HBase, Flume and Sqoop.
  • Implemented multiple MapReduce jobs in Java for data cleaning and pre-processing.
  • Worked in a team managing a 40-node cluster and expanded it by adding nodes; the additional data nodes were configured through the Hadoop commissioning process.
  • Responsible for cluster maintenance, adding and removing cluster nodes, cluster monitoring and troubleshooting, and managing and reviewing data backups and log files.
  • Responsible for managing data coming from different sources.
  • Managed and scheduled Jobs on a Hadoop cluster.
  • Implemented a script to transmit information from Oracle to HBase using Sqoop.
  • Involved in defining job flows, managing and reviewing log files.
  • Installed the Oozie workflow engine to run multiple MapReduce, HiveQL and Pig jobs.
  • Participated in requirements gathering from experts and business partners and converted the requirements into technical specifications.
  • Created Hive tables to store the processed results in a tabular format.
  • Worked with various compression codecs and file formats such as Snappy, Gzip, Avro, SequenceFile and text. Wrote complex Hive queries and UDFs in Java and Python.
  • Created and exposed Hive views through Impala for business users.
  • Involved in forecasting based on current results and insights derived from data analysis.
  • Involved in collecting data and identifying data patterns to build trained models using machine learning.
  • Prepared developer (unit) test cases and executed developer testing.
  • Implemented test scripts to support test driven development and continuous integration.
  • Developed and implemented machine learning algorithms using Mahout for mining the data stored in HDFS.
  • Created and maintained technical documentation for launching Hadoop clusters and for executing Hive queries and Pig scripts.
  • Worked with the visualization tool Tableau for visually analyzing the data.

Environment: Hadoop, HDFS, Pig, Hive, MapReduce, Impala, Sqoop, Flume, Oozie, Big Data, Java, Python, Mahout, JUnit, Oracle, MySQL, Tableau, Linux, Windows.

Confidential, New York, NY

Java/Hadoop Developer

Responsibilities:

  • Analyzed large data sets by running Hive queries and Pig scripts.
  • Involved in creating Hive tables, and loading and analyzing data using Hive queries.
  • Developed simple to complex MapReduce jobs using Hive and Pig.
  • Involved in running Hadoop jobs for processing millions of records of text data.
  • Loaded and transformed large sets of structured, semi-structured and unstructured data.
  • Responsible for managing data coming from different sources.
  • Implemented multiple MapReduce jobs in Java for data cleaning and pre-processing.
  • Implemented partitioning, dynamic partitions and buckets in Hive.
  • Monitored system health and logs and responded accordingly to any warning or failure conditions.
  • Implemented the workflows using Apache Oozie framework to automate tasks.
  • Worked with application teams to install operating system, Hadoop updates, patches, version upgrades as required.
  • Developed unit test cases for Hadoop MapReduce jobs with JUnit.
  • Developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
  • Wrote shell scripts and Python scripts to automate jobs.
  • Involved in loading data from the Linux file system to HDFS.
  • Assisted in exporting analyzed data to relational databases using Sqoop.
  • Supported MapReduce programs running on the cluster.
  • Created and maintained technical documentation for launching Hadoop clusters and for executing Hive queries and Pig scripts.

Environment: Hadoop, HDFS, Pig, Hive, MongoDB, MapReduce, Sqoop, Oozie, Python, Big Data, Java, Oracle 11g/10g, MySQL, Linux, Windows, Teradata, Teradata SQL Assistant, VSS, Outlook, PuTTY, MLOAD, TPUMP, FAST LOAD, FAST EXPORT, TDWM, PMON, DBQL.

Confidential, Bloomington, IL

Java Developer

Responsibilities:

  • Involved in requirements gathering, design, development, unit testing and bug fixing.
  • Used Agile methodologies to manage full life-cycle development of the project.
  • Developed the application using Struts, Spring and Hibernate.
  • Developed a rich user interface using JavaScript, JSTL, CSS, jQuery and JSPs.
  • Developed custom tags for implementing logic in JSPs.
  • Used JavaScript, jQuery, JSTL, CSS and Struts 2 tags for developing the JSPs.
  • Involved in making release builds for deploying the application for test environments.
  • Used Oracle database as backend database.
  • Wrote SQL to update and create database tables.
  • Used Eclipse as IDE.
  • Used the RIDC interface to retrieve content details and create content through the application.
  • Used Spring IoC for injecting the beans.
  • Used Hibernate for connecting to the database and mapping the entities using Hibernate annotations.
  • Created JUnit test cases for unit testing the application.
  • Used JUnit and jMock for unit testing.

Environment: J2EE 1.6, JSP, JSTL, Ajax, Spring 2.5, Struts 2.0, Hibernate 3.2, JDBC, JNDI, XML, XSLT, Web Services, WSDL, Log4j, Oracle 11g, Oracle WebLogic Server 10.3, SVN, Windows XP, UML.

Confidential

Java Developer

Responsibilities:

  • Managed connectivity using JDBC for querying, inserting and data management, including triggers and stored procedures.
  • Developed the UI using HTML, JavaScript and JSP; developed business logic and interfacing components using Business Objects and XML.
  • Communicated with the client to analyze and review business and technical requirements.
  • Designed and developed Co-Branding framework to change UI for different clients using same application.
  • Used Struts, Spring (MVC) Framework to develop the application.
  • Used the Factory, DAO, Singleton, DTO, Value Object and Business Delegate design patterns.
  • Designed and developed the database layer using ORM technologies such as Hibernate.
  • Developed a web services client to consume vendor web services.
  • Generated Excel reports using the POI framework.
  • Implemented the business logic and created dynamic web pages with JSP, using JavaScript to incorporate client-side and server-side validations and functionality.
  • Prepared and reviewed unit test plans, scripts and results.
  • Used appropriate design patterns to implement the reusable components of the application.
  • Implemented various GUI screens using JSP, the AJAX framework and jQuery.
  • Worked with web methods for deploying and administering services.
  • Used Hibernate to interact with database.
  • Worked with requirement analysis team to gather software requirements for application development.
  • Provided support and maintenance after deploying the web application.
  • Resolved issues reported by the client.

Environment: Linux, Windows, Core Java, JSP, Servlets, Spring MVC Framework, Hibernate, Oracle, JMS, Ajax, jQuery, XML, Log4j, Apache Tomcat and Eclipse.
