Sr. Hadoop Developer Resume

Dallas, TX

SUMMARY:

  • Over 10 years of professional IT experience in requirement gathering, design, development, testing, implementation, and maintenance. Progressive experience in all phases of the iterative Software Development Life Cycle (SDLC).
  • Good knowledge of Hadoop cluster architecture and cluster monitoring.
  • In-depth knowledge of statistics, machine learning, and data mining.
  • In-depth understanding of data structures and algorithms.
  • Experience in managing and reviewing Hadoop log files.
  • Excellent understanding and knowledge of NoSQL databases such as HBase and Cassandra.
  • Experience in setting up standards and processes for Hadoop-based application design and implementation.
  • Experience using Sqoop to import and export data between HDFS and relational database systems.
  • Excellent understanding and knowledge of Hadoop architecture and its components, such as HDFS, JobTracker, TaskTracker, NameNode, DataNode, and the MapReduce programming paradigm.
  • Hands-on experience in installing, configuring, and using Hadoop ecosystem components such as Hadoop MapReduce, HDFS, HBase, Hive, Scala, Spark, Sqoop, Pig, Oozie, and Flume.
  • Good exposure to Apache Hadoop MapReduce programming, Pig scripting, distributed applications, and HDFS.
  • Good knowledge of MongoDB.
  • Good knowledge of Python.
  • Experience in managing Hadoop clusters using Cloudera Manager.
  • Very good experience in the complete project life cycle (design, development, testing, and implementation) of client-server and web applications.
  • Experience in configuring and troubleshooting Ubuntu.
  • Extensive experience working with SQL Server and MySQL databases.
  • Hands-on experience with VPN, PuTTY, and WinSCP.
  • Extensive experience with Selenium using Java.
  • Hands-on experience in application development using Java.
  • Ability to adapt to evolving technology, strong sense of responsibility and accomplishment.

PROFESSIONAL EXPERIENCE:

Confidential, Dallas, TX

Sr. Hadoop Developer

Responsibilities:

  • Installed and configured Hadoop MapReduce and HDFS; developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
  • Developed workflows using custom MapReduce, Pig, Hive, and Sqoop.
  • Built reusable Hive UDF libraries for business requirements, enabling users to call these UDFs in Hive queries.
  • Preprocessed logs and semi-structured content stored on HDFS using Pig and imported the processed data into the Hive warehouse, enabling business analysts to write Hive queries.
  • Configured big data workflows to run on top of Hadoop using Control-M; these workflows comprise heterogeneous jobs such as Pig, Hive, Sqoop, and MapReduce.
  • Developed a suite of unit test cases for Mapper, Reducer, and Driver classes using the MR Testing library.
  • Developed a workflow in Control-M to automate loading data into HDFS and preprocessing it with Pig.
  • Used Maven extensively to build JAR files of MapReduce programs and deployed them to the cluster.
  • Handled bug fixing and 24/7 production support.

Hadoop Developer

Confidential

Environment: Java (JDK 1.6), Eclipse, Oracle 11g/10g, Subversion, Hadoop (Cloudera distribution), MapReduce, HDFS, Hive, HBase, Pig, Cassandra, IBM DataStage 8.1, Toad 9.6, Windows NT, Linux.

Responsibilities:

  • Participated in reviews of functional and non-functional requirements.
  • Facilitated knowledge transfer sessions.
  • Installed and configured Hadoop MapReduce and HDFS; developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
  • Imported and exported data into HDFS and Hive using Sqoop.
  • Defined job flows.
  • Managed and reviewed Hadoop log files.
  • Extracted files from RDBMS through Sqoop, placed them in HDFS, and processed them.
  • Ran Hadoop streaming jobs to process terabytes of XML-format data.
  • Gained good experience with NoSQL databases.
  • Supported MapReduce programs running on the cluster.
  • Loaded data from the UNIX file system to HDFS.
  • Created Hive tables, loaded them with data, and wrote Hive queries that run internally as MapReduce jobs.
  • Replaced the default Derby metadata store for Hive with MySQL.
  • Executed queries using Hive and developed MapReduce jobs to analyze data.
  • Developed Pig Latin scripts to extract data from web server output files and load it into HDFS.
  • Developed Pig UDFs to preprocess the data for analysis.
  • Developed Hive queries for the analysts.
  • Loaded data from Linux and UNIX file systems to HDFS.
  • Supported setting up the QA environment and updating configurations for implementing scripts with Pig.
  • Developed a custom File System plug-in for Hadoop so it can access files on the Data Platform. This plug-in allows Hadoop MapReduce programs, HBase, Pig, and Hive to work unmodified and access files directly.
  • Designed and implemented a MapReduce-based, large-scale parallel relation learning system.
  • Extracted feeds from social media sites such as Facebook and Twitter using Python scripts.
  • Set up and benchmarked Hadoop/HBase clusters for internal use.
  • Integrated Oozie with the rest of the Hadoop stack, supporting several types of Hadoop jobs out of the box (such as MapReduce, Pig, Hive, and Sqoop) as well as system-specific jobs (such as Java programs and shell scripts).

Confidential

Sr. Analyst

Roles and Responsibilities:

  • Responsible for the design of the automation framework and automation test plan, and for the preparation of project metrics.
  • Reviewed the testing team's test cases and brought any missing scenarios to their attention.
  • Held a weekly call with the testing team to collect their metrics and discuss their schedules and deadlines.

Confidential

Automation Engineer

Roles and Responsibilities:

  • Responsible for designing the test plan, reviewing test cases, and sending the consolidated defect report daily.
  • Sent project metrics to the project manager weekly.
  • Mentored team members, assigned tasks, and worked closely with them to deliver quality output.
  • Performed testing on different Android APIs.
  • Performed globalization testing.

Confidential

Automation Engineer

Technical Environment: J2ME, BlackBerry JDE

Roles and Responsibilities:

  • Mentored a team member on setting up the environment and guided him in understanding the application.
  • Responsible for designing the test plan, reviewing test cases, and sending the consolidated defect report daily.
  • Performed end-to-end testing of the mobile conference application, covering functionality, regression, usability, and compatibility.
  • Tested on the different series of BlackBerry simulators that the application supports to make sure nothing was broken.
  • Performed globalization testing of the MCC application on different series of BlackBerry simulators.
  • Tested the MCC application on a BlackBerry device to confirm that it works as it does in the simulator.
