Sr. Hadoop Developer Resume

SUMMARY:

12+ years of experience in the Information Technology industry.
7+ years in Data Warehousing and Big Data Environment.
Excellent analytical thinking and problem-solving skills.
Exceptional communication and interpersonal skills.
Big Data Programmer with experience across multiple components of the Hadoop ecosystem, such as Hive, Sqoop, Flume, Pig, and Spark with Scala.
Experience in building and maintaining multiple Hadoop clusters of different sizes and configurations.
Understanding of information architecture and usability design concepts, along with object-oriented programming (OOP).
Experienced in multiple Hadoop distributions, including Cloudera, MapR, and Hortonworks.
Adept at performance tuning and throughput optimization techniques.
Well-versed in importing and exporting data with Hadoop tool suites.
Experienced in evaluating ETL and OLAP tools and recommending the most suitable solutions based on business needs.
Adept at documenting technical and application software designs and at developing applications in Java, C, and C#.
Experience spanning the healthcare, retail, banking, and mobile telecommunications industries.

TECHNICAL SKILLS:

Operating System: Windows, Linux

Hadoop Ingestion Tools: Sqoop, Flume

Hadoop Data Processing Tools: HDFS, Spark, Scala, Hive, Pig, Shell Scripting

Hadoop Scheduling & Monitoring Tools: Zena, Ambari

Database: MySQL, Oracle, Netezza

Language: Scala, C, Java, Python

IDE: Eclipse, IntelliJ

Repository: Git

Other: Jira

Projects Summary

Confidential

Sr. Hadoop Developer

Roles and Responsibilities

Established end-to-end automation processes for all files using Zena.
Designed and developed applications in Spark using Scala.
Ingested data from fixed-width flat files in HDFS into the raw layer using Spark with Scala.
Used Scala to apply the business transformation rules from the mapping document to 32 tables.
Involved in creating HDFS directories and partitions.
Involved in creating external Hive tables for all TMG government data.
Moved history data to CDC and then merged it into the current tables.
Altered table partitions using Hive.
Created GCF format tables using denormalized data.
Involved in error-handling tasks in the process.
Involved in running Hadoop jobs to process millions of records of text data.
Involved in validating record counts and file names.
Performed unit testing and validated the case outputs.
Tuned Spark job performance for optimal utilization of cluster resources.

Environment: Hadoop, HDFS, Hive, Apache Spark, Scala, and Shell Scripting.

Confidential

Sr. Hadoop Developer

Roles and Responsibilities

Established end-to-end automation processes for all files using Zena.
Imported DB2 data into HDFS using Sqoop.
Involved in creating HDFS directories and partitions.
Involved in creating Hive tables for all members' Bluestar membership data.
Created Hive tables, then loaded and analyzed data using Hive queries.
Tuned Flume agent performance by configuring different properties.
Developed Spark scripts using Scala shell commands as per the requirements.
Migrated Hive scripts to Scala for ingestion.
Involved in error-handling tasks in the process.
Involved in running Hadoop jobs to process millions of records of text data.
Involved in validating record counts and file names.
Performed unit testing and validated the case outputs.
Tuned Spark job performance for optimal utilization of cluster resources.
Created an Oozie workflow to schedule the Spark rule engine job.

Environment: Hadoop, HDFS, Hive, Pig, Sqoop, Apache Spark, Scala, Shell Scripting.

Confidential

Sr. Hadoop Developer

Roles and Responsibilities

Provided technical designs and architecture; supported automation, installation, configuration, and upgrade tasks; and planned system upgrades of the Hadoop cluster.
Ingested GCPS files into HDFS and created external Hive tables so that the data is available to the consumption team.
Maintained Hadoop clusters for dev/staging/production. Trained the development, administration, testing, and analysis teams on the Hadoop framework and Hadoop ecosystem.
Developed UNIX shell scripts to create reports from Hive data.
Integrated big data technologies and analysis tools into the overall architecture.

Environment: Hadoop, HDFS, Hive, Apache, Shell Scripting.

Confidential

Sr. Hadoop Developer