Hadoop Developer Resume

PROFESSIONAL SUMMARY:

7 years of professional IT experience which includes over 2 years of experience in Big data ecosystem related technologies.
5 Years of experience in Analysis, Designing and Development of various Web based applications.
Over 2 years of extensive experience in Hadoop/Spark development and various components such as Hadoop MapReduce, HDFS, Spark Core, Spark SQL, Spark Streaming, Hive, Sqoop.
Experience using SQL queries to access and manipulate data in MySQL.
Self - driven, Quick learner and excellent team player.
Excellent verbal and written communication skills excel and presentation skills.

TECHNICAL SKILLS:

Big Data: Knowledge of Hadoop, Sqoop, Hive,Spark

Data Analytics & Visualization: Tableau

Databases: MySQL

Programming/Scripting Languages: Familiarity with Java, Python, R

Operating Systems: Windows Vista, XP and 98

PROFESSIONAL EXPERIENCE:

Confidential

Hadoop Developer

Environment: Hadoop Ecosystem, HDFS, Sqoop, Hive, Spark, Python, Tableau.

Responsibilities:

Create design architecture for Data Ingestion from multiple sources like RDBMS & Cloudera
Developed a SQOOP Incremental Import Job, Shell Script & CRONJOB for importing data into HDFS
Imported data from HDFS into Hive using Hive commands
Created Hive partition on Dates and Stocks for imported data
Developed a PySpark Script which dynamically downloads the Data files into the HDFS system.
Created PySpark RDDs for data transformation
Proficient in SQL Queries, triggers
Worked with Structured & Unstructured, RDBMS & CSV data.

Confidential

Hadoop Developer/Data Analyst

Responsibilities:

Project involved implementing big data solution to Confidential using Machine Learning techniques.

Confidential

Hadoop Developer/Data Analyst

Environment: Hadoop Ecosystem, HDFS, Sqoop, Hive, Spark, Python, Tableau.

Responsibilities:

Built data pipelines to Load and transform large sets of structured, semi structured and unstructured data.
Imported data from HDFS into Hive using HiveQL
Involved in creating Hive tables, loading and analyzing data using hive queries
Created Hive Partitioned and Bucketed tables to improve performance.
Developed a SQOOP Import Job, Shell Script & CRONJOB for importing data into HDFS
Used Tableau for visualization and building dashboards
To improve performance and optimization of the existing algorithms, explored different components like Spark Context, Spark-SQL, Data Frame, Pair RDD's, accumulators.
Processed millions of records usingHadoop jobs
Implemented Spark code using Python for RDD transformations & actions in Spark application

Software Engineer

Confidential

Responsibilities:

Programmer Analyst

Confidential

Responsibilities: