Hadoop Developer Resume

SUMMARY

Over 6 years of experience in all aspects of Software development methodology including gathering System Requirements, Analyzing the requirements, Designing and developing systems.
Good knowledge of all phases of Software Development Life Cycle (SDLC).
2+ years experienced Big Data/Hadoop developer having end to end experience in developing applications in Hadoop ecosystems.
Performed Importing and Exporting data from RDBMS into HDFS and Hive using Sqoop.
Experience in understanding the client’s Big Data business requirements and transform it into Hadoop centric technologies.
Experience in dealing with Apache Hadoop components like HDFS, MapReduce, HiveQL, Pig, Big Data and Big Data Analytics.
Hands on experience in designing and querying the NOSQL databases like Hbase.
Knowledge on SparkSQL, Spark Streaming, Cassandra and Kafka.
Used Oozie to schedule various jobs on Hadoopcluster.
Used Hive to analyses the partitioned and bucketed data.
Experienced in write Impala queries to retrieve data.
Strong experience in database design, writing complex SQL Queries and Stored Procedures.
Excellent experience in database such as Oracle 11g, Microsoft SQL Server, DB2.
Expertise in Developing and Supporting both Production and Development Systems.
Have good experience of all testing phases such as Unit testing, Integration testing and System testing.
Excellent team player with strong analytical, organizational and communication skills.
Adept at learning new technologies, analyzing problems, a self-starter with strong work ethics.

TECHNICAL SKILLS

Big Data Eco Systems: HDFS, MapReduce, Hive, Pig, Sqoop, Oozie, Flume, SparkSQL

Language: Java, PL/SQL

No Sql: HBase

Scripting: Shell Scripting

Methodologies: Agile, Waterfall

RDBMS/Databases: MS SQL Server, Oracle 10g/11g and MySQL

Operating Systems: Windows 7/8/XP, Unix, Linux

Tools Used: Winscp, Putty, Eclipse

PROFESSIONAL EXPERIENCE

Confidential

Hadoop Developer

Responsibilities:

Gathered the business requirements from the Business Improvement Team and Subject Matter Experts.
Ingested historical data from DB2 into HDFS.
Importing and exporting data into HDFS and Hive using Sqoop.
Loaded data into Hive Tables from Hadoop Distributed File System (HDFS) to provide SQL-like access on Hadoop data.
Hive external tables were used for raw data and managed tables were used for intermediate tables.
Developed Hive Scripts (HQL) for automating the joins for different sources.
Created hive partitioning, bucketing and performed joins on hive tables and utilizing hive SerDes like CVS, JSON.
Involved in managing and reviewing Hadoop log files.
The logs that are stored on HDFS are analyzed and the cleaned data is imported into Hive warehouse, which enabled end business analysts to write Hive queries.
Involved in the tasks of resolving defects found in testing the new application.
Shell scripts were developed to add the process dates to the source files, to create trigger files
Developed various Big Data workflows using Oozie.
Analyzed Business Requirements and Identified mapping documents required for system and functional testing efforts for all test scenarios.
Created jobs to load data from HDFS into HBASE.

Environment: Hadoop, HDFS, Hive, MapReduce, Sqoop, Java, Pig, SQL Server, Shell Scripting, DB2

Hadoop Developer

Confidential

Responsibilities:

Transferring and exporting data from Oracle into HDFS and Hive using Sqoop.
Developing HQL queries to implement the select, insert, update and operations to the database by creating HQL named queries.
Automatically Importing data in regular basis using sqoop into the Hive partition by using apache Oozie
Experiencing in managing and reviewing Hadoop log files Load and transform large sets of data.
Conducting data extraction that may include analyzing, reviewing, modeling based on requirements using higher Level Tools such as Hive and Impala.
Involving in creating Hive tables, loading with data and writing hive queries.
Developed Sqoop Jobs to both import data into HDFS from Relational Database Management System like Oracle & DB2 and export data from HDFS to Oracle.
Developed Pig functions to preprocess the data for analysis.
Created Oozie workflows to sqoop the data from source to HDFS and then to target tables.
Created HBase tables to store all data.
Analyzed identified defects and its root cause and recommended course of actions.
Gathered business requirements in meetings for successful implementation and POC (Proof-of-Concept) of Hadoop Cluster.
Loaded data into Hive Tables from Hadoop Distributed File System (HDFS) to provide SQL-like access on Hadoop data.
Worked on streaming the analyzed data to the existing relational databases using Sqoop for making it available for visualization and report generation by the BI team.

Environment: Hadoop, HDFS, Hive, MapReduce, Sqoop, Java, Pig, SQL Server, Shell Scripting.

SQL Developer

Confidential

Responsibilities:

Used Microsoft SQL Server Management Studio to create stored procedures that generated report data
Worked with Implementation team and closely moved with end users to implement manufacturing solutions.
Worked closely with developers, end users and Administrators in designing and creating databases and other objects.
Actively involved in designing databases for the system.
Developed several database objects such as tables, triggers and views and stored procedures.
Created and scheduled jobs and alerts.
Studied the existing environments and accumulated the requirements by interaction with various aspects.
Successfully loaded data from an EXCEL, Access, Flat files into a SQL Server database.
Developed & executed several optimized queries.
Written several stored procedures to achieve various functionalities of the system.
Created Database Objects like Tables, Stored Procedures, Views, Triggers, and UDFs.
Worked with external and internal customers to collect business requirements for complex reports
Developed stored procedures using T-SQL to replace outdated code.
Used SQL Server Integration Services to ensure timely processing of scheduled data import tasks
Used Visual Studio Team Foundation Server 2010 to maintain source code
Used SQL Profiler to debug and optimize existing queries for better performance

Environment: Microsoft SQL server 2008/2012, Oracle, SQL Server Management Studio, Team Foundation Server.

We provide IT Staff Augmentation Services!

We'd love your feedback!

Resume Categories

Client Services

Job Seekers

Visa Sponsorship