Hadoop Developer Resume

SUMMARY

  • Over 6 years of experience in all aspects of software development, including gathering and analyzing system requirements and designing and developing systems.
  • Good knowledge of all phases of Software Development Life Cycle (SDLC).
  • 2+ years of experience as a Big Data/Hadoop developer, with end-to-end experience developing applications in the Hadoop ecosystem.
  • Imported and exported data between RDBMS and HDFS/Hive using Sqoop.
  • Experience in understanding clients' Big Data business requirements and translating them into Hadoop-centric solutions.
  • Experience with Apache Hadoop components such as HDFS, MapReduce, Hive (HiveQL) and Pig for Big Data analytics.
  • Hands-on experience designing and querying NoSQL databases such as HBase.
  • Knowledge of SparkSQL, Spark Streaming, Cassandra and Kafka.
  • Used Oozie to schedule various jobs on the Hadoop cluster.
  • Used Hive to analyze partitioned and bucketed data.
  • Experienced in writing Impala queries to retrieve data.
  • Strong experience in database design, writing complex SQL Queries and Stored Procedures.
  • Extensive experience with databases such as Oracle 11g, Microsoft SQL Server and DB2.
  • Expertise in Developing and Supporting both Production and Development Systems.
  • Good experience in all testing phases, including unit, integration and system testing.
  • Excellent team player with strong analytical, organizational and communication skills.
  • Self-starter with a strong work ethic, adept at learning new technologies and analyzing problems.

TECHNICAL SKILLS

Big Data Ecosystem: HDFS, MapReduce, Hive, Pig, Sqoop, Oozie, Flume, SparkSQL

Languages: Java, PL/SQL

NoSQL: HBase

Scripting: Shell Scripting

Methodologies: Agile, Waterfall

RDBMS/Databases: MS SQL Server, Oracle 10g/11g and MySQL

Operating Systems: Windows 7/8/XP, Unix, Linux

Tools Used: WinSCP, PuTTY, Eclipse

PROFESSIONAL EXPERIENCE

Confidential

Hadoop Developer

Responsibilities:

  • Gathered the business requirements from the Business Improvement Team and Subject Matter Experts.
  • Ingested historical data from DB2 into HDFS.
  • Imported and exported data to and from HDFS and Hive using Sqoop.
  • Loaded data into Hive Tables from Hadoop Distributed File System (HDFS) to provide SQL-like access on Hadoop data.
  • Used Hive external tables for raw data and managed tables for intermediate data.
  • Developed Hive Scripts (HQL) for automating the joins for different sources.
  • Created Hive partitions and buckets, performed joins on Hive tables, and used Hive SerDes such as CSV and JSON.
  • Involved in managing and reviewing Hadoop log files.
  • Analyzed logs stored on HDFS and imported the cleaned data into the Hive warehouse, enabling business analysts to write Hive queries.
  • Helped resolve defects found during testing of the new application.
  • Developed shell scripts to add process dates to the source files and to create trigger files.
  • Developed various Big Data workflows using Oozie.
  • Analyzed business requirements and identified the mapping documents required for system and functional testing across all test scenarios.
  • Created jobs to load data from HDFS into HBase.

Environment: Hadoop, HDFS, Hive, MapReduce, Sqoop, Java, Pig, SQL Server, Shell Scripting, DB2

Hadoop Developer

Confidential

Responsibilities:

  • Imported and exported data between Oracle and HDFS/Hive using Sqoop.
  • Developed HQL named queries to implement select, insert and update operations on the database.
  • Automated regular imports of data into Hive partitions using Sqoop and Apache Oozie.
  • Managed and reviewed Hadoop log files; loaded and transformed large data sets.
  • Conducted data extraction, including analysis, review and modeling based on requirements, using higher-level tools such as Hive and Impala.
  • Created Hive tables, loaded them with data and wrote Hive queries.
  • Developed Sqoop jobs to import data into HDFS from relational database management systems such as Oracle and DB2, and to export data from HDFS to Oracle.
  • Developed Pig functions to preprocess the data for analysis.
  • Created Oozie workflows to move data with Sqoop from source systems to HDFS and then to target tables.
  • Created HBase tables to store all data.
  • Analyzed identified defects and their root causes and recommended courses of action.
  • Gathered business requirements in meetings for the successful implementation and proof of concept (POC) of the Hadoop cluster.
  • Loaded data into Hive Tables from Hadoop Distributed File System (HDFS) to provide SQL-like access on Hadoop data.
  • Exported the analyzed data to the existing relational databases using Sqoop, making it available for visualization and report generation by the BI team.

Environment: Hadoop, HDFS, Hive, MapReduce, Sqoop, Java, Pig, SQL Server, Shell Scripting.

SQL Developer

Confidential

Responsibilities:

  • Used Microsoft SQL Server Management Studio to create stored procedures that generated report data
  • Worked with the implementation team and closely with end users to implement manufacturing solutions.
  • Worked closely with developers, end users and Administrators in designing and creating databases and other objects.
  • Actively involved in designing databases for the system.
  • Developed several database objects such as tables, triggers, views and stored procedures.
  • Created and scheduled jobs and alerts.
  • Studied the existing environments and gathered requirements by interacting with the various parties involved.
  • Loaded data from Excel, Access and flat files into a SQL Server database.
  • Developed & executed several optimized queries.
  • Wrote several stored procedures to implement various system functions.
  • Created Database Objects like Tables, Stored Procedures, Views, Triggers, and UDFs.
  • Worked with external and internal customers to collect business requirements for complex reports
  • Developed stored procedures using T-SQL to replace outdated code.
  • Used SQL Server Integration Services to ensure timely processing of scheduled data import tasks
  • Used Visual Studio Team Foundation Server 2010 to maintain source code
  • Used SQL Profiler to debug and optimize existing queries for better performance

Environment: Microsoft SQL Server 2008/2012, Oracle, SQL Server Management Studio, Team Foundation Server.