Hadoop Developer Resume
SUMMARY
- Over 6 years of experience in all aspects of Software development methodology including gathering System Requirements, Analyzing the requirements, Designing and developing systems.
- Good knowledge of all phases of Software Development Life Cycle (SDLC).
- 2+ years experienced Big Data/Hadoop developer having end to end experience in developing applications in Hadoop ecosystems.
- Performed Importing and Exporting data from RDBMS into HDFS and Hive using Sqoop.
- Experience in understanding the client’s Big Data business requirements and transform it into Hadoop centric technologies.
- Experience in dealing with Apache Hadoop components like HDFS, MapReduce, HiveQL, Pig, Big Data and Big Data Analytics.
- Hands on experience in designing and querying the NOSQL databases like Hbase.
- Knowledge on SparkSQL, Spark Streaming, Cassandra and Kafka.
- Used Oozie to schedule various jobs on Hadoopcluster.
- Used Hive to analyses the partitioned and bucketed data.
- Experienced in write Impala queries to retrieve data.
- Strong experience in database design, writing complex SQL Queries and Stored Procedures.
- Excellent experience in database such as Oracle 11g, Microsoft SQL Server, DB2.
- Expertise in Developing and Supporting both Production and Development Systems.
- Have good experience of all testing phases such as Unit testing, Integration testing and System testing.
- Excellent team player with strong analytical, organizational and communication skills.
- Adept at learning new technologies, analyzing problems, a self-starter with strong work ethics.
TECHNICAL SKILLS
Big Data Eco Systems: HDFS, MapReduce, Hive, Pig, Sqoop, Oozie, Flume, SparkSQL
Language: Java, PL/SQL
No Sql: HBase
Scripting: Shell Scripting
Methodologies: Agile, Waterfall
RDBMS/Databases: MS SQL Server, Oracle 10g/11g and MySQL
Operating Systems: Windows 7/8/XP, Unix, Linux
Tools Used: Winscp, Putty, Eclipse
PROFESSIONAL EXPERIENCE
Confidential
Hadoop Developer
Responsibilities:
- Gathered the business requirements from the Business Improvement Team and Subject Matter Experts.
- Ingested historical data from DB2 into HDFS.
- Importing and exporting data into HDFS and Hive using Sqoop.
- Loaded data into Hive Tables from Hadoop Distributed File System (HDFS) to provide SQL-like access on Hadoop data.
- Hive external tables were used for raw data and managed tables were used for intermediate tables.
- Developed Hive Scripts (HQL) for automating the joins for different sources.
- Created hive partitioning, bucketing and performed joins on hive tables and utilizing hive SerDes like CVS, JSON.
- Involved in managing and reviewing Hadoop log files.
- The logs that are stored on HDFS are analyzed and the cleaned data is imported into Hive warehouse, which enabled end business analysts to write Hive queries.
- Involved in the tasks of resolving defects found in testing the new application.
- Shell scripts were developed to add the process dates to the source files, to create trigger files
- Developed various Big Data workflows using Oozie.
- Analyzed Business Requirements and Identified mapping documents required for system and functional testing efforts for all test scenarios.
- Created jobs to load data from HDFS into HBASE.
Environment: Hadoop, HDFS, Hive, MapReduce, Sqoop, Java, Pig, SQL Server, Shell Scripting, DB2
Hadoop Developer
ConfidentialResponsibilities:
- Transferring and exporting data from Oracle into HDFS and Hive using Sqoop.
- Developing HQL queries to implement the select, insert, update and operations to the database by creating HQL named queries.
- Automatically Importing data in regular basis using sqoop into the Hive partition by using apache Oozie
- Experiencing in managing and reviewing Hadoop log files Load and transform large sets of data.
- Conducting data extraction that may include analyzing, reviewing, modeling based on requirements using higher Level Tools such as Hive and Impala.
- Involving in creating Hive tables, loading with data and writing hive queries.
- Developed Sqoop Jobs to both import data into HDFS from Relational Database Management System like Oracle & DB2 and export data from HDFS to Oracle.
- Developed Pig functions to preprocess the data for analysis.
- Created Oozie workflows to sqoop the data from source to HDFS and then to target tables.
- Created HBase tables to store all data.
- Analyzed identified defects and its root cause and recommended course of actions.
- Gathered business requirements in meetings for successful implementation and POC (Proof-of-Concept) of Hadoop Cluster.
- Loaded data into Hive Tables from Hadoop Distributed File System (HDFS) to provide SQL-like access on Hadoop data.
- Worked on streaming the analyzed data to the existing relational databases using Sqoop for making it available for visualization and report generation by the BI team.
Environment: Hadoop, HDFS, Hive, MapReduce, Sqoop, Java, Pig, SQL Server, Shell Scripting.
SQL Developer
Confidential
Responsibilities:
- Used Microsoft SQL Server Management Studio to create stored procedures that generated report data
- Worked with Implementation team and closely moved with end users to implement manufacturing solutions.
- Worked closely with developers, end users and Administrators in designing and creating databases and other objects.
- Actively involved in designing databases for the system.
- Developed several database objects such as tables, triggers and views and stored procedures.
- Created and scheduled jobs and alerts.
- Studied the existing environments and accumulated the requirements by interaction with various aspects.
- Successfully loaded data from an EXCEL, Access, Flat files into a SQL Server database.
- Developed & executed several optimized queries.
- Written several stored procedures to achieve various functionalities of the system.
- Created Database Objects like Tables, Stored Procedures, Views, Triggers, and UDFs.
- Worked with external and internal customers to collect business requirements for complex reports
- Developed stored procedures using T-SQL to replace outdated code.
- Used SQL Server Integration Services to ensure timely processing of scheduled data import tasks
- Used Visual Studio Team Foundation Server 2010 to maintain source code
- Used SQL Profiler to debug and optimize existing queries for better performance
Environment: Microsoft SQL server 2008/2012, Oracle, SQL Server Management Studio, Team Foundation Server.
