We provide IT Staff Augmentation Services!

Senior Hadoop Developer Resume

2.00/5 (Submit Your Rating)

Confidential

SUMMARY:

  • Over 7+ years of professional IT experience including 4+ plus years of experience in Big Data, Hadoop Development, Ecosystem Analytics, Development and Design of Javabased enterprise applications.
  • Experience in different Hadoop distributions like Cloudera Distributions.
  • Extensive expertise in both MapReduce MRv1 and MapReduce MRv2 (YARN).
  • Extensive experience in Testing, Debugging and Deploying MapReduce Hadoop platforms.
  • Extensive experience in working withtechnologies and tools such as like Hadoop 1, Hadoop 2, Map Reduce, HDFS, HBase, PIG, Hive, Sqoop, Flume, Oozie, Kafka, Zookeeper and HBase.
  • Expert in writing MapReduce program, creating Pig and Hive UDFs using Java to analyze the data efficiently.
  • Expertise in installing, designing, sizing, configuring, provisioning and upgrading Hadoop environments.
  • Expertise in Commissioning and decommissioning the nodes in Hadoop Cluster.
  • Experience in Tuning and troubleshooting performance issues in Hadoop cluster with a large size of data.
  • Experience in importing, exporting and loading files from Oracle/SQL to Hive and HDFS from Oracle and SQL Server using SQOOP.
  • Experienced in developing simple to complex Map/Reduce jobs using Hive and Pig to handle files in multiple formats (JSON, Text, XML, Sequence File, etc.).
  • Worked with operating systems like Linux, CentOS and Windows with different technologies such as Big Data, Java, XML, HTML, SQL, PL/SQL, ShellScripting and Business Intelligence.
  • Strong Knowledge of Flume and Spark, Kafka.
  • Good knowledge in programming Scala and Spark and Spark components like Spark SQL, SQL Context, Hive Context.
  • Worked on Performance Tuning of Hadoop jobs by applying techniques such as Map Side Joins, Partitioning, and Bucketing.
  • Good working knowledge on Eclipse IDE for developing and debugging Java applications.
  • Well experienced in using networking tools like Putty and WinSCP.
  • Expertise in creating UI using HTML, CSS, and JavaScript.
  • Good experience in using databases - SQL Server, Stored Procedures, Constraints, and Triggers.
  • Experienced in creating and analyzing Software Requirement Specifications (SRS) and Functional Specification Document (FSD), Strong knowledge of Software Development Life Cycle (SDLC).
  • Extensive experience in creating PL/SQL or SQL Stored Procedures, packages, functions, cursors.
  • Experienced in preparing and executing Unit Test Plan and Unit Test Cases after software development. Defect Management and Defect Tracking to do Performance Tuning for delivering the utmost Quality product.
  • Experience in Scrum, Agile, and Waterfall models.
  • Excellent communication Skills, committed, result oriented, hard working with a quest to learn new technologies.
  • Participated in Project Requirement and Planning meetings, flexible in working independently and as a team, Quickie in learning new technologies and tools.
  • Highly motivated with excellent verbal and written communication skills, excellent presentation capabilities, and efficient requirement gathering ability and efficiently convey them to other members of the team.

TECHNICAL SKILLS:

Frameworks: HDFS, MapReduce, Pig, Hive, Sqoop, Oozie, Zookeeper, Flume and HBase, Spark, Scala, Kafka

Databases: Microsoft SQL Server, MySQL, Oracle

Languages: C, C#, Java, Scala, SQL, TSQL, Pig Latin, Hive

Web Technologies: JSP, JDBC, XML

Operating Systems: Windows 7, Windows 98/00/NT/XP, CentOS, Ubuntu

FrontEnd: HTML, CSS, JavaScript/JQuery

Development Tools: Microsoft SQL Studio, Toad, Eclipse, MySQL Workbench

Reporting Tool: SSRS

Office Tools: Microsoft Office suite( Excel, Word, PowerPoint, Visio)

Development Methodologies: Agile/Scrum, Waterfall

Other skills: Business Intelligence, GIT

PROFESSIONAL EXPERIENCE:

Confidential, New Jersey

Senior Hadoop Developer

Responsibilities:

  • Leading the development team, requirement gathering, developing, testing and tracking the deliverables and progress of the tasks and accomplishing them on time.
  • Responsible for delivery and review of all tasks delivery
  • Analyzing, writing Hadoop MapReduce jobs using Java API, Pig, and Hive.
  • Handling Code Management for various region architectures.
  • Preparing weekly status and monthly status
  • Involved in start to end process of Hadoop, Spark jobs that used different technologies such as Sqoop, PIG, Hive, MapReduce, Scala and Shell scripts(for scheduling of few jobs).
  • Installing cluster, commissioning & decommissioning of data node, name node high.
  • Configuring MySQL Database to store Hive metadata.
  • Solving performance issues in Hive and Pig scripts with the help of Joins, Group and aggregation tasks.
  • Writing MapReduce programs in Java on MRv2 / YARN environment
  • Troubleshooting performance problems and tuning Hadoop cluster.
  • Spark SQL and Spark Streaming for data streaming and analysis.
  • Importing and exporting data into HDFS and Hive using Sqoop.
  • Developing PIG scripts to transform the raw data into intelligent data as specified by business users.
  • Developing Pig Latin scripts to extract the data from the web server output files to load into HDFS
  • Exporting the analyzed data to the relational databases using Sqoop for visualization and to generate reports for the BI team.
  • Implementing test scripts to support Agile/Scrum development and continuous integration.
  • Responsible for cluster upkeep, including and expelling cluster hubs, cluster observing and investigating, oversee and survey information reinforcements, monitor, and audit Hadoop log documents.

Environment: Hadoop, Sqoop, Flume, Oozie, MapReduce (YARN), HDFS, Pig, Hive, HBase, Java, Oracle 10g, MySQL, Ubuntu, Linux scripting.

Confidential, New York, USA

Hadoop Developer

Responsibilities:

  • Created MapReduce projects to parse the crude information, populate organizing tables and store the refined information in apportioned tables.
  • Wrote MapReduce jobs using various Input and Output formats also used custom formats whenever necessary.
  • MapReduce code that will take input as log files and parse the logs and structure them in tabular format to facilitate effective querying on the log data.
  • Installed and configured Hadoop MapReduce, HDFS, Developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
  • Designed workflow in Oozie to automate the tasks of loading the data into HDFS and pre-processing, analyzing and training the classifier using MapReduce jobs, Pig jobs, and Hive jobs.
  • Developed a custom Filesystem plugin for Hadoop so it can access files on Data Platform. This plugin allows Hadoop MapReduce programs, Hbase, Pig and Hive to work unmodified and access files directly.
  • Used Sqoop to dump data from the relational database into HDFS for processing.
  • Created/modified UDF’s and for Hive and Pig script processing whenever necessary.
  • Taken a shot at Hive for further analysis and for generating transforming documents from various analytical arrangements to content records.
  • Included in information ingestion into HDFS utilizing Sqoop and Flume from the assortment of sources.
  • Overseen data from different sources utilizing Hadoop streaming to process terabytes data in XML.
  • Included in stacking data from UNIX File System framework to HDFS.
  • Executed Fair schedulers on the Job Tracker to share the assets of the Cluster for the MapReduce employments given by the clients.
  • Effectively used Sqoop to transfer data from databases to HDFS, Hive.
  • Involved in creating Hive tables, loading the data using it and in writing Hive queries to analyze the data.
  • Used Kafka for real-time data injection and processing.
  • Evaluated the use of Zookeeper in cluster coordination services.
  • Configuration and installation of Hive and also wrote Hive UDFs that helped spot market trends.
  • Gained business knowledge on suspect fraud identification, appeals process, etc.

Environment: Hadoop, HDFS, MapReduce (YARN), Pig, Hive, Sqoop, Oozie, Flume, Kafka, Zookeeper, Java, Linux Shell Scripting and Oracle.

Confidential, New Jersey

Big Data Developer

Responsibilities:

  • Responsible for designing, developing and implementing data flows jobs using Hive, Pig and MapReduce jobs using Java.
  • Participated in Hadoop training and development as a part of cross training program.
  • Understanding and translating complex functional and technical requirements into detailed design.
  • Implemented workflow jobs using Linux scripting to perform ETL task on Hadoop Platform.
  • Worked extensively in creating MapReduce jobs for search and analytics in the identification of various trends
  • Ingested data into Hadoop HDFS using various big data technologies like Pig and Hive.
  • Installed and configured Hadoop Map Reduce, HDFS, Developed multiple Map Reduce jobs in Java for data cleaning and preprocessing.
  • Experienced in writing Pig scripts and Pig UDFs to pre-process the data for analysis.
  • Experienced in managing and reviewing Hadoop log files.
  • Used Sqoop to dump data from a relational database into HDFS for processing.
  • Built front end using JSP, HTML, and JavaScript to create user-friendly and appealing interface.
  • Performed analysis of data sets and uncovered insights which helped in accomplishing development tasks.
  • Worked with the team on a day - to - day development tasks and did Code reviews etc.
  • Monitoring and maintenance of all software components in the project.

Environment: CDH4 with Hadoop, HDFS, Pig, Hive, Sqoop, JSPs, HTML, JavaScript, JQuery, CSS, Linux Shell Scripting.

Confidential, Connecticut

Java Developer

Responsibilities:

  • Involved in designing, coding, debugging, documenting and maintaining some applications.
  • Participated in training and development as a part of cross training program.
  • Included in the prerequisite examination and complete development of client side code.
  • Prepared use cases and designed class diagrams and object models.
  • Involved in the creation of SQL tables and indexes and also wrote queries to read/manipulate data.
  • Used JDBC to establish the connection between the database and the application.
  • Implemented view layer using JSPs and also made custom JSP tags.
  • Created the user interface using HTML, CSS, and JavaScript.
  • Created/modified shell scripts for scheduling and automating tasks.
  • Followed Sun coding and documentation standards.

Environment: Java, JSPs, HTML, CSS, JavaScript, SQL, JDBC and Oracle 9i/10g.

We'd love your feedback!