Senior Hadoop Developer Resume
Confidential
SUMMARY:
- Over 7+ years of professional IT experience including 4+ plus years of experience in Big Data, Hadoop Development, Ecosystem Analytics, Development and Design of Javabased enterprise applications.
- Experience in different Hadoop distributions like Cloudera Distributions.
- Extensive expertise in both MapReduce MRv1 and MapReduce MRv2 (YARN).
- Extensive experience in Testing, Debugging and Deploying MapReduce Hadoop platforms.
- Extensive experience in working withtechnologies and tools such as like Hadoop 1, Hadoop 2, Map Reduce, HDFS, HBase, PIG, Hive, Sqoop, Flume, Oozie, Kafka, Zookeeper and HBase.
- Expert in writing MapReduce program, creating Pig and Hive UDFs using Java to analyze the data efficiently.
- Expertise in installing, designing, sizing, configuring, provisioning and upgrading Hadoop environments.
- Expertise in Commissioning and decommissioning the nodes in Hadoop Cluster.
- Experience in Tuning and troubleshooting performance issues in Hadoop cluster with a large size of data.
- Experience in importing, exporting and loading files from Oracle/SQL to Hive and HDFS from Oracle and SQL Server using SQOOP.
- Experienced in developing simple to complex Map/Reduce jobs using Hive and Pig to handle files in multiple formats (JSON, Text, XML, Sequence File, etc.).
- Worked with operating systems like Linux, CentOS and Windows with different technologies such as Big Data, Java, XML, HTML, SQL, PL/SQL, ShellScripting and Business Intelligence.
- Strong Knowledge of Flume and Spark, Kafka.
- Good knowledge in programming Scala and Spark and Spark components like Spark SQL, SQL Context, Hive Context.
- Worked on Performance Tuning of Hadoop jobs by applying techniques such as Map Side Joins, Partitioning, and Bucketing.
- Good working knowledge on Eclipse IDE for developing and debugging Java applications.
- Well experienced in using networking tools like Putty and WinSCP.
- Expertise in creating UI using HTML, CSS, and JavaScript.
- Good experience in using databases - SQL Server, Stored Procedures, Constraints, and Triggers.
- Experienced in creating and analyzing Software Requirement Specifications (SRS) and Functional Specification Document (FSD), Strong knowledge of Software Development Life Cycle (SDLC).
- Extensive experience in creating PL/SQL or SQL Stored Procedures, packages, functions, cursors.
- Experienced in preparing and executing Unit Test Plan and Unit Test Cases after software development. Defect Management and Defect Tracking to do Performance Tuning for delivering the utmost Quality product.
- Experience in Scrum, Agile, and Waterfall models.
- Excellent communication Skills, committed, result oriented, hard working with a quest to learn new technologies.
- Participated in Project Requirement and Planning meetings, flexible in working independently and as a team, Quickie in learning new technologies and tools.
- Highly motivated with excellent verbal and written communication skills, excellent presentation capabilities, and efficient requirement gathering ability and efficiently convey them to other members of the team.
TECHNICAL SKILLS:
Frameworks: HDFS, MapReduce, Pig, Hive, Sqoop, Oozie, Zookeeper, Flume and HBase, Spark, Scala, Kafka
Databases: Microsoft SQL Server, MySQL, Oracle
Languages: C, C#, Java, Scala, SQL, TSQL, Pig Latin, Hive
Web Technologies: JSP, JDBC, XML
Operating Systems: Windows 7, Windows 98/00/NT/XP, CentOS, Ubuntu
FrontEnd: HTML, CSS, JavaScript/JQuery
Development Tools: Microsoft SQL Studio, Toad, Eclipse, MySQL Workbench
Reporting Tool: SSRS
Office Tools: Microsoft Office suite( Excel, Word, PowerPoint, Visio)
Development Methodologies: Agile/Scrum, Waterfall
Other skills: Business Intelligence, GIT
PROFESSIONAL EXPERIENCE:
Confidential, New Jersey
Senior Hadoop Developer
Responsibilities:
- Leading the development team, requirement gathering, developing, testing and tracking the deliverables and progress of the tasks and accomplishing them on time.
- Responsible for delivery and review of all tasks delivery
- Analyzing, writing Hadoop MapReduce jobs using Java API, Pig, and Hive.
- Handling Code Management for various region architectures.
- Preparing weekly status and monthly status
- Involved in start to end process of Hadoop, Spark jobs that used different technologies such as Sqoop, PIG, Hive, MapReduce, Scala and Shell scripts(for scheduling of few jobs).
- Installing cluster, commissioning & decommissioning of data node, name node high.
- Configuring MySQL Database to store Hive metadata.
- Solving performance issues in Hive and Pig scripts with the help of Joins, Group and aggregation tasks.
- Writing MapReduce programs in Java on MRv2 / YARN environment
- Troubleshooting performance problems and tuning Hadoop cluster.
- Spark SQL and Spark Streaming for data streaming and analysis.
- Importing and exporting data into HDFS and Hive using Sqoop.
- Developing PIG scripts to transform the raw data into intelligent data as specified by business users.
- Developing Pig Latin scripts to extract the data from the web server output files to load into HDFS
- Exporting the analyzed data to the relational databases using Sqoop for visualization and to generate reports for the BI team.
- Implementing test scripts to support Agile/Scrum development and continuous integration.
- Responsible for cluster upkeep, including and expelling cluster hubs, cluster observing and investigating, oversee and survey information reinforcements, monitor, and audit Hadoop log documents.
Environment: Hadoop, Sqoop, Flume, Oozie, MapReduce (YARN), HDFS, Pig, Hive, HBase, Java, Oracle 10g, MySQL, Ubuntu, Linux scripting.
Confidential, New York, USA
Hadoop Developer
Responsibilities:
- Created MapReduce projects to parse the crude information, populate organizing tables and store the refined information in apportioned tables.
- Wrote MapReduce jobs using various Input and Output formats also used custom formats whenever necessary.
- MapReduce code that will take input as log files and parse the logs and structure them in tabular format to facilitate effective querying on the log data.
- Installed and configured Hadoop MapReduce, HDFS, Developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
- Designed workflow in Oozie to automate the tasks of loading the data into HDFS and pre-processing, analyzing and training the classifier using MapReduce jobs, Pig jobs, and Hive jobs.
- Developed a custom Filesystem plugin for Hadoop so it can access files on Data Platform. This plugin allows Hadoop MapReduce programs, Hbase, Pig and Hive to work unmodified and access files directly.
- Used Sqoop to dump data from the relational database into HDFS for processing.
- Created/modified UDF’s and for Hive and Pig script processing whenever necessary.
- Taken a shot at Hive for further analysis and for generating transforming documents from various analytical arrangements to content records.
- Included in information ingestion into HDFS utilizing Sqoop and Flume from the assortment of sources.
- Overseen data from different sources utilizing Hadoop streaming to process terabytes data in XML.
- Included in stacking data from UNIX File System framework to HDFS.
- Executed Fair schedulers on the Job Tracker to share the assets of the Cluster for the MapReduce employments given by the clients.
- Effectively used Sqoop to transfer data from databases to HDFS, Hive.
- Involved in creating Hive tables, loading the data using it and in writing Hive queries to analyze the data.
- Used Kafka for real-time data injection and processing.
- Evaluated the use of Zookeeper in cluster coordination services.
- Configuration and installation of Hive and also wrote Hive UDFs that helped spot market trends.
- Gained business knowledge on suspect fraud identification, appeals process, etc.
Environment: Hadoop, HDFS, MapReduce (YARN), Pig, Hive, Sqoop, Oozie, Flume, Kafka, Zookeeper, Java, Linux Shell Scripting and Oracle.
Confidential, New Jersey
Big Data Developer
Responsibilities:
- Responsible for designing, developing and implementing data flows jobs using Hive, Pig and MapReduce jobs using Java.
- Participated in Hadoop training and development as a part of cross training program.
- Understanding and translating complex functional and technical requirements into detailed design.
- Implemented workflow jobs using Linux scripting to perform ETL task on Hadoop Platform.
- Worked extensively in creating MapReduce jobs for search and analytics in the identification of various trends
- Ingested data into Hadoop HDFS using various big data technologies like Pig and Hive.
- Installed and configured Hadoop Map Reduce, HDFS, Developed multiple Map Reduce jobs in Java for data cleaning and preprocessing.
- Experienced in writing Pig scripts and Pig UDFs to pre-process the data for analysis.
- Experienced in managing and reviewing Hadoop log files.
- Used Sqoop to dump data from a relational database into HDFS for processing.
- Built front end using JSP, HTML, and JavaScript to create user-friendly and appealing interface.
- Performed analysis of data sets and uncovered insights which helped in accomplishing development tasks.
- Worked with the team on a day - to - day development tasks and did Code reviews etc.
- Monitoring and maintenance of all software components in the project.
Environment: CDH4 with Hadoop, HDFS, Pig, Hive, Sqoop, JSPs, HTML, JavaScript, JQuery, CSS, Linux Shell Scripting.
Confidential, Connecticut
Java Developer
Responsibilities:
- Involved in designing, coding, debugging, documenting and maintaining some applications.
- Participated in training and development as a part of cross training program.
- Included in the prerequisite examination and complete development of client side code.
- Prepared use cases and designed class diagrams and object models.
- Involved in the creation of SQL tables and indexes and also wrote queries to read/manipulate data.
- Used JDBC to establish the connection between the database and the application.
- Implemented view layer using JSPs and also made custom JSP tags.
- Created the user interface using HTML, CSS, and JavaScript.
- Created/modified shell scripts for scheduling and automating tasks.
- Followed Sun coding and documentation standards.
Environment: Java, JSPs, HTML, CSS, JavaScript, SQL, JDBC and Oracle 9i/10g.
