Senior Hadoop Developer / Administrator Resume

Tampa, FL

PROFESSIONAL SUMMARY:

  • 7+ years of professional IT work experience in Analysis, Design, Development, Deployment and Maintenance of critical software and big data applications.
  • 4 years of hands-on experience with Hadoop, including extensive experience with Big Data technologies.
  • Well versed in Hadoop administration activities such as installation, configuration, support and management of Big Data and the underlying infrastructure of Hadoop clusters using Cloudera Manager.
  • In-depth understanding of Hadoop architecture and its components, such as HDFS, Hive, Hue, Sentry, Kerberos, Impala, Pig, HBase, Flume, YARN, Spark, Sqoop, Oozie, ZooKeeper, NameNode, DataNode, and MapReduce concepts.
  • Hands-on experience in developing and deploying enterprise applications using major Hadoop ecosystem components such as MapReduce, YARN, Hive, HBase, Flume, Sqoop, Spark Streaming, Spark SQL, Storm, Kafka and Oozie.
  • Experience with Amazon Web Services, AWS command line interface, and AWS data pipeline.
  • Hands-on experience with multiple distributions such as Cloudera and Hortonworks.
  • Experience in managing Hadoop clusters using Cloudera Manager.
  • Capable of processing large sets of structured, semi-structured and unstructured data and supporting systems application architecture.
  • Hands-on experience in application development using Java, Scala and Linux shell scripting.
  • Hands-on experience in using the MapReduce programming model for batch processing of data stored in HDFS.
  • Extensive experience in developing Pig Latin scripts and using Hive Query Language for data analytics.
  • Experience in optimizing Hive queries and joins, and in using different data file formats with custom SerDes.
  • Experience in using file formats such as CSV, SequenceFile, Avro, ORC, JSON and Parquet, and compression techniques such as Gzip, Bzip2 and Snappy.
  • Experience with big data ingestion tools such as Sqoop, Flume and Apache Kafka.
  • Experience in using Flume and Kafka to load log data from multiple sources into HDFS.
  • Expert in implementing advanced procedures such as text analytics and processing using the in-memory computing capabilities of Apache Spark, written in Scala.
  • Hands-on experience with NoSQL databases such as HBase.
  • Experience in importing data from databases such as MySQL, Teradata and Oracle into HDFS using Sqoop.
  • Hands-on experience in setting up workflows using the Apache Oozie workflow engine for managing and scheduling Hadoop jobs.
  • Worked on backend database programming using SQL, PL/SQL, stored procedures, functions, indexes, joins, views, packages and database triggers.
  • Good understanding and experience with Software Development methodologies like Agile and Waterfall.

TECHNICAL SKILLS:

Big Data Ecosystem: HDFS, MapReduce, YARN, Pig, Hive, HBase, Flume, Sqoop, Oozie, Spark, Spark Streaming, Spark SQL, Ambari, Kafka

Hadoop Distributions: Hortonworks Data Platform 2.3.6, Cloudera 5.0

Databases: MySQL, PL/SQL, Oracle, DB2, HBase, Cassandra, and MongoDB

Operating Systems: Windows, UNIX, RHEL 6.8

Languages: Java, SQL, Linux shell scripting, Python.

Web Technologies: HTML5, CSS3, JavaScript, jQuery, AJAX, Servlets, JSP, JSON, XML, XHTML, JSF, AngularJS

Tools: Adobe, SQL Developer, Flume, Sqoop and Storm

J2EE Technologies: JSP, JavaBeans, Servlets, JPA 1.0, EJB 3.0, Jenkins, JDBC

PROFESSIONAL EXPERIENCE:

Confidential, Tampa, FL

Senior Hadoop Developer / Administrator

Responsibilities:

  • Responsible for building scalable distributed data solutions using Hadoop.
  • Developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
  • Loaded and transformed large data sets of structured, semi-structured and unstructured data using Hadoop / Big Data concepts.
  • Responsible for importing log files from various sources into HDFS using Flume.
  • Wrote MapReduce code to convert semi-structured data into structured data and to insert data from HDFS into HBase.
  • Developed JUnit tests for MapReduce jobs and also tested them using small sample data sets.
  • Supported HBase architecture design with the Hadoop Architect team to develop a database design in HDFS.
  • Created EC2 instances and implemented large multi-node Hadoop clusters in the AWS cloud from scratch.
  • Managed and reviewed Hadoop Log files as a part of administration for troubleshooting purposes.
  • Hands-on experience in Python and Scala.
  • Achieved automated deployment for custom applications on Hadoop and AWS.
  • Experience in setup, configuration and management of security for Hadoop clusters using Kerberos, and integration with LDAP / AD at an enterprise level.
  • Transformed and aggregated data for analysis by implementing workflow management of Sqoop and Hive scripts.
  • Played a key role in migrating production and development Hortonworks Hadoop clusters to a new cloud-based cluster solution.
  • Responsible for HBase REST server administration, backup and recovery.
  • Used Impala to read, write and query Hadoop data in HDFS from HBase or Cassandra.
  • Wrote scripts to automate application deployments and configurations, and to monitor YARN.
  • Involved in creating Hive tables and loading them with data and writing Hive queries.
  • Involved in importing data from Oracle tables to HDFS and HBase tables using Sqoop.
  • Streamed data in real time using Spark with Kafka for faster processing.
  • Responsible for developing PL/SQL functions, procedures, packages, cursors and triggers.
  • Involved in Unit testing of the Reports, Forms and PL/SQL Stored procedures.
  • Configured Spark Streaming, using Scala, to receive real-time data from Kafka and store the stream data in HDFS.
  • As a Hadoop admin, monitored cluster health daily, tuned performance-related configuration parameters, and backed up configuration XML files.
  • Used Pig as an ETL tool to do transformations, event joins and some pre-aggregations before storing the data in HDFS.
  • Involved in converting Hive / SQL queries into Spark transformations using Spark RDDs in Scala.
  • Implemented custom Kafka encoders for a custom input format to load data into Kafka partitions.
  • Hands-on experience in application development using Java, RDBMS, and Linux shell scripting.
  • Experience in importing and exporting data with Sqoop between HDFS and relational database management systems (RDBMS), including Teradata.

Environment: Spark, Spark Streaming, Spark SQL, Scala, Hadoop, MapReduce, YARN, Hive, Pig, Sqoop, Storm, Kafka, HBase, Impala, Kerberos, AWS, PL/SQL, Cassandra, Ambari, Python, JUnit, J2EE, Web services, Hibernate, Struts, JSP, JDBC, XML, WebLogic Workshop, Jenkins.

Confidential - Milwaukee, Wisconsin

Hadoop Developer / Administrator

Responsibilities:

  • Analyzed the Hadoop stack and various big data analytics tools, including Hive, HBase and Sqoop.
  • Knowledge of implementing the Hortonworks distribution (HDP 2.1, HDP 2.2 and HDP 2.3).
  • Developed MapReduce programs for some refined queries on big data.
  • Benchmarked and developed POCs using technologies spanning HBase, Cloudera Impala and Shark.
  • Developed a framework to load and transform large sets of unstructured data from UNIX systems into Hive tables.
  • Responsible for building scalable distributed data solutions using Hadoop.
  • Continuously monitored and managed the Hadoop cluster through Ganglia and Nagios.
  • Performed major and minor upgrades to the Hadoop cluster.
  • Worked with the business team in creating Hive queries for ad hoc access.
  • Developed a data pipeline using Flume, Sqoop, Pig and Java MapReduce to ingest behavioural data into HDFS for analysis.
  • In-depth understanding of classic MapReduce and YARN architectures.
  • Implemented Hive generic UDFs to encode business logic.
  • Used Hive to analyze the partitioned and bucketed data and compute various metrics for reporting.
  • Analyzed the data using Hive queries, Spark SQL and Spark Streaming.
  • Developed Spark code using Scala and Spark SQL / Spark Streaming for faster testing and processing of data.
  • Worked with cloud services such as Amazon Web Services (AWS) and was involved in ETL, data integration and migration.
  • Deployed and administered an 80-node Hadoop cluster and administered two smaller clusters.
  • Responsible for cluster maintenance, adding and removing cluster nodes, cluster monitoring and troubleshooting, managing and reviewing data backups and Hadoop log files, and deploying and administering Hadoop clusters.
  • Compared Hadoop to commercial big-data appliances from Netezza, Xtreme Data, and LexisNexis. Published and presented results.
  • Involved in Developing and handling the PL/SQL Packages, Procedures and Database Triggers.
  • Created and scheduled AutoSys jobs for automation of PL/SQL wrapper scripts in UNIX.
  • Implemented Kerberos security for Hadoop cluster services and end-user authentication, and Apache Ranger for fine-grained access control (user authorization), on the Hortonworks platform.
  • Developed a Spark Streaming script that consumes topics from Kafka, a distributed messaging source, and periodically pushes batches of data to Spark for real-time processing.
  • Extracted files from HBase through Sqoop and placed them in HDFS for further processing.
  • Involved in creating a generic Sqoop import script for loading data into Hive tables from RDBMS.
  • Involved in continuous monitoring of operations using Storm.
  • Involved in designing and developing Kafka- and Storm-based data processing with the infrastructure team.
  • Involved in managing and scheduling jobs on a Hadoop cluster using Oozie.
  • Developed workflows in Oozie to automate loading data into HDFS and pre-processing it with Hive.
  • Understanding of RDBMSs such as Oracle and MySQL, with SQL programming skills.
  • Delivered high-quality code, including proper design review, unit testing and integration testing.

Environment: Hortonworks Data Platform 2.3.6, MapReduce, HDFS, Ambari, Hive, Sqoop, YARN, Pig, Kafka, Storm, Impala, Oozie, SQL, PL/SQL, Flume, Spark, HBase, Java, J2EE, Web services, Hibernate, Struts, JSP, JDBC, XML, WebLogic Workshop, Jenkins, AWS

Confidential - Chicago, IL

Java Developer / Hadoop Developer

Responsibilities:

  • Involved in Design, Development and Support phases of Software Development Life Cycle.
  • Developed the controller and service layers using Spring MVC and Spring JDBC.
  • Created custom directives and used dependency injection.
  • Responsible for data extraction and data ingestion from different data sources into the Hadoop data lake by creating ETL pipelines using Pig and Hive.
  • Developed XML, HTML and JavaScript for client-side presentation and for client-side data validation within forms.
  • Used various JavaScript libraries such as jQuery, jQuery UI, Backbone.js and Node.js.
  • Created master pages and CSS style sheets, integrated them with Silverlight, and obtained approval from business stakeholders.
  • Involved in transferring data from mainframe tables to HDFS and HBase tables using Sqoop and Pentaho Kettle; also used Impala to analyze stored data.
  • Deep understanding of ETL tools and how they apply in a Big Data environment; supported and managed Hadoop clusters using Apache, Hortonworks, Cloudera and MapReduce.
  • Moved relational database data into Hive dynamic-partition tables with Sqoop, using staging tables.
  • Optimized Hive queries using partitioning and bucketing techniques to control data distribution.
  • Worked with Kafka on a proof of concept for log processing on a distributed system. Worked with the NoSQL database HBase to create tables and store data.
  • Worked on custom Pig loaders and storage classes to work with a variety of data formats such as JSON and XML.
  • Involved in Cassandra Data Modelling and Analysis and CQL (Cassandra Query Language).
  • Worked on migrating MapReduce programs into Spark transformations using Scala.
  • Created a role-based HTML navigation menu whose items change dynamically, derived from the database in the form of XML.
  • Used jQuery core library functions for client-side logic in all the applications.
  • Designed new classes and functionalities using various jQuery components for customer service.
  • Designed and developed user interface web forms using Adobe Flash, CSS, and JavaScript.
  • Used AJAX to implement dynamic web pages whose content was fetched via API calls and used to update the DOM.

Environment: Java 1.6, CSS3, JavaScript, jQuery, Ext JS, JSON, Hadoop, HDFS, HBase, MapReduce, Pig, Sqoop, Flume, Kafka, Oozie, ZooKeeper, Oracle 11g, Agile, WordPress, Eclipse.

Confidential, Boston, Massachusetts

Java Developer

Responsibilities:

  • Involved in requirements analysis, design, development and testing.
  • Designed the UI using HTML, DHTML, JavaScript, CSS and JSP.
  • Used JavaScript for client-side validation.
  • Developed SQL queries to store and retrieve data from the database, and used PL/SQL.
  • Used the Struts Validation framework for server-side business validation.
  • Used Eclipse IDE to develop the application.
  • Developed customized reports and performed unit testing using JUnit.
  • Involved in running PMD and FindBugs defect checks.
  • Dealt with business processes and updated project documents.
  • Followed coding guidelines and maintained code quality.
  • Fixed defects and provided support, maintenance and customization.
  • Supported the application in IST, QA and Production environments.
  • Involved in building the code and deploying it on the JBoss application server.
  • Validated the application for cross-browser compatibility and user load.

Environment: Core Java, J2EE, JSP, JBoss, Servlets, Oracle, Eclipse, JMS, XML, HTML, JavaScript, CSS, AJAX, Struts.

Confidential

Java Developer

Responsibilities:

  • Involved in projects using Java and Java EE web applications to create fully integrated client management systems.
  • Developed the UI using HTML, JavaScript and JSP, and developed business logic and interfacing components using Business Objects, JDBC and XML.
  • Participated in user requirement sessions to analyze and gather business requirements.
  • Developed the user-visible site using s , back-end admin sites using Python, and big data components using core Java.
  • Involved in development of the application using Spring Web MVC and other components of the
  • Elaborated use cases based on business requirements and was responsible for creating class diagrams and sequence diagrams.
  • Implemented object-relational mapping in the persistence layer using the Hibernate (ORM) framework.
  • Implemented REST web services with the Jersey API to handle customer requests.
  • Developed RESTful web services, both as a consumer and as a producer.
  • Used Hibernate for the database connection and Hibernate Query Language (HQL) to add and retrieve information from the database.
  • Used Spring JDBC to connect to the Oracle database.
  • Designed the application using an MVC framework for easy maintainability.
  • Provided bug fixing and testing for existing web applications.
  • Involved in the full system life cycle, responsible for development, testing and implementation.
  • Wrote Hibernate components.
  • Developed client-side validations with JavaScript.

Environment: Spring, JSP, Servlets, REST, Oracle, AJAX, JavaScript, jQuery, Hibernate, WebLogic, HTML, XML, CVS, Eclipse, SOAP Web Services, XSLT, XSD, UNIX, shell scripting, MVS, ISPF.
