Senior Hadoop Developer / Administrator Resume
Tampa, FL
PROFESSIONAL SUMMARY:
- 7+ years of professional IT work experience in Analysis, Design, Development, Deployment and Maintenance of critical software and big data applications.
- 4 years of hands-on experience with Hadoop, including extensive experience with Big Data technologies.
- Well versed in Hadoop administration activities such as installation, configuration, supporting and managing of Big Data and underlying infrastructure of Hadoop Cluster using Cloudera Manager.
- In-depth understanding of Hadoop architecture and its components such as HDFS, Hive, Hue, Sentry, Kerberos, Impala, Pig, HBase, Flume, YARN, Spark, Sqoop, Oozie, ZooKeeper, NameNode, DataNode and MapReduce concepts.
- Hands on experience in developing and deploying enterprise applications using major Hadoop ecosystem components like MapReduce, YARN, Hive, HBase, Flume, Sqoop, Spark Streaming, Spark SQL, Storm, Kafka and Oozie.
- Experience with Amazon Web Services, AWS command line interface, and AWS data pipeline.
- Hands on experience with multiple distributions like Cloudera and Hortonworks.
- Experience in managing Hadoop clusters using Cloudera Manager.
- Capable of processing large sets of structured, semi-structured and unstructured data and supporting systems application architecture.
- Hands on experience in application development using Java, Scala and Linux shell scripting.
- Hands on experience in using MapReduce programming model for Batch processing of data stored in HDFS.
- Extensive experience in developing PIG Latin Scripts and using Hive Query Language for data analytics.
- Experience in optimizing Hive queries and joins and in working with different data files using custom SerDes.
- Experience in using different file formats like CSV, Sequence, Avro, ORC, JSON and Parquet files and different compression techniques like Gzip, Bzip2 and Snappy.
- Experience in big data ingestion tools like Sqoop, Flume and Apache Kafka.
- Experience in using Flume and Kafka to load log data from multiple sources into HDFS.
- Expert in implementing advanced procedures such as text analytics and processing using the in-memory computing capabilities of Apache Spark, written in Scala.
- Hands on experience with NoSQL databases like HBase.
- Experience in retrieving data from databases like MySQL, Teradata and Oracle into HDFS using Sqoop.
- Hands on experience in setting up workflow using Apache Oozie workflow engine for managing and scheduling Hadoop jobs.
- Worked on backend database programming using SQL, PL/SQL, Stored Procedures, Functions, Indexes, Joins, Views, Packages and Database Triggers.
- Good understanding and experience with Software Development methodologies like Agile and Waterfall.
TECHNICAL SKILLS:
Big Data Ecosystem: HDFS, MapReduce, YARN, Pig, Hive, HBase, Flume, Sqoop, Oozie, Spark, Spark-Streaming, Spark-SQL, Ambari, Kafka
Hadoop Distributions: Hortonworks Data Platform 2.3.6, Cloudera 5.0
Databases: MySQL, PL/SQL, Oracle, DB2, HBase, Cassandra and MongoDB
Operating Systems: Windows, UNIX, RHEL 6.8
Languages: Java, SQL, Linux shell scripting, Python
Web Technologies: HTML5, CSS3, JavaScript, jQuery, AJAX, Servlets, JSP, JSON, XML, XHTML, JSF, AngularJS
Tools: Adobe, SQL Developer, Flume, Sqoop and Storm
J2EE Technologies: JSP, JavaBeans, Servlets, JPA 1.0, EJB 3.0, Jenkins, JDBC
PROFESSIONAL EXPERIENCE:
Confidential, Tampa, FL
Senior Hadoop Developer / Administrator
Responsibilities:
- Responsible for building scalable distributed data solutions using Hadoop.
- Developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
- Loaded and transformed large data sets of structured, semi-structured and unstructured data using Hadoop / Big Data concepts.
- Responsible for importing log files from various sources into HDFS using Flume.
- Wrote MapReduce code to convert semi-structured data into structured data and to insert data from HDFS into HBase.
- Developed JUnit tests for MapReduce jobs and also tested them using small sample data.
- Supported HBase architecture design with the Hadoop Architect team to develop a database design in HDFS.
- Created EC2 instances and implemented large multi node Hadoop clusters in AWS cloud from scratch.
- Managed and reviewed Hadoop Log files as a part of administration for troubleshooting purposes.
- Hands on experience in Python and Scala.
- Achieved automated deployment for custom applications on Hadoop and AWS.
- Experience in setup, configuration and management of security for Hadoop clusters using Kerberos and integration with LDAP / AD at an Enterprise level.
- Transformed and aggregated data for analysis by implementing workflow management of Sqoop and Hive scripts.
- Played a key role in migrating production and development Hortonworks Hadoop clusters to a new cloud-based cluster solution.
- Responsible for HBase REST server administration, backup and recovery.
- Used Impala to read, write and query the Hadoop data in HDFS from HBase or Cassandra.
- Wrote scripts to automate application deployments and configurations and to monitor YARN.
- Involved in creating Hive tables and loading them with data and writing Hive queries.
- Involved in importing data from Oracle tables to HDFS and HBase tables using Sqoop.
- Streamed data in real time using Spark with Kafka for faster processing.
- Responsible for developing PL/SQL Functions, Procedures, Packages, Cursors and Triggers.
- Involved in Unit testing of the Reports, Forms and PL/SQL Stored procedures.
- Configured Spark Streaming to receive real-time data from Kafka and store the stream data in HDFS using Scala.
- As a Hadoop admin, monitored cluster health daily, tuned performance-related configuration parameters and backed up configuration XML files.
- Used Pig as ETL tool to do transformations, event joins and some pre-aggregations before storing the data onto HDFS.
- Involved in converting Hive / SQL queries into Spark transformations using Spark RDDs in Scala.
- Implemented Kafka Custom encoders for custom input format to load data into Kafka Partitions.
- Hands on experience in application development using Java, RDBMS, and Linux shell scripting.
- Experience in importing and exporting data using Sqoop from HDFS to Relational Database Systems (RDBMS), Teradata and vice versa.
Environment: Spark, Spark-Streaming, Spark-SQL, Scala, Hadoop, MapReduce, YARN, Hive, Pig, Sqoop, Storm, Kafka, HBase, Impala, Kerberos, AWS, PL/SQL, Cassandra, Ambari, Python, JUnit, J2EE, Web services, Hibernate, Struts, JSP, JDBC, XML, WebLogic Workshop, Jenkins.
Confidential - Milwaukee, Wisconsin
Hadoop Developer / Administrator
Responsibilities:
- Worked on analyzing Hadoop stack and different big data analytic tools including Hive, Hbase database and Sqoop.
- Knowledge of implementing the Hortonworks distribution (HDP 2.1, HDP 2.2 and HDP 2.3).
- Developed MapReduce programs for some refined queries on big data.
- Benchmarked and developed POCs using technologies spanning HBase, Cloudera Impala and Shark.
- Developed a framework to handle loading and transforming large sets of unstructured data from UNIX systems into Hive tables.
- Responsible for building scalable distributed data solutions using Hadoop.
- Continuously monitored and managed the Hadoop cluster through Ganglia and Nagios.
- Performed major and minor upgrades to the Hadoop cluster.
- Worked with business team in creating Hive queries for ad hoc access.
- Developed a data pipeline using Flume, Sqoop, Pig and Java MapReduce to ingest behavioural data into HDFS for analysis.
- In depth understanding of Classic MapReduce and YARN architectures.
- Implemented Hive Generic UDFs to implement business logic.
- Used Hive to analyze the partitioned and bucketed data and compute various metrics for reporting.
- Analyzed the data using Hive queries, Spark SQL and Spark Streaming.
- Developed Spark code using Scala and Spark SQL/Streaming for faster testing and processing of data.
- Worked with cloud services like Amazon Web Services (AWS) and was involved in ETL, data integration and migration.
- Deployed and administered an 80-node Hadoop cluster and administered two smaller clusters.
- Responsible for cluster maintenance, adding and removing cluster nodes, cluster monitoring and troubleshooting, managing and reviewing data backups and Hadoop log files, and deploying and administering Hadoop clusters.
- Compared Hadoop to commercial big-data appliances from Netezza, Xtreme Data and LexisNexis. Published and presented results.
- Involved in Developing and handling the PL/SQL Packages, Procedures and Database Triggers.
- Created and scheduled AutoSys jobs for automation of PL/SQL wrapper scripts in UNIX.
- Implemented Kerberos security for Hadoop cluster services and end-user authentication, and Apache Ranger for fine-grained access control (user authorization) over Hadoop, on the Hortonworks platform.
- Developed Spark Streaming script which consumes topics from distributed messaging source Kafka and periodically pushes batch of data to Spark for real time processing.
- Extracted files from HBase through Sqoop and placed them in HDFS for further processing.
- Involved in creating generic Sqoop import script for loading data into Hive tables from RDBMS.
- Involved in continuous monitoring of operations using Storm.
- Involved in designing and developing Kafka- and Storm-based data processing with the infrastructure team.
- Involved in managing and scheduling jobs on a Hadoop cluster using Oozie.
- Developed workflow in Oozie to automate the tasks of loading the data into HDFS and pre-processing with Hive.
- Understanding of RDBMS such as Oracle and MySQL, with SQL programming skills.
- Delivered high-quality code, including proper design review, unit testing and integration testing.
Environment: Hortonworks Data Platform 2.3.6, MapReduce, HDFS, Ambari, Hive, Sqoop, YARN, Pig, Kafka, Storm, Impala, Oozie, SQL, PL/SQL, Flume, Spark, HBase, Java, J2EE, Web services, Hibernate, Struts, JSP, JDBC, XML, WebLogic Workshop, Jenkins, AWS
Confidential - Chicago, IL
Java Developer / Hadoop Developer
Responsibilities:
- Involved in Design, Development and Support phases of Software Development Life Cycle.
- Developed the Controller, Service layer using Spring MVC & Spring JDBC.
- Created custom directives and used dependency injection.
- Responsible for data extraction and data ingestion from different data sources into the Hadoop Data Lake by creating ETL pipelines using Pig and Hive.
- Involved in developing XML, HTML and JavaScript for client-side presentation and for client-side data validation within the forms.
- Used various JavaScript libraries like jQuery, jQuery UI, Backbone.js and Node.js.
- Created master pages and CSS style sheets, integrated them with Silverlight and obtained approval from business stakeholders.
- Involved in transforming data from mainframe tables into HDFS and HBase tables using Sqoop and Pentaho Kettle; also worked with Impala to analyze stored data.
- Deep and thorough understanding of ETL tools and how they can be applied in a Big Data environment; supported and managed Hadoop clusters using Apache, Hortonworks, Cloudera and MapReduce.
- Moved Relational Database data using Sqoop into Hive Dynamic partition tables using staging tables.
- Optimizing the Hive queries using Partitioning and Bucketing techniques, for controlling the data distribution.
- Worked with Kafka on a proof of concept for log processing on a distributed system. Worked with the NoSQL database HBase to create tables and store data.
- Worked on custom Pig loaders and storage classes to work with a variety of data formats such as JSON and XML.
- Involved in Cassandra Data Modelling and Analysis and CQL (Cassandra Query Language).
- Worked on migrating MapReduce programs into Spark transformations using Scala.
- Created a role-based HTML navigation menu whose items change dynamically, derived from the database in the form of XML.
- Used jQuery core library functions for the client-side logic in all the applications.
- Designed new classes and functionalities using various jQuery components for customer service.
- Designed and developed User Interface Web Forms using Adobe Flash, CSS, and JavaScript.
- Used AJAX to implement dynamic web pages whose content was fetched via API calls and used to update the DOM.
Environment: Java 1.6, CSS3, JavaScript, jQuery, Ext JS, JSON, Hadoop, HDFS, HBase, MapReduce, Pig, Sqoop, Flume, Kafka, Oozie, ZooKeeper, Oracle 11g, Agile, WordPress, Eclipse.
Confidential, Boston, Massachusetts
Java Developer
Responsibilities:
- Involved in requirements analysis, design, development and testing.
- Designed UI using HTML, DHTML, JavaScript, CSS and JSP.
- Used JavaScript for client-side validations.
- Developed SQL queries to store and retrieve data from the database and used PL/SQL.
- Used the Struts Validation framework for server-side business validation.
- Used Eclipse IDE to develop the application.
- Developed customized reports and performed unit testing using JUnit.
- Involved in running PMD and FindBugs to detect defects.
- Dealt with business processes and updated project documents.
- Followed coding guidelines and maintained code quality.
- Involved in fixing defects and provided support, maintenance and customization.
- Supported the application in IST, QA and Production environments.
- Involved in building the code and deploying it on the JBoss application server.
- Involved in validating the application for cross-browser compatibility and user load.
Environment: Core Java, J2EE, JSP, JBoss, Servlets, Oracle, Eclipse, JMS, XML, HTML, JavaScript, CSS, AJAX, Struts.
Confidential
Java Developer
Responsibilities:
- Involved in projects utilizing Java, Java EE web applications to create fully-integrated client management systems.
- Developed UI using HTML, JavaScript, JSP and developed business Logic and interfacing components using Business Objects, JDBC and XML.
- Participated in user requirement sessions to analyze and gather business requirements.
- Development of user visible site using s , back end admin sites using Python and big data using core java.
- Involved in development of the application using Spring Web MVC and other components of the
- Elaborated use cases based on business requirements and was responsible for creating class diagrams and sequence diagrams.
- Implemented object-relational mapping in the persistence layer using the Hibernate (ORM) framework.
- Implemented REST web services with the Jersey API to handle customer requests.
- Experienced in developing RESTful web services, both consuming and producing them.
- Used Hibernate for the Database connection and Hibernate Query Language (HQL) to add and retrieve the information from the Database.
- Used Spring JDBC to connect to the Oracle database.
- Designed the application using an MVC framework for easy maintainability.
- Provided bug fixing and testing for existing web applications.
- Involved in full system life cycle and responsible for Developing, Testing, Implementing.
- Wrote Hibernate components.
- Developed client-side validations with JavaScript.
Environment: Spring, JSP, Servlets, REST, Oracle, AJAX, JavaScript, jQuery, Hibernate, WebLogic, HTML, XML, CVS, Eclipse, SOAP Web Services, XSLT, XSD, UNIX, shell scripting, MVS, ISPF.
