Senior Hadoop Developer / Administrator Resume Tampa, FL - Hire IT People

PROFESSIONAL SUMMARY:

7+ years of professional IT work experience in Analysis, Design, Development, Deployment and Maintenance of critical software and big data applications.
4 years of hands on experience across Hadoop and that includes extensive experience into BigData technologies.
Well versed in Hadoop administration activities such as installation, configuration, supporting and managing of Big Data and underlying infrastructure of Hadoop Cluster using Cloudera Manager.
In depth understanding/knowledge of Hadoop Architecture and various components such as HDFS, HIVE, HUE, Sentry, Kerberos, Impala, PIG, HBase, Flume, Yarn, Spark, Sqoop, Oozie, Zookeeper, NameNode, DataNode and MapReduce concepts .
Hands on experience in developing and deploying enterprise based applications using major Hadoop ecosystem components like MapReduce, YARN, Hive, Hbase, Flume, Sqoop, SparkStreaming, SparkSQL, Storm, Kafka, Oozie.
Experience with Amazon Web Services, AWS command line interface, and AWS data pipeline.
Hands on experience of multiple distributions like Cloudera and Hortonworks.
Experience in managing Hadoop clusters using Cloudera manager tool.
Capable of processing large sets of structured, semi - structured and unstructured data and supporting systems application architecture.
Hands on experience in application development using Java, Scala and Linux shell scripting.
Hands on experience in using MapReduce programming model for Batch processing of data stored in HDFS.
Extensive experience in developing PIG Latin Scripts and using Hive Query Language for data analytics.
Experience in optimizing Hive queries, optimized joins and using different data files with Custom SerDe's.
Experience in using different file formats like CSV, Sequence, AVRO, ORC, JSON and PARQUET files and different compression Techniques like Gzip,Bzip2 and Snappy.
Experience in big data ingestion tools like Sqoop , Flume and ApacheKafka .
Experience in using Flume and Kafka to load the log data from multiple sources into HDFS .
Expert in implementing advanced procedures like text analytics and processing using the in-memory computing capabilities like ApacheSpark written in Scala .
Hands on experience with NoSQL Databases like Hbase .
Experience in retrieving data from databases like MYSQL , Teradata , and Oracle into HDFS using Sqoop.
Hands on experience in setting up workflow using Apache Oozie workflow engine for managing and scheduling Hadoop jobs.
Worked on backend database programming using SQL , PL/SQL , Stored Procedures, Functions, Indexes, Joins, Views, Packages and Database Triggers.
Good understanding and experience with Software Development methodologies like Agile and Waterfall.

TECHNICAL SKILLS:

Big Data Ecosystem: HDFS, Map Reduce, YARN, Pig, Hive, Hbase, Flume, Sqoop, Oozie, Spark, Spark-Streaming, Spark-SQL, Ambari, Kafka

Hadoop Distributions: HortonworksDataplatform2.3.6,Cloudera5.0

Databases : MYSQL, PL/SQL, Oracle, DB2, HBase, Cassandra, And MongoDB.

Operating Systems : Windows, UNIX, RHEL 6.8

Languages: Java, SQL, Linux shell scripting, Python.

Web Technologies: HTML5, CSS3, JavaScript, JQuery, AJAX, Servlets, JSP, JSON, XMLXHTML, JSF, Angular JS

Tools: Adobe, Sql Developer, Flume, Sqoop and Storm

J2EE Technologies: JSP, Java Bean, Servlets, JPA1.0, EJB3.0, Jenkins, JDBC

PROFESSIONAL EXPERIENCE:

Confidential, Tampa, FL

Senior Hadoop Developer / Administrator

Responsibilities :

Responsible for building scalable distributed data solutions using Hadoop.
Developed multiple Map Reduce jobs in Java for data cleaning and preprocessing.
Load and transform large data sets of structured, semi structured and unstructured data using Hadoop / Big Data concepts.
Responsible for importing log files from various sources into HDFS using Flume.
Hands on writing Map Reduce code to make semi structured data as structured data and for inserting data into Hbase from HDFS .
Developed Junit tests for testing Map Reduce and also performed testing using small sample data.
S upported Hbase Architecture Design with the Hadoop Architect team to develop a Database Design in HDFS.
Created EC2 instances and implemented large multi node Hadoop clusters in AWS cloud from scratch.
Managed and reviewed Hadoop Log files as a part of administration for troubleshooting purposes.
Hands on experience in Python and Scala .
Achieved automated deployment for custom applications on Hadoop and AWS.
Experience in setup, configuration and management of security for Hadoop clusters using Kerberos and integration with LDAP / AD at an Enterprise level.
Transformed and aggregated data for analysis by implementing work flow management of Sqoop and Hive scripts.
Key role in migrating production and development Hortonworks Hadoop clusters to a new cloud based cluster solution
Responsible for HBase REST server administration, backup and recovery.
Load and transform large data sets of structured, semi structured and unstructured data using Hadoop/Big Data concepts.
Used Impala to read, write and query the Hadoop data in HDFS from HBase or Cassandra
Write scripts to automate application deployments and configurations monitoring YARN .
Involved in creating Hive tables and loading them with data and writing Hive queries.
Involved in importing data from Oracle tables to HDFS and Hbase tables using Sqoop .
Real time streaming the data using Spark with Kafka for faster processing.
Responsible for developing PL/SQL Functions, Procedures, Packages, Cursors and Triggers .
Involved in Unit testing of the Reports, Forms and PL/SQL Stored procedures.
Configured Spark Streaming to receive real time data from the Kafka and store the stream data to HDFS using Scala .
As a Hadoop admin, monitoring cluster health status on daily basis, tuning system performance related configuration parameters, backing up configuration xml files.
Used Pig as ETL tool to do transformations, event joins and some pre-aggregations before storing the data onto HDFS.
Involved in converting Hive / SQL queries into Spark transformations using Spark RDDs in Scala.
Implemented Kafka Custom encoders for custom input format to load data into Kafka Partitions.
Hands on experience in application development using Java, RDBMS, and Linux shell scripting.
Experience in importing and exporting data using Sqoop from HDFS to Relational Database Systems (RDBMS), Teradata and vice versa.

Environment : Spark, Spark-Streaming, Spark-SQL, Scala, Hadoop, MapReduce, Yarn, Hive, Pig, Sqoop, Storm, Kafka, HBase, Impala, kerboros, AWS, PL/SQL, Cassandra, Ambari, Scala, Python, Junit, J2EE, Web services, Hibernate, Struts, JSP, JDBC, XML, Weblogic Workshop, Jenkins.

Confidential - Milwaukee, Wisconsin

Hadoop Developer/ Administrator

Responsibilities:

Worked on analyzing Hadoop stack and different big data analytic tools including Hive, Hbase database and Sqoop.
Having Knowledge to implement Hortonworks distribution system ( HDP 2.1, HDP 2.2 and HDP 2.3 ).
Developed Map Reduce programs for some refined queries on big data.
Benchmarked and developed POC's using technologies spanning HBase, Cloudera Impala and Shark.
Developed a frame work to handle loading and transform large sets of unstructured data from UNIX system to HIVE tables.
Responsible for building scalable distributed data solutions using Hadoop .
Continuous monitoring and managing the Hadoop cluster through Ganglia and Nagios .
Done major and minor upgrades to the Hadoop cluster.
Worked with business team in creating Hive queries for ad hoc access.
Developed data pipline using Flume, Sqoop, Pig and Java MapReduce to ingest behavioural data into HDFS for analysis.
In depth understanding of Classic MapReduce and YARN architectures.
Implemented Hive Generic UDF's to implement business logic.
Used Hive to analyze the partitioned and bucketed data and compute various metrics for reporting.
Analyzed the data by performing Hive queries, SparkSQL and SparkStreaming .
Developed Spark code using Scala and Spark -SQL/Streaming for faster testing and processing of data.
Worked with cloud services like Amazon Web Services (AWS) and involved in ETL, Data Integration and Migration
Deployed, and administered 80 node Hadoop cluster and administered two smaller clusters.
Responsible for Cluster maintenance, Adding and removing cluster nodes, Cluster Monitoring and Troubleshooting, Manage and review data backups, Manage and review Hadoop log files, Deployed, and administered Hadoop clusters.
Compared Hadoop to commercial big-data appliances from Netezza , Xtreme Data, and LexisNexis . Published and presented results.
Involved in Developing and handling the PL/SQL Packages, Procedures and Database Triggers.
Created and scheduled autosys jobs for automation of PL/SQL wrapper scripts in UNIX.
Implemented Kerberos security for hadoop cluster services and for end-user authentication, and apache ranger for fine grained access control (user authorization) over hadoop and Kerberos security on Hortonworks platform.
Developed Spark Streaming script which consumes topics from distributed messaging source Kafka and periodically pushes batch of data to Spark for real time processing.
Extracted files from Hbase through Sqoop and placed in HDFS for further processing.
Involved in creating generic Sqoop import script for loading data into Hive tables from RDBMS.
Involved in continuous monitoring of operations using Storm.
Involved in design and developed Kafka and Storm based data with the infrastructure team.
Involved in managing and scheduling jobs on a Hadoop cluster using Oozie.
Developed workflow in Oozie to automate the tasks of loading the data into HDFS and pre-processing with Hive.
Understanding of RDBMs and SQL programming skills, such as Oracle and MySql.
Deliver the codes with high quality, including proper design review, unit testing, integration testing.

Environment: Hortonworks Data platform 2.3.6 , Map Reduce, HDFS, Ambari, Hive, Sqoop, MapReduce, Yarn, Pig, Kafka, Storm, Impala, Oozie, SQL, PL/SQL, Flume, Spark, Hbase, Java, J2EE, Web services, Hibernate, Struts, JSP, JDBC, XML, Weblogic Workshop, Jenkins, AWS

Confidential - Chicago, IL

Java Developer/hadoop Deveper

Responsibilities :

Involved in Design, Development and Support phases of Software Development Life Cycle.
Developed the Controller, Service layer using Spring MVC & Spring JDBC.
Creating Custom directives and dependency injection.
Responsible for data extraction and data ingestion from different data sources into Hadoop Data Lake by creating ETL pipelines using Pig, and Hive
Involved in developing XML, HTML, and JavaScript for client side presentation and, data validation on the client side with in the forms.
Used various libraries of JavaScript like jQuery, jQuery UI, backbone.js and node.js.
Created Master Pages, CSS Styles Sheets and Integrated to Silver light and got approval from Business Stake holders.
Involved in transforming data from Mainframe tables to HDFS, and HBASE tables using Sqoop and Pentaho Kettle And also worked on Impala to analyze stored data
Have deep and thorough understanding of ETL tools and how they can be applied in a Big Data environment And supporting and managing Hadoop Clusters using Apache, Horton works, Cloudera and MapReduce
Moved Relational Database data using Sqoop into Hive Dynamic partition tables using staging tables.
Optimizing the Hive queries using Partitioning and Bucketing techniques, for controlling the data distribution.
Worked with Kafka for the proof of concept for carrying out log processing on a distributed system. Worked with NoSQL database Hbase to create tables and store data.
Worked on custom Pig Loaders and storage classes to work with variety of data formats such as JSON and XML file formats.
Involved in Cassandra Data Modelling and Analysis and CQL (Cassandra Query Language).
Worked on migrating MapReduce programs into Spark transformations using Scala.
Created HTML navigation menu that is role based menu items changes dynamically, derived from the database in the form of XML.
Used JQuery core library functions for the logical implementation part at client side for all the applications.
Designed new classes and functionalities using various JQUERY components for customer service.
Designed and developed User Interface Web Forms using Adobe Flash, CSS, and JavaScript.
Used AJAX for implementing dynamic Webpages where the content was fetched via API calls and updated the DOM.

Environment : Java 1.6, CSS3, JavaScript, JQuery, Ext JS, JSON, Hadoop, HDFS, HBase, MapReduce, Pig, Sqoop, Flume, Kafka, Oozie, Zookeeper, Oracle 11g, Agile, WordPress, Eclipse.

Confidential, Boston, Massachusetts

Java Developer

Responsibilities:

Involved in Requirements analysis, design, and development and testing.
Designed UI using HTML, DHTML, Java Script, CSS and JSP.
Used Java script for client side validations.
Developed SQL queries to store and retrieve data from database & used PLSQL.
Used Struts Validation frame work to do business validation from server side.
Used Eclipse IDE to develop the application.
Developing customized reports and Unit Testing using JUnit.
Involved in running PMD defects & Find bugs.
Dealt with business processes and Updated Project documents.
Following Coding guide lines & maintain quality of code.
Involved in fixing defects & Provided support and maintenance and customization
Developing customized reports and Unit Testing using JUnit.
Supported the application in IST, QA and Production environments.
Involved in building the code & deploying on the JBOSS application server.
Involved in validating the application for different browser compatibility & users load

Environment: Core Java, J2EE, JSP, JBoss, Servlets, Oracle, Eclipse, JMS, XML, HTML, HTML, Java Script, CSS, AJAX, Struts.

Confidential

Java Developer

Responsibilities:

Involved in projects utilizing Java, Java EE web applications to create fully-integrated client management systems.
Developed UI using HTML, JavaScript, JSP and developed business Logic and interfacing components using Business Objects, JDBC and XML.
Participated in user requirement sessions to analysis and gather Business requirements.
Development of user visible site using s , back end admin sites using Python and big data using core java.
Involved in development of the application using Spring Web MVC and other components of the
Elaborated Use Cases based on business requirements and was responsible for creation of class Diagrams, Sequence Diagrams.
Implemented Object-relation mapping in the persistence layer using Hibernate (ORM) framework.
Implemented REST Web Services with Jersey API to deal with customer requests
Experienced in developing Restful web services: consumed and also produced.
Used Hibernate for the Database connection and Hibernate Query Language (HQL) to add and retrieve the information from the Database.
Implemented Spring Jdbc for connecting oracle database.
Designed the application using MVC framework for easy maintainability
Provided bug fixing and testing for existing web applications.
Involved in full system life cycle and responsible for Developing, Testing, Implementing.
Written Hibernate components.
Developed client side validations with Java script.

Environment: Spring, JSP, Servlets, REST, Oracle, AJAX, Java Script, JQuery, Hibernate, Web Logic, HTML, XML, CVS, Eclipse, SOAP Web Services, XSLT, XSD, UNIX, shell scripting, MVS, ISPF.

We provide IT Staff Augmentation Services!

Senior Hadoop Developer / Administrator Resume

Tampa, FL

We'd love your feedback!

Resume Categories

Client Services

Job Seekers

Visa Sponsorship