Sr. Hadoop Consultant Resume Sunnyvale, CA - Hire IT People

PROFESSIONAL SUMMARY:

8 years of professional IT experience in Big Data, Hadoop, Java/J2EE and Cloud technologies in the Financial, Retail and Healthcare domains.
Over 4 years of experience on Big Data platforms as both Developer and Administrator.
Experience in building high-performance and scalable solutions using various Hadoop ecosystem tools like Pig, Hive, Sqoop, Spark, Solr and Kafka.
Responsible for designing and building a Data Lake using Hadoop and its ecosystem components.
Handled data movement, data transformation, analysis and visualization across the lake by integrating it with various tools.
Defined extract-transform-load (ETL) and extract-load-transform (ELT) processes for the Data Lake.
Extensively worked on Spark and its components like Spark SQL, SparkR and Spark Streaming.
Defined real-time data streaming solutions across the cluster using Spark Streaming, Apache Storm, Kafka, NiFi and Flume.
Strong expertise in planning, installing and configuring Hadoop clusters based on business needs.
Installed and configured multiple Hadoop clusters of different sizes with ecosystem components like Pig, Hive, Sqoop, Flume, HBase, Oozie and ZooKeeper.
Worked on all major Hadoop distributions: Cloudera (CDH4, CDH5), Hortonworks (HDP 2.2, 2.4) and Pivotal.
Experience in implementing failover mechanisms for NameNode, ResourceManager and Hive.
Configured AWS EC2 instances, S3 buckets and cloud services, and architected the flow of data to and from AWS.
Transformed and aggregated data for analysis by implementing workflow management of Sqoop, Hive and Pig scripts.
Experience working with file formats like Avro, Parquet, ORC and SequenceFile, and compression codecs like Gzip, LZO and Snappy in Hadoop.
Experience writing Oozie workflows and Job Controllers for job automation.
Integrated Oozie with Hue and scheduled workflows for multiple Hive, Pig and Spark Jobs.
In-depth knowledge of Scala and experience building Spark applications using Scala.
Good experience working with Tableau and Spotfire, and enabled JDBC/ODBC data connectivity from those tools to Hive tables.
Adequate knowledge of Scrum, Agile and Waterfall methodologies.
Experience in developing Applications using Java, J2EE, JSP, MVC, Servlets, Struts, Hibernate, JDBC, JSF, EJB, XML, AJAX and web based development tools.
Expertise in web Technologies like HTML, CSS, PHP, XML.
Worked on various Tools and IDEs like Eclipse, IBM Rational, Apache Ant-Build Tool, MS-Office, PLSQL Developer, SQL*Plus.
Highly motivated, with the ability to work independently or as an integral part of a team, and committed to the highest levels of professionalism.

TECHNICAL SKILLS:

Big Data / Hadoop: HDFS, MapReduce, HBase, Kafka, Pig, Hive, Sqoop, Impala and Flume

Real time/Stream Processing: Apache Storm, Apache Spark

Cloud Technologies: Amazon web services, EC2, S3, EMR, Redshift

Operating Systems: Windows, Unix and Linux

Programming Language: C, Java, J2EE, SQL

Databases: Oracle 9i/10g, SQL Server, MS Access

Web Technologies: HTML, XML, JavaScript

IDE Development Tools: Eclipse, NetBeans

Methodologies: Agile, Scrum and Waterfall

PROFESSIONAL EXPERIENCE:

Confidential, Sunnyvale, CA

Sr. Hadoop Consultant

Responsibilities:

Installing, configuring and testing Hadoop ecosystem components like MapReduce, HDFS, Pig, Hive, Sqoop, Flume, Oozie, Hue and HBase.
Imported data from various sources into HDFS and Hive using Sqoop.
Involved in writing custom MapReduce, Pig and Hive programs.
Experience in writing custom UDFs in Java to extend Hive and Pig Latin functionality.
Created partitions and buckets in Hive for both managed and external tables to optimize performance.
Worked on several PoCs involving NoSQL databases like HBase, MongoDB and Cassandra.
Configured Tez as execution engine for Hive queries to improve the performance.
Developed a data pipeline using Kafka and Storm to store data in HDFS and performed real-time analytics on the incoming data.
Hands-on experience in Spark and Spark Streaming, creating RDDs and applying transformations and actions on them.
Configured Spark Streaming to receive real-time data from Kafka and store the stream data in HDFS using Scala.
In-depth knowledge of Scala and experience building Spark applications using Scala.
Configured Flume to stream data into HDFS and Hive using HDFS Sinks and Hive sinks.
Collecting and aggregating large amounts of log data using Apache Flume and staging data in HDFS for further analysis.
Involved in scheduling Oozie workflow engine to run multiple Hive, Pig and Spark jobs.
Worked with application teams to install operating system, Hadoop updates, patches, version upgrades as required.
Experience in Commissioning, Decommissioning, Balancing, and Managing Nodes and tuning server for optimal performance of the cluster.

Environment: Hadoop, HDFS, Pig, Hive, MapReduce, Sqoop, LINUX, and Big Data

Confidential, Atlanta, GA

Sr. Hadoop Developer

Responsibilities:

Installed, configured and maintained Apache Hadoop clusters for application development, along with Hadoop tools like Hive, Pig, HBase and HDFS.
Designing and implementing semi-structured data analytics platform leveraging Hadoop.
Worked on performance analysis and improvements for Hive and Pig scripts at MapReduce job tuning level.
Used Sqoop to load data from RDBMS into HDFS.
Implemented several PoCs to validate and fit various Hadoop ecosystem tools on CDH and Hortonworks distributions.
Involved in Hadoop cluster tasks like adding and removing nodes without any effect on running jobs and data.
Designed and implemented error-free Data Warehouse ETL and Hadoop integration.
Proficient in data modelling with Hive partitioning, bucketing and other Hive optimization techniques.
Developed Pig Latin scripts to extract the data from the web server output files to load into HDFS.
Developed workflow in Oozie to automate the tasks of loading the data into HDFS and pre-processing with Pig.
Set up standards and processes for Hadoop based application design and implementation.
Wrote shell scripts for several day-to-day processes and worked on their automation.
Collected log data from web servers and integrated it into HDFS using Flume.
Implemented the Fair Scheduler on the JobTracker to share cluster resources among the MapReduce jobs submitted by users.
Worked on establishing connectivity between Tableau and Spotfire.

Environment: Hadoop, HDFS, MapReduce, MongoDB, Java, VMware, Hive, Eclipse, Pig, HBase, Sqoop, Flume, Linux, UNIX.

Confidential, King of Prussia, PA

Hadoop Developer

Responsibilities:

Responsible for building scalable distributed data solutions using Hadoop.
Collected and loaded data generated by sensors tracking patients' body activity into HDFS.
Performed necessary transformations and aggregation to build the common learner data model in a NoSQL store (HBase).
Used Pig, Hive and MapReduce for analyzing health insurance data and patient information.
Developed workflow in Oozie to orchestrate a series of Pig scripts to remove, merge and compress files using pig pipelines in the data preparation stage.
Used Pig UDFs written in Python and Java, and used sampling on large data sets.
Moved all log files generated from various sources to HDFS through Flume for further processing.
Extensively used Pig to communicate with Hive and HBase using HCatalog and handlers.
Involved in transferring data from legacy tables to HDFS and HBase tables using Sqoop.
Implemented test scripts to support test-driven development and continuous integration.
Exported analyzed data to relational databases using Sqoop for visualization and to generate reports for the BI team.
Good understanding of ETL tools and their application to Big Data environment.

Environment: Hadoop, MapReduce, Spark, HDFS, Hive, Pig, Oozie, Core Java, HBase, Flume, Cloudera, Oracle 10g, UNIX Shell Scripting.

Confidential, St Petersburg, FL

Java Developer

Responsibilities:

Designed the application in J2EE architecture and developed dynamic, browser-compatible user interfaces for online account management, order and payment processing.
Used Hibernate object-relational mapping (ORM) to achieve data persistence.
Developed Servlets and JSPs based on MVC pattern using Spring Framework.
Developed required helper classes following Core Java multi-threaded programming.
Developed the presentation layer using JSP, Tag libraries, HTML, CSS and client validations using JavaScript.
Developed Hibernate DAO classes using Spring JDBC Template and methods in the DAO layer to persist the POJOs in the database.
Designed and developed Web services based on SOAP and WSDL for handling transaction history.
Involved in designing and developing the JSON, XML Objects with MySQL.
Developed web applications using Spring MVC, jQuery and implemented Spring Dependency Injection mechanism.
Integrated user interface, server layer and persistence layer using Spring IOC, AOP and Spring MVC integration with OBPM and Hibernate.
Developed data access classes using JDBC and created SQL queries and used PL/SQL procedures with Oracle Database.
Used LOG4J & JUnit for debugging, testing and maintaining the system state and tested the website with older and latest versions/releases on multiple browsers.
Implemented test cases for Unit testing of modules using JUnit and used ANT for building the project.
Provided production support for two of the applications involving the Swing and Struts frameworks.

Environment: JDK 1.6, JSP, HTML, JavaScript, JSON, XML, jQuery, Servlets, Spring MVC, Hibernate, Web Services, SOAP, NetBeans.

Confidential, Charlotte, NC

Java Developer

Responsibilities:

Worked with business analysts and product owners to analyse and understand the requirements and provide estimates.
Implemented J2EE design patterns such as Singleton, DAO, DTO and MVC.
Developed a web application to store all system information in a central location using Spring MVC, JSP, Servlets and HTML.
Used the Spring AOP module to handle transaction management services for objects in any Spring-based application.
Implemented Spring DI and Spring Transactions in business layer.
Developed data access components using JDBC, DAOs, and Beans for data manipulation.
Designed and developed database objects like Tables, Views, Stored Procedures, User Functions using PL/SQL, SQL Developer and used them in WEB components.
Used iBATIS for dynamically building SQL queries based on parameters.
Developed JavaScript and jQuery functions for all client-side validations.
Developed JUnit test cases for unit testing and used Maven as the build and configuration tool.
Used shell scripting to create jobs to run on a daily basis.
Debugged the application using Firebug and traversed through the nodes of the tree using DOM functions.
Monitored the error logs using Log4j and fixed the problems.
Used Eclipse IDE and deployed the application on WebLogic Server.
Responsible for configuring and deploying the builds on WebSphere Application Server.

Environment: Java, J2EE, JavaScript, XML, JDBC, Spring Framework, Hibernate, RESTful Web Services, WebLogic Server, Log4j, JUnit, ANT, SoapUI, Oracle 11g.

Confidential, Plano, TX

Java Developer

Responsibilities:

Designed and developed Java classes using object-oriented methodology.
Worked on a system built with Java, JSP and Servlets.
Developed Java classes and methods for handling data from the database.
Created and modified web pages using HTML and CSS with JavaScript validation.
Used JDBC/Jconnect for Oracle.
Created SQL scripts to create and drop database objects like tables, views, indexes, constraints, sequences and synonyms.
Developed SQL*Loader scripts to load data from external files that were exported from SQL Server.
Created complex PL/SQL packages incorporating multi-org functionality with many modules merged together, working with complex queries, joins and conditions.
Developed efficient queries and views to meet customer needs.
Creating Servlets, JSP for administration module.
Creating Unix Shell Scripts for sequential execution of Java scripts including data extraction, loading and Oracle Stored Procedure execution.
Developing many KSH scripts for data file movement and scheduling.
Attended and Conducted User meetings for requirement analysis and project reporting.
Tested the application, fixed bugs and provided production support.

Environment: Windows XP, Oracle 9i database, EJB 2.1, JSP, Struts Framework, BEA WebLogic 8.1, HTML, JavaScript and Eclipse.

Confidential

Java Developer

Responsibilities:

Collecting and understanding the User requirements and Functional specifications.
Development of GUI Using HTML, CSS, JSP and JavaScript.
Creating components for isolated business logic.
Deployment of application in J2EE Architecture.
Using Oracle 8i as the Database Server.
Designing EJB 2.0 components with various design patterns like Service Locator and Business Delegate.
Finalize the design specifications for the new system.
Involvement in design, development and maintenance of the application.
Performing unit, integration and performance testing, with continuous interaction with the Quality Assurance group.
Provided on call support based on the priority of the issues.

Environment: Java, JSP, SQL, MS-Access, JavaScript, HTML.
