Sr. Hadoop Developer Resume
Boston, MA
SUMMARY:
- 7 years of IT experience in analysis, architectural design, prototyping, development, integration, and testing of applications using Java/J2EE on Linux, including over 4 years of work experience in Hadoop 2.0 environments with the Hadoop ecosystem, Confidential 2.2, CDH 5, big data analytics, ETL QA, NoSQL, and SQL technologies as a Hadoop Developer.
- 4+ years of experience with various Hadoop distributions such as Confidential and Confidential.
- Experience in analyzing data in HDFS through Hive, Pig, and HBase.
- Experience creating use case models and use case, class, and sequence diagrams using Microsoft Visio and Rational Rose.
- Experience in the design and development of object-oriented analysis and design (OOAD) based systems using Rational Rose.
- Extensive knowledge of developing Ant scripts to build and deploy applications, and experience using Maven to build and manage Java projects.
- Experience working with Confidential, which makes it easier to read and write data on the grid.
- Experience in data streaming using Apache Kafka, Storm, and Spark.
- Experience working with Apache Kafka, which aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds.
- Imported data from various data sources, performed transformations using Hive and MapReduce, loaded data into HDFS, and extracted data from MySQL into HDFS using Sqoop.
- Experience in building bi-directional data pipelines between HDFS and relational databases with Sqoop.
- Good experience in installing and configuring security for Hadoop environments.
- Worked on open-source cluster computing frameworks, including Apache Spark and Storm.
- Experience in configuring and Confidential ZooKeeper quorum to support large clusters.
- Experience in ETL tools.
TECHNICAL SKILLS:
Hadoop/Big Data platform: HDFS, MapReduce, HBase, Storm, Spark, Hive, Pig, Oozie, ZooKeeper, Flume, Sqoop, Kafka.
Hadoop distribution: Confidential, Confidential
Programming languages and tools: C/C++, Java, Java 8, UNIX shell scripts, Python, Pig Latin, PL/SQL, Scala, Talend, NoSQL, ETL Informatica, Datameer.
Operating Systems: Linux, CentOS
Linux Experience: System Administration Tools, Puppet, Apache
Virtual Experience: VMware, Xen.
Application Software: AMR, AWS, SSH, Telnet, FTP, Terminal client, and Remote Desktop Connection.
Hardware: HP ProLiant, HP Blades, Dell PowerEdge, and IBM xSeries.
Data Storage and Database: Oracle 9i, 10g, 11g, 12, Oracle RAC, MS Access, Cassandra, MySQL 3.x, MS BI (SSIS, SSRS)
PROFESSIONAL EXPERIENCE:
Confidential, Boston, MA
Sr. Hadoop Developer
Responsibilities:
- Worked with the Confidential 2.2 distribution to handle an abundance of new data, using Hadoop's ability to store and process it.
- Used Confidential, which made it possible to add our own datasets and connect them to existing tools and applications.
- Worked with Apache Kafka, which aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds.
- Used Kafka features such as distribution, partitioning, and the replicated commit log service to maintain feeds for messaging systems.
- Used Kafka to process raw data, transforming it into new Kafka topics for further consumption.
- Worked on open-source cluster computing frameworks, including Apache Spark and Storm.
- Developed Java programs for HBase, Spark, Storm, and Kafka, as well as for some MapReduce jobs.
- Analyzed the Hadoop stack and various big data analytics tools, including Pig, Hive, HBase, and Sqoop.
- Developed simple to complex MapReduce jobs using Hive and Pig.
- Worked on data management by developing Hadoop MapReduce applications, including a close look at framework components and the use of Hadoop for a variety of data analysis tasks.
- Worked with Confidential, which provides read and write interfaces for Pig and MapReduce and uses Hive's command line interface for issuing data definition and metadata commands.
- Worked with the Confidential table and storage management layer for Hadoop, which supports users of different data processing tools such as Pig, Hive, and MapReduce.
Technology Used: Apache Hadoop 2.0, MapReduce, HDFS, Confidential 2/2.2, HBase, Hive, Pig, Oozie, Flume, Kafka, Storm, Spark (Scala), Java (JDK 1.6).
Hadoop Developer
Confidential
Responsibilities:
- Responsible for building scalable distributed data solutions using Hadoop.
- Developed simple to complex MapReduce jobs using Hive and Pig.
- Analyzed the Hadoop stack and various big data analytics tools, including Pig, Hive, HBase, and Sqoop.
- Used Hive to expose data for further analysis and to transform files from different analytical formats into text files.
- Wrote shell scripts to load data from the Linux file system into HDFS.
- Created Hive tables, loaded them with data, and wrote Hive queries that run internally as MapReduce jobs.
- Worked on cloud environments such as AWS, standing up and automating the deployment of solutions.
- Designed and developed software for bioinformatics and Next Generation Sequencing (NGS) using the Hadoop MapReduce framework and MongoDB on Confidential S3, Confidential EC2, and Confidential Elastic MapReduce (EMR).
- Worked on Splunk dashboards to perform data analytics and integration for analyzing and visualizing data in Hadoop.
- Used Splunk to generate charts, visualizations, and dashboards using the pivot interface.
- Transparently cached search results in Hadoop.
- Designed a data warehouse using Hive.
- Worked extensively with Sqoop for importing metadata. Extensively used Pig for data cleansing.
- Created partitioned tables in Hive. Worked with business teams and created Hive queries for ad hoc access.
- Evaluated usage of Oozie for workflow orchestration. Mentored analyst and test teams in writing Hive queries.
- Optimized MapReduce jobs to use HDFS efficiently by using various compression mechanisms.
- Worked in data streaming using Kafka.
- Used Kafka for publish-subscribe messaging as a distributed commit log, gaining experience with its speed, scalability, and durability.
- Imported data from various data sources, performed transformations using Hive and MapReduce, loaded data into HDFS, and extracted data using Sqoop.
- Analyzed the data by performing Hive queries and running Pig scripts to study customer behavior.
- Used UDFs to implement business logic in Hadoop.
- Implemented business logic by writing UDFs in Java and used various UDFs from Piggybank and other sources.
- Worked with application teams to install operating system and Hadoop updates, patches, and version upgrades as required.
Technology Used: Confidential EC2, Hadoop, HDFS, Hive, Pig, Sqoop, HBase, Java (JDK 1.6), Linux, MapReduce, Oozie, Oracle 11g/10g
Hadoop Administrator/Developer
Confidential, Charlotte, NC
Responsibilities:
- Provisioning, building, and supporting physical and virtual Linux servers using VMware for production, QA, and development environments.
- Responsible for implementation and ongoing administration of Hadoop infrastructure.
- Deploying new hardware and software environments required for Hadoop and expanding existing environments.
- HDFS support and maintenance.
- Diligently teaming with the infrastructure, network, database, application and business intelligence teams to guarantee high data quality and availability.
- Collaborating with application teams to install operating system and Hadoop updates, patches, and version upgrades when required.
- Responsible for cluster maintenance, adding and removing cluster nodes, cluster monitoring and Confidential, managing and reviewing data backups, and managing and reviewing Hadoop log files.
- Continuous monitoring and management of the Hadoop cluster through Confidential Manager and Confidential health tests.
- Commissioning and decommissioning DataNodes in the cluster when problems occurred.
- Used Confidential Navigator for generating cluster usage reports.
- Published an operational procedure for deleting and cleaning up Confidential components without affecting the OS-level configuration.
- Monitoring Hadoop cluster job performance and capacity planning.
- Monitoring Hadoop cluster connectivity and security.
- Working with data delivery teams to set up new Hadoop users, which also includes setting up Linux users and testing HDFS, Hive, Pig, and MapReduce access for the new users.
- Loading and transforming large sets of structured, semi-structured, and unstructured data from HBase through Sqoop and placing them in HDFS for further processing.
- Performing Linux systems administration on production and development servers (Red Hat Linux, CentOS and other UNIX utilities).
- Installing patches and packages on Unix/Linux servers.
- Installation and configuration of the VMware vSphere client, virtual server creation, and resource allocation.
- Performance Tuning, Client/Server Connectivity and Database Consistency Checks using different Utilities.
- Shell scripting for Linux/Unix Systems Administration and related tasks.
Technology Used: Red Hat Linux/CentOS 4, 5, 6, Logical Volume Manager, Hadoop, VMware ESX 5.1/5.5, Apache and Tomcat Web Server, Oracle 11, 12, Oracle RAC 12c, HPSM, HPSA.
Confidential
Java Developer
Responsibilities:
- Involved in developing business domain concepts into use cases, sequence diagrams, class diagrams, component diagrams, and implementation diagrams.
- Implemented various J2EE Design Patterns such as Model-View-Controller, Data Access Object, Business Delegate and Transfer Object.
- Worked on JVM implementations in JavaScript projects that were unsuitable for production deployment, and on development tools that avoid the need to recompile.
- Worked with direct buffers, for which the VM, on allocation, effectively registers a cleanup method with the garbage collector.
- Responsible for analysis and design of the application based on MVC Architecture, using open source Struts Framework.
- Involved in configuring Struts and Tiles and developing the configuration files.
- Developed Struts Action classes and Validation classes using Struts controller component and Struts validation framework.
- Developed and deployed UI layer logic using JSP, XML, JavaScript, and HTML/DHTML.
- Used the Spring Framework and integrated it with Struts.
- Involved in configuring web.xml and struts-config.xml according to the Struts framework.
- Provided connections using JDBC to the database and developed SQL queries to manipulate the data.
- Developed DAOs using Spring JdbcTemplate to run performance-intensive queries.
- Developed an Ant script for automatic generation and deployment of the web service.
- Wrote stored procedures and used Java APIs to call these procedures.
- Developed various test cases such as unit tests, mock tests, and integration tests using JUnit.
- Experience writing Stored Procedures, Functions and Packages.
- Used log4j to perform logging in the applications.
Technology Used: Java, J2EE, JVM, Buffers, NIO, Struts MVC, Tiles, JDBC, JSP, JavaScript, HTML, Spring IoC, Spring AOP, JAX-WS, Ant, WebSphere Application Server, Oracle, JUnit, Log4j, Eclipse.
Confidential
Java Developer
Responsibilities:
- Responsible for gathering and analyzing requirements and converting them into technical specifications.
- Used Rational Rose for creating sequence and class diagrams.
- Developed presentation layer using JSP, Java, HTML and JavaScript.
- Used Spring Core Annotations for Dependency Injection.
- Designed and developed a ‘Convention Based Coding’ approach utilizing Hibernate's persistence framework and O-R mapping capability to enable dynamic fetching and display of various table data with JSF tag libraries.
- Designed and developed the Hibernate configuration and the session-per-request design pattern for database connectivity and for accessing the session for database transactions, respectively.
- Used HQL and SQL for fetching and storing data in databases.
- Participated in the design and development of database schema and Entity-Relationship diagrams of the backend Oracle database tables for the application.
- Implemented web services with Apache Axis.
- Designed and developed stored procedures and triggers in Oracle to meet the needs of the entire application.
- Developed complex SQL queries for extracting data from the database.
- Designed and built SOAP web service interfaces implemented in Java.
- Used Apache Ant for the build process.
- Used ClearCase for version control and ClearQuest for bug tracking.
Technology Used: Java, JDK 1.5, Servlets, Hibernate, Ajax, Oracle 10g, Eclipse, Apache Ant, Web Services (SOAP), Apache Axis, WebLogic Server, JavaScript, HTML, CSS, XML
