Sr. Hadoop Developer Resume Durham NC - Hire IT People

SUMMARY

Having 8+ years of IT experience as a Developer, Designer & quality reviewer wif cross - platform integration experience usingHadoop,Java, J2EE and SOA.
Strong experience wif Hadoop components:Hive,Pig, HBase, Zookeeper, Sqoop and Flume.
Experience in Hadoop Distributed File System and Ecosystem (MapReduce, Pig, Hive, Sqoop and HBase)
Hands on experience in installing, configuring, and using Apache Hadoop ecosystems such as MapReduce, HIVE, PIG, SQOOP, SPARK, FLUME and OOZIE.
Experience on installing, configuring, and using Hadoop components like Hadoop MapReduce (MR1),YARN (MR2), HDFS, Hive, Pig, Avro, Deflate, FlumeandSqoop.
Experience working onNoSQLdatabases includingHBase and data access using HIVE.
Extensive experience inMVC(Model View Controller) architecture, design, development of multi-tier enterprise applications for J2EE platform/SOA using Java, JDBC, Servlets, EJB, Struts, Tag Libraries, Hibernate, and XML.
Experience wif a variety of data formats and protocols such as JSON,and AVRO.
Hands on experience in dealing wif Compression Codecs likeSnappy, and Gzip.
Good working experience-using Sqoop to import data into HDFS from RDBMS.
Hands on experience on apache and ClouderaHadoopenvironments.
Experienced in importing-exporting data into HDFS format.
Experienced in handlingHadoopEcosystem Projects such as Hive, Pig and Sqoop.
Experienced in developing UDFs for Hive usingJava.
Experienced in using Flume to transfer log data files toHadoopDistributed File System (HDFS)
Experience in Multiple Relational Databases like Oracle 10g and NOSQL database HBase.
Strong understanding of databases like HBase, Mongo DB & Cassandra.
Hands on experience wifHadoop, HDFS, MapReduce andHadoopEcosystem (Pig, Hive, Oozie, Flume and HBase).
Extensive experience in design, development and support Model View Controller using Struts and Spring Framework.
Develop reusable solution to maintain proper coding standard across differentjavaproject.
Closely worked wif Hadoop Admin and performed various job roles and responsibilities of Hadoop Admin.
Involved in designing, capacity arrangement, cluster setup, performance fine-tuning, monitoring, and structure planning.
Experience in installing, administering, and supporting Windows and Linux operating systems in an enterprise environment.
Ability to work TEMPeffectively in cross-functional team environments and experience of providing training to business users.
TEMPEffective leadership quality wif good skills in strategy, business development, client management and project management

TECHNICAL SKILLS

Languages/Tools: Java, C, C++, XML, HTML/XHTML, DHTML.

Hadoop: HDFS, MapReduce, Cloudera, HIVE, PIG, HBase, SQOOP, Oozie, Zookeeper, Spark, and Kafka

J2EE Standards: JDBC, JNDI, JMS, Java Mail & XML Deployment Descriptors

Web/Distributed Technologies: J2EE, Servlets, JSP, Struts, Hibernate, EJB, XML, MVC, Struts, Spring.

Operating System: Windows 95/98/NT/2000/XP, MS-DOS, UNIX, multiple flavors of Linux.

Databases / NO SQL: Oracle 10g, MS SQL Server 2000, DB2, MS Access & MySQL. Teradata, Cassandra, Greenplum and MongoDB

App/Web Servers: IBM Websphere 5.1.2/5.0/4.0/3.5 , BEA Web logic 5.1/7.0, Jdeveloper, Apache Tomcat, JBoss.

Messaging & Web Services Technology: SOAP, WSDL, UDDI, XML, SOA, JAX-RPC, IBM WebSphere MQ, JMS.

Testing & Case Tools: JUnit, Log4j, Rational Clear case, CVS, ANT, JBuilder.

Version Control Systems: Github, SVN, CVS

PROFESSIONAL EXPERIENCE

Confidential, Durham NC

Sr. Hadoop Developer

Responsibilities:

Worked on analyzing, writingHadoopMapReduce jobsusingJavaAPI,Pig and Hive.
Implemented MapReduce programs to handle semi/ unstructured data like XML, JSON, Avro data files and sequence files for log files.
CustomizedAvrotools used in MapReduce, Pig and Hive for deserialization and to work wifAvroingestion framework.
Analyze large and critical datasets using Cloudera, HDFS, HBase, MapReduce, Hive, Hive UDF, Pig, Sqoop, Zookeeper, & Spark.
Writing Sqoop jobs to import/export data from Hadoop
Involved in loading data from edge node toHDFS using shell scripting.
CreatedHBasetables to store variable data formats of PII data coming from different portfolios.
Customize Flume interceptors to encrypt and mask customer sensitive data as per requirement
Worked wif NoSQL database HBase to create tables and store data.
Developed custom aggregate functions usingSparkSQL and performed interactive querying.
Used Pig to store the data into HBase.
Hands-on experience in developing capabilities in Python using Spark framework.
Creating Hive tables, dynamic partitions, buckets for sampling, and working on them using HiveQL
Stored the data in tabular formats using Hive tables and Hive SerDe's.
Collecting and aggregating large amounts of log data using Apache Flume and staging data in HDFS for further analysis.
Worked wif NoSQL databases like HBase in creating HBase tables to load large sets of semi structured data coming from various sources.
Used Kafka as a streaming tool to load the data on Hadoop File System and move the same data to NoSQL databases.
Implemented a script to transmit sysprint information from Oracle toHBaseusingSqoop.
Implemented test scripts to support test driven development and continuous integration.
Collecting and aggregating large amounts of log data using Apache Flume and staging data in HDFS for further analysis.
Involved in writing the shell scripts for exporting log files toHadoopcluster through automated process.
Exported the analyzed data to the relational databases usingSqoopfor visualization and to generate reportsfor the BI team.
Hands on experience in running Webpack tasks to build projects properly using Node.js
Hands on experience on Hadoop Administration by working on configuration management for the nodes, and testing and benchmarking the new nodes.
Involved in cluster management via Ambari/Cloudera Manager and worked on Cluster Performance Drills.
Experience in handling backups to the metadata of the cluster and other Eco-system metadata.
Involved in standard System Admin work like creating new users in Hadoop, handling permissions and performing upgrades.
Also involved in solving Day to Day Cluster issues like finding out which jobs are taking more time, if users say dat jobs are stuck to find out the reason.
Utilized Agile Scrum Methodology to help manage and organize a team of 4 developers wif regular code review sessions.
Worked in agile environment and participated in daily scrum meetings.
Responsible for cluster maintenance, adding and removing cluster nodes, cluster monitoring and troubleshooting, manage and reviewdata backups, manage and reviewHadooplog files.
InstalledOozieworkflow engine to run multipleHiveandpigjobs.
Involved in installing Kafka on Hadoop Cluster and configure producer and consumer coding part to establish connection from twitter source to HDFS wif popular hash tags.
Supported in setting upQA environmentand updating configurations for implementing scripts wif Pig and Sqoop.

Environment: Hadoop, HDFS, Pig, Hive, Sqoop, Flume, Kafka, Spark, MapReduce, Cloudera, Avro, Snappy, Zookeeper, NoSQL, HBase, Shell Scripting, Ubuntu, Linux Red Hat.

Confidential, New York NY

Sr. Hadoop Developer

Responsibilities:

Defined, designed and developed Javaapplications, specially usingHadoop Map/Reduce by leveraging frameworks such as Cascading and Hive.
Developed workflow using Oozie for runningMap Reduce jobsandHive Queries.
Worked on loading log data directly intoHDFS using Flume.
Worked onClouderato analyze data present on top ofHDFS
Responsible for managing data from multiple sources.
Load data from various data sources intoHDFSusing Flume.
This plugin allowsHadoopMapReduceprograms, HBase,PigandHiveto work unmodified and access files directly.
Designed and implementedMapReduce-based large-scale parallel relation-learning system
Successfully loaded files to Hive and HDFS from Mongo DB Solar.
Familiarity wif a NoSQL database such as MongoDb Solar.
Successfully loaded files to Hive and HDFS from Mongo DB Solar.
Extracted files from MySQL through Sqoop and placed in HDFS and processed.
Developed Pig Latin scripts to extract data from the web server output files to load into HDFS.
Built reusable Hive UDF libraries for business requirements, which enabled users to use these UDF's in Hive Querying.
Worked on debugging, performance tuning of Hive & Pig Jobs.
Created HBase tables to store various data formats of PII data coming from different portfolios.
DevelopedPigScripts,PigUDFs andHiveScripts,HiveUDFs to load data files intoHadoop
Implemented test scripts to support test driven development and continuous integration.
Worked on tuning the performance Pig queries.
Created and maintained Technical documentation for launching Hadoop Clusters and for executing Hive queries and Pig Scripts.
Hands on experience in creating use cases based on business and user requirements to develop system functions.
Prepare Developer (Unit) Test cases and execute developer testing.
Developed unit test cases for Hadoop MapReduce jobs wif JUnit.
Involved in loading data from LINUX file system to HDFS.
Importing and exporting data into HDFS and Hive using Sqoop.
Worked on processing unstructured data using Pig and Hive.
Supported MapReduce Programs those are running on the cluster.

Environment: Hadoop, HDFS, Pig, Hive, Sqoop, Flume, HBase, Shell Scripting, Maven,Hudson/Jenkins, Ubuntu, Linux Red Hat, Mongo DB.

Confidential

Hadoop Developer

Responsibilities:

Involved in design and development phases of Software Development Life Cycle (SDLC) using Scrum methodology.
Worked on analyzing Hadoop cluster using different big data analytic tools including Pig, Hive, and MapReduce.
Developed data pipeline usingFlume, Sqoopto ingest customer behavioral data and purchase histories intoHDFSfor analysis.
Continuous monitoring and managing theHadoopclusterusingClouderaManager.
UsedPigto perform data validation on the data ingested using sqoop and flume and the cleansed data set is pushed intoHBase.
Participated in development/implementation ofClouderaHadoopenvironment.
Collecting and aggregating large amounts of log data using Apache Flume and staging data in HDFS for further analysis.
Designed and implemented MapReduce-based large-scale parallel relation-learning system.
Worked wif Zookeeper, Oozie, AppWorx and Data Pipeline Operational Services for coordinating the cluster and scheduling workflows.
Designed and built the Reporting Application, which uses theSparkSQL to fetch and generate reports on HBase table data.
Extracted the needed data from the server into HDFS and Bulk Loaded the cleaned data into HBase.
Responsible for creatingHivetables, loading the structured data resulted from MapReduce jobs into the tables and writinghivequeries to further analyze the logs to identify issues and behavioral patterns.
Involved in scheduling Oozie workflow engine to run multiple Hive and pig jobs
DevelopedHivequeries andPigscripts to analyze large datasets.
Involved in importing and exporting the data from RDBMS toHDFSand vice versa using Sqoop.
Involved in generating the Adhoc reports usingPigandHivequeries.
UsedHiveto analyze data ingested intoHBaseby usingHive-HBaseintegration and compute various metrics for reporting on the dashboard.
Developed job flows inOozieto automate the workflow for pig and hive jobs.
Loaded the aggregated data ontoOraclefromHadoopenvironment usingSqoopfor reporting on the dashboard.

Environment: RedHat Linux, HDFS, Map-Reduce, Hive,JavaJDK1.6, Pig, Sqoop, Flume, Zookeeper, Oozie, Oracle, HBase.

Confidential

Java Developer

Responsibilities:

As part of the lifecycle development prepared class model, sequence model and flow diagrams by analyzing Use cases using Rational Tools.
Reviewing and analyzing data model for developing the Presentation layer and Value Objects.
Involved in developing Database access components using Spring DAO integrated wif Hibernate for accessing the data.
Responsible for technical and application architecture for the enterprise business management software.
The technical architecture included integration wif Java objects, XML message structures, Java Messaging Service and Tibco RV for pub-sub services.
Extensive use of Struts Framework for Controller components and view components.
Involved in writing the exception and validation classes using Struts validation rules.
Involved in writing the validation rules classes for general server side validations for implementing validation rules as part observer J2EE design pattern.
Used Hibernate for the persistence of the project.
Used Spring AOP and Dependency injection during various modules of project.
Implemented Service Oriented Architecture (SOA) using JMS for sending and receiving messages while creating web services.
Spring framework was used for dependency injection and was integrate wif different frameworks like Struts, Hibernate
Developed various java objects (POJO) as part of persistence classes for OR mapping.
Developed web services using SOAP and WSDL wif Axis.
Implemented EJB (Message Driven Beans) in the Service Layer.
Involved in working wif JMS MQ Queues (Producers/Consumers) in Sending and Receiving Asynchronous messages via MDB’s.
Developed, implemented, and maintained an asynchronous, AJAX based rich client for improved customer experience using XML data and XSLT templates.
Involved in writing the parsers for parsing and building the XML documents using SAX and DOM Parsers.
Designed and developed architecture plans, timelines, and system technical and data architecture.
Developed SQL stored procedures and prepared statements for updating and accessing data from database.
Used JBoss for deploying various components of application and MAVEN as build tool and developed build file for compiling the code of creating WAR files.
Used CVS for version control.
Performed Unit testing and rigorous integration testing of the whole application.

Environment: Java, J2EE, EJB, JMS, Strut, JBoss, Hibernate, JSP, JSTL, AJAX, CVS, JavaScript, HTML, XML, MAVEN, SQL, Oracle, SOA, SAX and DOM Parser, Web Services (SOAP, WSDL), Spring, Windows.

Confidential

Java Developer

Responsibilities:

Involved in design and development phases of Software Development Life Cycle (SDLC)
Involved in designing UML Use case diagrams, Class diagrams, and Sequence diagrams using Rational Rose.
Followed agile methodology and SCRUM meetings to track, optimize and tailored features to customer needs.
Developed user interface using JSP, JSP Tag libraries, and Java Script to simplify the complexities of the application.
Implemented Model View Controller (MVC) architecture using Jakarta Struts frameworks at presentation tier.
Developed a Dojo based front end including forms and controls and programmed event handling.
Implemented SOA architecture wif web services using JAX-RS (REST) and JAX-WS (SOAP)
Developed various Enterprise Java Bean components to fulfill the business functionality.
Created Action Classes which route submittals to appropriate EJB components and render retrieved information.
Validated all forms using Struts validation framework and implemented Tiles framework in the presentation layer.
Used Core java and object oriented concepts.
Extensively used Hibernate in data access layer to access and update information in the database.
Used Spring Framework for Dependency injection and integrated it wif the Struts Framework and Hibernate.
Used JDBC to connect to backend databases, Oracle and SQL Server 2005.
Proficient in writing SQL queries, stored procedures for multiple databases, Oracle and SQL Server 2005.
Wrote Stored Procedures using PL/SQL. Performed query optimization to achieve faster indexing and making the system more scalable.
Deployed application on windows using IBM Web Sphere Application Server.
Used Java Messaging Services (JMS) for reliable and asynchronous exchange of important information such as payment status report.
Used Web Services - WSDL and REST for getting credit card information from third party and used SAX and DOM XML parsers for data retrieval.
Implemented SOA architecture wif web services using Web Services like JAX-WS.
Used ANT scripts to build the application and deployed on Web Sphere Application Server

Environment: Core Java, J2EE, Oracle, SQL Server, JSP, Struts, Spring, JDK, Hibernate, JavaScript, HTML, CSS, AJAX, JUnit, Log4j, Web Services, Windows.

We provide IT Staff Augmentation Services!

Sr. Hadoop Developer Resume

Durham, NC

We'd love your feedback!

Resume Categories

Client Services

Job Seekers

Visa Sponsorship