Sr. Hadoop Developer Resume

Durham, NC

SUMMARY

  • 8+ years of IT experience as a developer, designer, and quality reviewer with cross-platform integration experience using Hadoop, Java, J2EE, and SOA.
  • Strong experience with Hadoop components: Hive, Pig, HBase, Zookeeper, Sqoop, and Flume.
  • Experience in Hadoop Distributed File System and Ecosystem (MapReduce, Pig, Hive, Sqoop and HBase)
  • Hands on experience in installing, configuring, and using Apache Hadoop ecosystems such as MapReduce, HIVE, PIG, SQOOP, SPARK, FLUME and OOZIE.
  • Experience installing, configuring, and using Hadoop components like Hadoop MapReduce (MR1), YARN (MR2), HDFS, Hive, Pig, Avro, Deflate, Flume, and Sqoop.
  • Experience working on NoSQL databases including HBase and data access using Hive.
  • Extensive experience in MVC (Model View Controller) architecture, design, and development of multi-tier enterprise applications for the J2EE platform/SOA using Java, JDBC, Servlets, EJB, Struts, Tag Libraries, Hibernate, and XML.
  • Experience with a variety of data formats and protocols such as JSON and Avro.
  • Hands-on experience with compression codecs like Snappy and Gzip.
  • Good working experience using Sqoop to import data into HDFS from RDBMS.
  • Hands-on experience with Apache and Cloudera Hadoop environments.
  • Experienced in importing data into and exporting data from HDFS.
  • Experienced in handling Hadoop ecosystem projects such as Hive, Pig, and Sqoop.
  • Experienced in developing UDFs for Hive using Java.
  • Experienced in using Flume to transfer log data files to the Hadoop Distributed File System (HDFS).
  • Experience with relational databases like Oracle 10g and the NoSQL database HBase.
  • Strong understanding of databases like HBase, MongoDB, and Cassandra.
  • Hands-on experience with Hadoop, HDFS, MapReduce, and the Hadoop ecosystem (Pig, Hive, Oozie, Flume, and HBase).
  • Extensive experience in the design, development, and support of Model View Controller applications using Struts and the Spring Framework.
  • Developed reusable solutions to maintain proper coding standards across different Java projects.
  • Worked closely with the Hadoop Admin and performed various Hadoop Admin roles and responsibilities.
  • Involved in designing, capacity arrangement, cluster setup, performance fine-tuning, monitoring, and structure planning.
  • Experience in installing, administering, and supporting Windows and Linux operating systems in an enterprise environment.
  • Ability to work effectively in cross-functional team environments and experience providing training to business users.
  • Effective leadership qualities with good skills in strategy, business development, client management, and project management.

TECHNICAL SKILLS

Languages/Tools: Java, C, C++, XML, HTML/XHTML, DHTML.

Hadoop: HDFS, MapReduce, Cloudera, HIVE, PIG, HBase, SQOOP, Oozie, Zookeeper, Spark, and Kafka

J2EE Standards: JDBC, JNDI, JMS, Java Mail & XML Deployment Descriptors

Web/Distributed Technologies: J2EE, Servlets, JSP, Struts, Hibernate, EJB, XML, MVC, Spring.

Operating System: Windows 95/98/NT/2000/XP, MS-DOS, UNIX, multiple flavors of Linux.

Databases / NoSQL: Oracle 10g, MS SQL Server 2000, DB2, MS Access, MySQL, Teradata, Cassandra, Greenplum, and MongoDB

App/Web Servers: IBM WebSphere 5.1.2/5.0/4.0/3.5, BEA WebLogic 5.1/7.0, JDeveloper, Apache Tomcat, JBoss.

Messaging & Web Services Technology: SOAP, WSDL, UDDI, XML, SOA, JAX-RPC, IBM WebSphere MQ, JMS.

Testing & Case Tools: JUnit, Log4j, Rational ClearCase, CVS, ANT, JBuilder.

Version Control Systems: GitHub, SVN, CVS

PROFESSIONAL EXPERIENCE

Confidential, Durham NC

Sr. Hadoop Developer

Responsibilities:

  • Worked on analyzing and writing Hadoop MapReduce jobs using the Java API, Pig, and Hive.
  • Implemented MapReduce programs to handle semi-structured and unstructured data such as XML, JSON, Avro data files, and sequence files for log files.
  • Customized Avro tools used in MapReduce, Pig, and Hive for deserialization and to work with the Avro ingestion framework.
  • Analyze large and critical datasets using Cloudera, HDFS, HBase, MapReduce, Hive, Hive UDF, Pig, Sqoop, Zookeeper, & Spark.
  • Wrote Sqoop jobs to import data into and export data from Hadoop.
  • Involved in loading data from the edge node to HDFS using shell scripting.
  • Created HBase tables to store variable data formats of PII data coming from different portfolios.
  • Customized Flume interceptors to encrypt and mask sensitive customer data as per requirements.
  • Worked with the NoSQL database HBase to create tables and store data.
  • Developed custom aggregate functions using Spark SQL and performed interactive querying.
  • Used Pig to store the data into HBase.
  • Hands-on experience in developing capabilities in Python using the Spark framework.
  • Created Hive tables, dynamic partitions, and buckets for sampling, and worked on them using HiveQL.
  • Stored the data in tabular formats using Hive tables and Hive SerDes.
  • Collecting and aggregating large amounts of log data using Apache Flume and staging data in HDFS for further analysis.
  • Worked with NoSQL databases like HBase, creating HBase tables to load large sets of semi-structured data coming from various sources.
  • Used Kafka as a streaming tool to load the data into the Hadoop file system and move the same data to NoSQL databases.
  • Implemented a script to transmit sysprint information from Oracle to HBase using Sqoop.
  • Implemented test scripts to support test driven development and continuous integration.
  • Involved in writing shell scripts for exporting log files to the Hadoop cluster through an automated process.
  • Exported the analyzed data to relational databases using Sqoop for visualization and to generate reports for the BI team.
  • Hands on experience in running Webpack tasks to build projects properly using Node.js
  • Hands on experience on Hadoop Administration by working on configuration management for the nodes, and testing and benchmarking the new nodes.
  • Involved in cluster management via Ambari/Cloudera Manager and worked on Cluster Performance Drills.
  • Experience in handling backups to the metadata of the cluster and other Eco-system metadata.
  • Involved in standard System Admin work like creating new users in Hadoop, handling permissions and performing upgrades.
  • Also involved in solving day-to-day cluster issues, such as identifying which jobs are taking more time and finding the cause when users report that jobs are stuck.
  • Utilized Agile Scrum methodology to help manage and organize a team of 4 developers with regular code review sessions.
  • Worked in agile environment and participated in daily scrum meetings.
  • Responsible for cluster maintenance, adding and removing cluster nodes, cluster monitoring and troubleshooting, managing and reviewing data backups, and managing and reviewing Hadoop log files.
  • Installed the Oozie workflow engine to run multiple Hive and Pig jobs.
  • Involved in installing Kafka on the Hadoop cluster and configuring the producer and consumer code to establish a connection from the Twitter source to HDFS with popular hashtags.
  • Supported setting up the QA environment and updating configurations for implementing scripts with Pig and Sqoop.

Environment: Hadoop, HDFS, Pig, Hive, Sqoop, Flume, Kafka, Spark, MapReduce, Cloudera, Avro, Snappy, Zookeeper, NoSQL, HBase, Shell Scripting, Ubuntu, Linux Red Hat.

Confidential, New York NY

Sr. Hadoop Developer

Responsibilities:

  • Defined, designed, and developed Java applications, especially using Hadoop MapReduce, by leveraging frameworks such as Cascading and Hive.
  • Developed workflows using Oozie for running MapReduce jobs and Hive queries.
  • Worked on loading log data directly into HDFS using Flume.
  • Worked on Cloudera to analyze data stored on HDFS.
  • Responsible for managing data from multiple sources.
  • Loaded data from various data sources into HDFS using Flume.
  • This plugin allows Hadoop MapReduce programs, HBase, Pig, and Hive to work unmodified and access files directly.
  • Designed and implemented a MapReduce-based large-scale parallel relation-learning system.
  • Successfully loaded files to Hive and HDFS from MongoDB Solar.
  • Familiarity with a NoSQL database such as MongoDB Solar.
  • Extracted files from MySQL through Sqoop and placed in HDFS and processed.
  • Developed Pig Latin scripts to extract data from the web server output files to load into HDFS.
  • Built reusable Hive UDF libraries for business requirements, which enabled users to use these UDFs in Hive queries.
  • Worked on debugging, performance tuning of Hive & Pig Jobs.
  • Created HBase tables to store various data formats of PII data coming from different portfolios.
  • Developed Pig scripts, Pig UDFs, Hive scripts, and Hive UDFs to load data files into Hadoop.
  • Implemented test scripts to support test driven development and continuous integration.
  • Worked on tuning the performance of Pig queries.
  • Created and maintained Technical documentation for launching Hadoop Clusters and for executing Hive queries and Pig Scripts.
  • Hands on experience in creating use cases based on business and user requirements to develop system functions.
  • Prepared developer (unit) test cases and executed developer testing.
  • Developed unit test cases for Hadoop MapReduce jobs with JUnit.
  • Involved in loading data from the Linux file system to HDFS.
  • Imported and exported data into HDFS and Hive using Sqoop.
  • Worked on processing unstructured data using Pig and Hive.
  • Supported MapReduce programs running on the cluster.

Environment: Hadoop, HDFS, Pig, Hive, Sqoop, Flume, HBase, Shell Scripting, Maven, Hudson/Jenkins, Ubuntu, Linux Red Hat, MongoDB.

Confidential

Hadoop Developer

Responsibilities:

  • Involved in design and development phases of Software Development Life Cycle (SDLC) using Scrum methodology.
  • Worked on analyzing Hadoop cluster using different big data analytic tools including Pig, Hive, and MapReduce.
  • Developed a data pipeline using Flume and Sqoop to ingest customer behavioral data and purchase histories into HDFS for analysis.
  • Continuously monitored and managed the Hadoop cluster using Cloudera Manager.
  • Used Pig to perform data validation on the data ingested using Sqoop and Flume, and pushed the cleansed data set into HBase.
  • Participated in the development and implementation of the Cloudera Hadoop environment.
  • Collecting and aggregating large amounts of log data using Apache Flume and staging data in HDFS for further analysis.
  • Designed and implemented MapReduce-based large-scale parallel relation-learning system.
  • Worked with Zookeeper, Oozie, AppWorx, and Data Pipeline Operational Services for coordinating the cluster and scheduling workflows.
  • Designed and built the reporting application, which uses Spark SQL to fetch and generate reports on HBase table data.
  • Extracted the needed data from the server into HDFS and Bulk Loaded the cleaned data into HBase.
  • Responsible for creating Hive tables, loading the structured data resulting from MapReduce jobs into the tables, and writing Hive queries to further analyze the logs to identify issues and behavioral patterns.
  • Involved in scheduling the Oozie workflow engine to run multiple Hive and Pig jobs.
  • Developed Hive queries and Pig scripts to analyze large datasets.
  • Involved in importing and exporting data from RDBMS to HDFS and vice versa using Sqoop.
  • Involved in generating ad hoc reports using Pig and Hive queries.
  • Used Hive to analyze data ingested into HBase by using Hive-HBase integration and computed various metrics for reporting on the dashboard.
  • Developed job flows in Oozie to automate the workflow for Pig and Hive jobs.
  • Loaded the aggregated data into Oracle from the Hadoop environment using Sqoop for reporting on the dashboard.

Environment: RedHat Linux, HDFS, MapReduce, Hive, Java JDK 1.6, Pig, Sqoop, Flume, Zookeeper, Oozie, Oracle, HBase.

Confidential

Java Developer

Responsibilities:

  • As part of the lifecycle development prepared class model, sequence model and flow diagrams by analyzing Use cases using Rational Tools.
  • Reviewing and analyzing data model for developing the Presentation layer and Value Objects.
  • Involved in developing database access components using Spring DAO integrated with Hibernate for accessing the data.
  • Responsible for technical and application architecture for the enterprise business management software.
  • The technical architecture included integration with Java objects, XML message structures, Java Messaging Service, and Tibco RV for pub-sub services.
  • Extensive use of Struts Framework for Controller components and view components.
  • Involved in writing the exception and validation classes using Struts validation rules.
  • Involved in writing the validation rule classes for general server-side validations, implementing validation rules as part of the Observer J2EE design pattern.
  • Used Hibernate for the persistence of the project.
  • Used Spring AOP and Dependency injection during various modules of project.
  • Implemented Service Oriented Architecture (SOA) using JMS for sending and receiving messages while creating web services.
  • Used the Spring framework for dependency injection and integrated it with other frameworks such as Struts and Hibernate.
  • Developed various Java objects (POJOs) as part of persistence classes for OR mapping.
  • Developed web services using SOAP and WSDL with Axis.
  • Implemented EJB (Message Driven Beans) in the service layer.
  • Involved in working with JMS MQ queues (producers/consumers) for sending and receiving asynchronous messages via MDBs.
  • Developed, implemented, and maintained an asynchronous, AJAX based rich client for improved customer experience using XML data and XSLT templates.
  • Involved in writing the parsers for parsing and building the XML documents using SAX and DOM Parsers.
  • Designed and developed architecture plans, timelines, and system technical and data architecture.
  • Developed SQL stored procedures and prepared statements for updating and accessing data from database.
  • Used JBoss for deploying various components of the application and Maven as the build tool, and developed the build file for compiling the code and creating WAR files.
  • Used CVS for version control.
  • Performed Unit testing and rigorous integration testing of the whole application.

Environment: Java, J2EE, EJB, JMS, Struts, JBoss, Hibernate, JSP, JSTL, AJAX, CVS, JavaScript, HTML, XML, Maven, SQL, Oracle, SOA, SAX and DOM Parser, Web Services (SOAP, WSDL), Spring, Windows.

Confidential

Java Developer

Responsibilities:

  • Involved in design and development phases of Software Development Life Cycle (SDLC)
  • Involved in designing UML Use case diagrams, Class diagrams, and Sequence diagrams using Rational Rose.
  • Followed agile methodology and Scrum meetings to track, optimize, and tailor features to customer needs.
  • Developed the user interface using JSP, JSP tag libraries, and JavaScript to simplify the complexities of the application.
  • Implemented Model View Controller (MVC) architecture using Jakarta Struts frameworks at presentation tier.
  • Developed a Dojo based front end including forms and controls and programmed event handling.
  • Implemented SOA architecture with web services using JAX-RS (REST) and JAX-WS (SOAP).
  • Developed various Enterprise Java Bean components to fulfill the business functionality.
  • Created Action Classes which route submittals to appropriate EJB components and render retrieved information.
  • Validated all forms using Struts validation framework and implemented Tiles framework in the presentation layer.
  • Used core Java and object-oriented concepts.
  • Extensively used Hibernate in data access layer to access and update information in the database.
  • Used Spring Framework for dependency injection and integrated it with the Struts Framework and Hibernate.
  • Used JDBC to connect to backend databases, Oracle and SQL Server 2005.
  • Proficient in writing SQL queries and stored procedures for multiple databases, including Oracle and SQL Server 2005.
  • Wrote stored procedures using PL/SQL. Performed query optimization to achieve faster indexing and make the system more scalable.
  • Deployed the application on Windows using IBM WebSphere Application Server.
  • Used Java Messaging Services (JMS) for reliable and asynchronous exchange of important information such as payment status reports.
  • Used web services (WSDL and REST) for getting credit card information from a third party and used SAX and DOM XML parsers for data retrieval.
  • Implemented SOA architecture with web services using JAX-WS.
  • Used ANT scripts to build the application and deployed it on WebSphere Application Server.

Environment: Core Java, J2EE, Oracle, SQL Server, JSP, Struts, Spring, JDK, Hibernate, JavaScript, HTML, CSS, AJAX, JUnit, Log4j, Web Services, Windows.
