Hadoop Developer Resume

San Antonio, Texas

SUMMARY

  • 7+ years of IT experience in analysis, design, and development using Talend, Hadoop, Java, and J2EE.
  • Experience in developing and implementing MapReduce programs using Hadoop to process Big Data as per requirements.
  • Hands-on experience in Apache Hadoop ecosystem components such as Hadoop Distributed File System (HDFS), MapReduce, Hive, Sqoop, Maven, HBase, Pig, Kafka, ZooKeeper, Scala, Flume, Storm, and Oozie.
  • Worked on Hive and HBase integration.
  • Good knowledge of Hadoop architecture and its components, such as HDFS, JobTracker, TaskTracker, NameNode, DataNode, and MRv1 and MRv2 (YARN).
  • Experience in developing MapReduce jobs in Java for data cleansing, transformation, preprocessing, and analysis. Implemented multiple mappers to handle data from multiple sources.
  • Experience in installation, configuration and management of Hadoop Clusters.
  • Experienced in installing, configuring, and administering Hadoop clusters on major Hadoop distributions (Hortonworks, Cloudera).
  • Knowledge of Hadoop daemon functionality, resource utilization, and dynamic tuning to keep clusters available and efficient.
  • Hands-on experience writing custom UDFs to extend Hive and Pig core functionality.
  • Knowledge of installing, configuring, and using Hadoop components such as MapReduce (MR1), YARN (MR2), HDFS, Hive, Pig, Flume, and Sqoop.
  • Experience in setting up data gathering tools such as Flume and Sqoop.
  • Installed and configured Talend ETL on single- and multi-server environments.
  • Created standards and best practices for Talend ETL components and jobs.
  • Experience in working with Flume to load the log data from multiple sources directly into HDFS.
  • Hands-on experience with Apache Spark, Spark SQL, and Spark Streaming.
  • Knowledge of NoSQL databases such as Cassandra, MongoDB, and HBase.
  • Experience in Hive partitioning and bucketing, performing different types of joins on Hive tables, and implementing Hive SerDes such as Regex, JSON, and Avro.
  • Experience in scripting using UNIX shell scripts.
  • Experience in analysing, designing, and developing ETL strategies and processes, writing ETL specifications, and Informatica development.
  • Extensively used Informatica PowerCenter for the extraction, transformation, and loading process.
  • Used Talend Studio to create Talend jobs for loading data into various Oracle tables and for data integration.
  • Worked extensively on ETL mappings and on the analysis and documentation of OLAP report requirements. Good understanding of OLAP concepts, especially when working with large data sets.
  • Experience in dimensional data modelling using star and snowflake schemas.
  • Good knowledge of data mining and machine learning techniques.
  • Proficient in Oracle 9i/10g/11g, SQL and PL/SQL.
  • Experience in integrating various data sources such as Oracle, DB2, Sybase, SQL Server, and MS Access, and non-relational sources such as flat files, into a staging area.
  • Developed core modules in large cross-platform applications using Java and J2EE, with experience in core Java concepts such as OOP, multithreading, collections, and I/O.
  • Hands-on experience with JDBC, JavaScript, jQuery, Linux, UNIX, HTML, and AWS.
  • Developed applications using Java, RDBMS, and Linux shell scripting.
  • Good interpersonal, communication, and problem-solving skills; a motivated team player.
  • Able to be a valuable contributor to the company.

TECHNICAL SKILLS

Hadoop Ecosystem: Hadoop, MapReduce, Sqoop, Hive, Oozie, Pig, HDFS, ZooKeeper, Flume, HBase, Impala, Spark, Storm, Hadoop distributions (Cloudera, Hortonworks, and Pivotal), Solr

NoSQL Databases: HBase, Cassandra, MongoDB

Java & J2EE Technologies: Core Java, Servlets, JSP, JDBC, NetBeans, Eclipse

Languages: C, Java, Python, SQL, PL/SQL, Pig Latin, HiveQL, UNIX Shell Scripting

Databases: Oracle 11g/10g/9i, MySQL, DB2, MS SQL Server

Application Servers: Apache Tomcat, JBoss, IBM WebSphere, WebLogic

Web Services: WSDL, SOAP, REST

Methodologies: Agile, Scrum

Web Technologies: AJAX

PROFESSIONAL EXPERIENCE

Confidential, San Antonio, Texas

Hadoop Developer

Responsibilities:

  • Installed and configured MapReduce, Hive, and HDFS; implemented a CDH3 Hadoop cluster on CentOS. Assisted with performance tuning and monitoring.
  • Interacted with Solution Architects and Business Analysts to gather requirements and update the Solution Architect Document.
  • Created jobs to perform record count validation and schema validation.
  • Created contexts to pass values between parent and child jobs, in both directions, throughout the process.
  • Created Hadoop cluster connections to access HDFS.
  • Performed analysis, design, development, testing, and deployment for ingestion, integration, and provisioning using Agile methodology.
  • Worked on Talend Administration Center (TAC) for scheduling jobs and adding users.
  • Extensively used components such as tWaitForFile, tIterateToFlow, tFlowToIterate, tHashOutput, tHashInput, tMap, tRunJob, tJava, tNormalize, and the tFile components to create Talend jobs.
  • Developed jobs to move inbound files to HDFS file location based on monthly, weekly, daily and hourly partitioning.
  • Developed jobs to export HDFS files to Hive tables and views depending on the schema versions.
  • Developed joblets that are reused in different processes in the flow.
  • Developed an error-logging module to capture both system errors and logical errors, including email notification, table updates, and moving files to error directories.
  • Performed unit and integration testing after development and had the code reviewed.

Environment: Hortonworks, Talend Studio Data Integration 5.2.2 and 5.6, Talend Studio Big Data Platform 5.3.1, UNIX, XML files, Flat files, HL7 files, Hadoop 2.4.1, HDFS, Hive 0.13, Agile Methodology.

Confidential, Moline, Illinois

Hadoop Developer

Responsibilities:

  • Worked on distributed/cloud computing (MapReduce/Hadoop, HBase, Hive, Pig, Spark, Storm, Kafka, Sqoop, Oozie, etc.).
  • Loaded customer profile data, customer spending data, and credit data from legacy warehouses into HDFS using Sqoop.
  • Created a Hive aggregator to update the Hive table after running the data profiling job.
  • Created Hive tables, loaded them with data, and wrote Hive queries that run internally as MapReduce jobs.
  • Implemented Partitioning, Dynamic Partitioning and Bucketing in Hive.
  • Developed Hive queries to process the data and generate data cubes for visualization.
  • Built reusable Hive UDF libraries for business requirements, enabling users to call these UDFs in Hive queries.
  • Wrote a Hive UDF to sort struct fields and return a complex data type.
  • Modelled Hive partitions extensively for data separation and faster data processing, and followed Pig and Hive best practices for tuning.
  • Developed Pig Latin scripts to extract data from web server output files and load it into HDFS.
  • Developed Pig UDFs to preprocess the data for analysis.
  • Supported MapReduce programs running on the cluster.
  • Used Flume to collect log files and copy them into HDFS.
  • Configured Sqoop jobs to import data from RDBMS into HDFS using Oozie workflows.
  • Implemented a script to transmit sys print information from Oracle to HBase using Sqoop.
  • Loaded data into Spark RDDs and performed in-memory computation to generate the output response.
  • Exported the analysed data to the relational databases using Sqoop for visualization and to generate reports.
  • Experience with Amazon Web Services, the AWS Command Line Interface, and AWS Data Pipeline.
  • Involved in loading data from local file system (Linux) to HDFS.
  • Knowledge of pushing data as delimited files into HDFS using Talend Big Data Studio.
  • Created a visual component using Adobe Flex that became a feature of a new release.

Environment: Hadoop, MapReduce, HDFS, HBase, Hive, Pig, Spark, Storm, Kafka, Flume, Sqoop, Oozie, SQL, Scala, Python, Java (JDK 1.6), AWS, Hadoop (Hortonworks), Eclipse, Talend Studio and Informatica 9.1.

Confidential, Little Rock, Arkansas

Hadoop Developer

Responsibilities:

  • Gathered business requirements from the Business Partners and subject matter experts and prepared Business Requirement document.
  • Involved in converting Business Requirements into Technical Design.
  • Developed simple to complex MapReduce jobs using Hive and Pig.
  • Optimized MapReduce jobs to use HDFS efficiently by applying various compression mechanisms.
  • Imported data from various sources, performed transformations using Hive and MapReduce, loaded data into HDFS, and extracted data from MySQL into HDFS using Sqoop.
  • Analysed the data using Hive queries and Pig scripts to study customer behaviour.
  • Used UDFs to implement business logic in Hadoop.
  • Experience using NoSQL databases Cassandra and MongoDB for information retrieval.
  • Experience implementing regression analysis using MapReduce.
  • Exported the analysed data to the relational databases using Sqoop for visualization and to generate reports for the BI team.
  • Experience in developing scripts and Batch Jobs to schedule various Hadoop programs using Oozie.
  • Experience in testing MapReduce code using JUnit.
  • Used Cloudera Manager to monitor the health of jobs running on the cluster.

Environment: Java (JDK 1.6), Hadoop, MapReduce, Pig, Hive, Cassandra, MongoDB, Sqoop, Oozie, HDFS, Hadoop (Cloudera), MySQL, Eclipse, Oracle.

Confidential

Java Developer

Responsibilities:

  • Prepared high-level and low-level design documents, applying appropriate design patterns, with UML diagrams to depict components and class-level details.
  • Interacted with system analysts and business users for design and requirement clarification.
  • Developed web services using SOAP, SOA, WSDL, and Spring MVC, and developed DTDs and XSD schemas for XML (parsing, processing, and design) to communicate with the Active Directory application using a RESTful API.
  • Developed JSPs according to requirement.
  • Developed the Web Based Rich Internet Application (RIA) using Adobe Flex.
  • Excellent knowledge of NoSQL with MongoDB and Cassandra.
  • Developed integration services using SOA, Mule ESB, Web Services, SOAP, and WSDL.
  • Designed, developed, and maintained the data layer using the Hibernate ORM framework.
  • Involved in analysis, design, development, and production of the application, and developed UML diagrams.
  • Developed web applications using Spring MVC, HTML4, Bootstrap.
  • Presented top-level design documentation to various groups during transition.
  • Used the Spring framework's JMS support for writing to a JMS queue and HibernateDaoSupport for interfacing with the database, and integrated Spring with JSF.
  • Wrote AngularJS controllers, views, and services.
  • Used Ant for builds and deployed the application on the JBoss application server.
  • Handled Java multithreading in back-end components.
  • Experience in web development, client-server, and n-tier enterprise applications using Java/J2EE technologies and Adobe Flex.
  • Implemented using MySQL.
  • Developed HTML reports for various modules as per the requirement.
  • Translated known information into concrete concepts and technical solutions.
  • Assisted in writing the SQL scripts to create and maintain the database, roles, users, tables in SQL Server.

Environment: Java, JDBC, Spring, JSP, JBoss, Servlets, Maven, Jenkins, Flex, HTML, AngularJS, MongoDB, Hibernate, JavaScript, Eclipse, Struts, SQL Server 2000.

Confidential

Jr. Java Developer

Responsibilities:

  • Analysed the object-oriented design and presented it with UML sequence and class diagrams.
  • Developed Admission & Census module, which monitors a wide range of detailed information for each resident upon admission.
  • Developed Plans module, which provides a comprehensive library of problems, goals, and approaches. Users can tailor these libraries (adding, deleting, or editing problems, goals, and approaches) and the disciplines used for their plans.
  • Developed General Ledger module, which streamlines analysis, reporting and recording of accounting information. General Ledger automatically integrates with a powerful spreadsheet solution for budgeting, comparative analysis and tracking facility information for flexible reporting.
  • Developed UI using HTML, JavaScript, and JSP, and developed Business Logic and Interfacing components using Business Objects, XML, and JDBC.
  • Designed the user interface and implemented validation checks using JavaScript.
  • Managed database connectivity using JDBC for querying, inserting, and data management, including triggers and stored procedures.
  • Developed components using Java multithreading concepts.
  • Developed various EJBs (session and entity beans) for handling business logic and data manipulation in the database.
  • Involved in the design of JSPs and Servlets for navigation among the modules.
  • Designed cascading style sheets and the XSLT and XML parts of the Order Entry and Product Search modules, and implemented client-side validation with JavaScript.
  • Hosted the application on WebSphere.

Environment: J2EE, Java/JDK, PL/SQL, JDBC, JSP, Servlets, JavaScript, EJB, JavaBeans, UML, XML, XSLT, Oracle 9i, HTML/DHTML.
