Hadoop Developer Resume

Tampa

SUMMARY

  • Over 8 years of IT experience as a developer, designer, and quality reviewer, with cross-platform integration experience using Hadoop, Hadoop architecture, Java, J2EE, and SQL.
  • Hands-on experience with major components of the Hadoop ecosystem, including Hadoop MapReduce, HDFS, YARN, Cassandra, Impala, Hive, Pig, HBase, Sqoop, Oozie, Flume, Spark, ZooKeeper, and Avro.
  • In-depth understanding of Hadoop architecture and its components, such as HDFS, JobTracker, TaskTracker, NameNode, DataNode, and MapReduce concepts.
  • Excellent understanding of Hadoop architecture, the Hadoop Distributed File System, and the MapReduce framework and APIs.
  • Strong understanding of NoSQL databases such as HBase, Cassandra, and MongoDB.
  • Expertise in integrating various data sources such as RDBMS, spreadsheets, text files, JSON, and XML files.
  • Solid experience in storage, querying, processing, and analysis of big data using the Hadoop framework.
  • Hands-on experience with the MapR and Cloudera distributions of Hadoop.
  • Developed Batch Processing jobs using Java Map Reduce, Pig and Hive.
  • Expertise in writing Hadoop jobs for analyzing data using HiveQL queries, Pig Latin (a data flow language), and custom MapReduce programs in Java.
  • Expertise in using Pig scripts to perform transformations, event joins, filters, and some pre-aggregations before storing the data in HDFS.
  • Extended Hive and Pig core functionality by writing custom UDFs.
  • Expert in working with the Hive data warehouse tool: creating tables, distributing data through partitioning and bucketing, and writing and optimizing HiveQL queries.
  • Used Sqoop to import data into and export data from HDFS and Hive.
  • In-depth knowledge of and experience with data warehousing ETL (extraction, transformation, and loading).
  • Worked on data migration, data cleansing, data profiling, and ETL processes for data warehouses.
  • Diverse experience using Java tools in business, web, and client-server environments, including the Java Platform, J2EE, EJB, JSP, Java Servlets, Struts, Spring Framework, MVC, Hibernate, and Java Database Connectivity (JDBC) technologies.
  • Extensive development experience in IDEs such as Eclipse and NetBeans.
  • Proficient with application servers such as WebSphere, WebLogic, JBoss, and Tomcat.
  • Strong experience in RDBMS technologies like MySQL, Oracle and Teradata.
  • Extensive experience with SQL, PL/SQL, and database concepts.
  • Hands-on experience writing Linux/Unix shell scripts.
  • Able to balance technical skill with strong theoretical, business, and analytical skills to deliver quality solutions, backed by result-oriented problem solving and leadership.

TECHNICAL SKILLS

Hadoop/Big Data: Hadoop, HDFS, MapReduce, Pig, Sqoop, Hive, Oozie, Flume, and ZooKeeper.

NoSQL Databases: HBase, Cassandra, MongoDB.

Database Servers: Teradata, IBM DB2, Oracle, MySQL.

Languages: C, C++, Java, J2EE, JNDI, Servlets, JSP, Pig Latin, HiveQL, Unix Shell Scripts.

Java/J2EE Technologies: Data types, Collections, Threads, Generic types, JDBC, JNDI, JSON, JavaScript, JSP, Servlets.

Frameworks: MVC, Spring, Struts, Hibernate

Web/Application servers: BEA WebLogic, IBM WebSphere, JBoss, Tomcat.

Web Technologies: HTML, AJAX, CSS, XHTML, XML, XSL, XSLT, WSDL, SOAP

Version control: SVN

IDEs: Eclipse, NetBeans.

Operating Systems: Linux, UNIX, Windows, Mac OS.

PROFESSIONAL EXPERIENCE

Confidential, Tampa

Hadoop Developer

Responsibilities:

  • Architected and delivered projects for large customers on big data platforms.
  • Designed and built Hadoop solutions for big data problems.
  • Developed a MapReduce application using Hadoop MapReduce and HBase.
  • Developed transformations using custom MapReduce, Pig, and Hive.
  • Helped develop Pig scripts.
  • Helped develop Hive reports.
  • Implemented map-side and reduce-side joins in Java MapReduce.
  • Developed Sqoop scripts to enable interaction between Pig and the MySQL database.
  • Involved in HBase data modelling and row key design.
  • Created and configured HBase and Hive tables and loaded data into them.
  • Ingested data into HDFS using tools such as Sqoop, Flume, and the HDFS client APIs.
  • Implemented a proof of concept (POC) using Spark.
  • Implemented test scripts to support test-driven development and continuous integration.
  • Created Hive external tables, added partitions, and worked to improve Hive performance.
  • Configured various big data workflows to run on top of Hadoop; these workflows comprise heterogeneous jobs such as Pig, Hive, Sqoop, and MapReduce.
  • Imported and exported data between relational/NoSQL databases and Hadoop using Sqoop.
  • Worked on tuning the performance of Hive and Pig queries.
  • Wrote Java code for a custom Partitioner and Writable.

Environment: Hadoop, MapReduce, Apache Pig, Hive, HBase, Oozie, Flume, Sqoop, UNIX, MySQL, Teradata, Cassandra, Linux/Unix shell scripting, Java, Linux, SQL, Big Data, Spark, Cloudera Hadoop Distribution

Confidential, Schaumburg, IL

Hadoop Developer

Responsibilities:

  • Analyzed the Hadoop stack and various big data analytics tools, including Pig, Hive, the HBase database, and Sqoop.
  • Designed the high-level ETL architecture for overall data transfer from OLTP to OLAP.
  • Created documents such as the Source-to-Target Data Mapping document, unit test cases, and the Data Migration document.
  • Developed Pig Latin scripts to extract data from web server output files and load it into HDFS.
  • Provided cluster coordination services through ZooKeeper.
  • Created mappings using transformations such as Source Qualifier, Aggregator, Expression, Lookup, Router, Normalizer, Filter, Update Strategy, and Joiner.
  • Used Hive to expose data for further analysis and to transform files from various analytical formats into text files.
  • Implemented best income logic using Pig scripts and UDFs.
  • Designed and implemented a Spark test bench application to evaluate the quality of recommendations made by the engine.
  • A tool monitored log input from several data centres via Spark Streaming; the input was analysed in Apache Storm, and the data was parsed and saved into Cassandra.
  • Handled streaming and complex analytics using Spark.
  • Implemented test scripts to support test-driven development and continuous integration.
  • Worked on tuning the performance of Hive and Pig queries.
  • Used Impala for massively parallel processing of Hive queries.
  • Streamed data to Hadoop using Kafka.
  • Wrote Java code for a custom Partitioner and Writable.
  • Worked on the Analytics Infrastructure team to develop a stream filtering system on top of Apache Kafka.
  • Simplified jobs by building applications on top of Cassandra.
  • Ingested data into HBase and Hive using Storm bolts.
  • Unit tested and tuned SQL queries and ETL code for better performance.
  • Monitored the performance and identified performance bottlenecks in ETL code.
  • Worked on data using a Hadoop, ZooKeeper, and Accumulo stack, aiding in the development of specialized indexes for performant queries on big data implementations.

Environment: Informatica PowerCenter 9.5, Hadoop, HDFS, MapReduce, HBase, Hive, Pig, Sqoop, Oozie, Flume, Cassandra, Linux/Unix shell scripting, Big Data, Java, Tableau, Spark, Kafka.

Confidential - Dallas, TX

Hadoop Developer

Responsibilities:

  • Knowledge of real-time message processing systems (Storm, S4).
  • Collected the business requirements from the Business Partners and Experts.
  • Involved in installing Hadoop Ecosystem components.
  • Responsible for managing data coming from different sources.
  • Used Apache Flume to ingest log data from multiple sources directly into HDFS.
  • Customized Flume to enrich data with LDAP lookups and GeoIP lookups.
  • Wrote MapReduce programs that run on the cluster.
  • Involved in HDFS maintenance and loading of structured and unstructured data.
  • Installed and configured Pig and wrote Pig Latin scripts.
  • Wrote MapReduce jobs using the Java API.
  • Wrote MapReduce jobs using Pig Latin.
  • Imported data from MySQL into HDFS using Sqoop.
  • Developed scripts and batch jobs to schedule various Hadoop programs.
  • Wrote Hive queries for data analysis to meet the business requirements and generated reports.
  • Created Hive tables using HiveQL and worked on them.
  • Wrote Hive UDFs for frequently used HiveQL queries.
  • Utilized Agile Scrum Methodology to help manage and organize a team of 4 developers with regular code review sessions.
  • Held regular meetings with technical teams and participated actively in code review sessions with other developers.
  • Used Continuum for integration testing and JUnit for unit testing.

Environment: Hadoop, HDFS, MapReduce, Unix, Flume, Python, Pig, MySQL, MySQL Workbench, Hive, Java, HBase, Storm, ZooKeeper, PuTTY, Eclipse, Cloudera, Linux.

Confidential - Princeton, NJ

Java Developer

Responsibilities:

  • Collected requirements from the business and updated technical design documents.
  • Involved in the development of interfaces for the application using JSP, Servlets, and JavaScript.
  • Developed J2EE front-end and back-end components supporting business logic, integration, and persistence.
  • Developed the application using the Spring MVC framework, implementing controllers.
  • Used Spring MVC to decouple business logic and view components.
  • Developed controllers, updated the Spring MVC configuration files and implemented validation using Spring MVC Validator framework.
  • Developed rich client user interfaces using Ajax and wired with Spring MVC framework.
  • Developed server-side utilities using J2EE technologies such as JDBC, JSP, and Servlets.
  • Developed the JSPs using the Spring MVC framework tag libraries.
  • Created database tables and set up session variables depending upon the user privileges.
  • Created technical specifications; performed coding, unit testing, and system integration testing for the enhancements; and conducted reviews with users.
  • Developed and executed the business validation logic in form beans.
  • Analyzed the existing project to prepare business logic documents.
  • Created and administered JNDI files for the JMS application.
  • Used JDBC for pulling the data from database.
  • Designed session beans to create, update, and delete data in the database.
  • Created graphical user interface (GUI) front ends using JSP, HTML, and Ajax functions with JavaScript.
  • Used object-oriented JavaScript to develop rich client pages.
  • Used SVN as the version management tool.
  • Communicated with the end client to support the application; analyzed and fixed issues.
  • Developed Spring components for storing patient details and billing work.

Environment: JDBC, MySQL, CSS, HTML, Java/J2EE, jQuery, Servlets, Spring, Spring MVC, JSP, JavaScript, Ajax, XML, Oracle 11g, JUnit, SVN, Eclipse, and Windows

Confidential

Java Developer

Responsibilities:

  • Involved in Design, Development and Support phases of Software Development Life Cycle (SDLC).
  • Reviewed the functional, design, source code and test specifications.
  • Developed the complete front end using JavaScript and CSS.
  • Authored functional, design, and test specifications.
  • Analyzed, designed and developed the component.
  • Used JDBC for database access.
  • Read log files and responded quickly to the issues they revealed.
  • Used the Data Transfer Object (DTO) design pattern.
  • Followed UML standards, created class and sequence diagrams.
  • Performed unit testing and rigorous integration testing of the whole application.
  • Prepared and executed test cases.
  • Actively involved in the system testing.
  • Performed Unit Testing and documented the test results.
  • Prepared the installation guide, customer guide, and configuration document, which were delivered to the customer along with the product.

Environment: Java/J2EE, SQL, Oracle 10g, JSP 2.0, EJB, AJAX, JavaScript, WebLogic 8.0, HTML, JDBC 3.0, log4j, JUnit, Servlets, MVC, MyEclipse
