Hadoop Developer Resume
Tampa
SUMMARY
- Over 8 years of IT experience as a developer, designer, and quality reviewer, with cross-platform integration experience using Hadoop, Hadoop architecture, Java, J2EE, and SQL.
- Hands-on experience with major components of the Hadoop ecosystem, including MapReduce, HDFS, YARN, Cassandra, Impala, Hive, Pig, HBase, Sqoop, Oozie, Flume, Spark, ZooKeeper, and Avro.
- In-depth understanding of Hadoop architecture and its components, such as HDFS, JobTracker, TaskTracker, NameNode, DataNode, and MapReduce concepts.
- Excellent understanding of Hadoop architecture, the Hadoop Distributed File System, and the MapReduce framework and APIs.
- Strong understanding of NoSQL databases such as HBase, Cassandra, and MongoDB.
- Expertise in integrating various data sources such as RDBMS, spreadsheets, text files, JSON, and XML files.
- Solid experience in storage, querying, processing, and analysis of big data using the Hadoop framework.
- Hands-on experience with the MapR and Cloudera distributions of Hadoop.
- Developed batch processing jobs using Java MapReduce, Pig, and Hive.
- Expertise in writing Hadoop jobs for analyzing data using HiveQL (queries), Pig Latin (data flow language), and custom MapReduce programs in Java.
- Expertise in using Pig scripts to perform transformations, event joins, filters, and pre-aggregations before storing the data in HDFS.
- Extended Hive and Pig core functionality by writing custom UDFs.
- Expert in working with the Hive data warehouse tool: creating tables, distributing data through partitioning and bucketing, and writing and optimizing HiveQL queries.
- Used Sqoop to import and export data between relational databases and HDFS and Hive.
- In-depth knowledge of and experience with data warehousing ETL (extraction, transformation, and loading).
- Worked on data migration, data cleansing, data profiling, and ETL processes for data warehouses.
- Diverse experience using Java tools in business, web, and client-server environments, including the Java Platform, J2EE, EJB, JSP, Java Servlets, Struts, Spring Framework, MVC, Hibernate, and Java Database Connectivity (JDBC) technologies.
- Extensive development experience with IDEs such as Eclipse and NetBeans.
- Proficient with application servers such as WebSphere, WebLogic, JBoss, and Tomcat.
- Strong experience with RDBMS technologies such as MySQL, Oracle, and Teradata.
- Extensive experience with SQL, PL/SQL, and database concepts.
- Hands-on experience writing Linux/Unix shell scripts.
- Able to balance technical skill with strong theoretical, business, and analytical skills to deliver quality solutions, backed by result-oriented problem-solving and leadership skills.
TECHNICAL SKILLS
Hadoop/Big Data: Hadoop, HDFS, MapReduce, Pig, Sqoop, Hive, Oozie, Flume and ZooKeeper.
NoSQL Databases: HBase, Cassandra, MongoDB.
Database Servers: Teradata, IBM DB2, Oracle, MySQL.
Languages: C, C++, Java, J2EE, JNDI, Servlets, JSP, Pig Latin, HiveQL, Unix Shell Scripts.
Java/J2EE Technologies: Data types, Collections, Threads, Generic types, JDBC, JNDI, JSON, JavaScript, JSP, Servlets.
Frameworks: MVC, Spring, Struts, Hibernate
Web/Application servers: BEA WebLogic, IBM WebSphere, JBoss, Tomcat.
Web Technologies: HTML, AJAX, CSS, XHTML, XML, XSL, XSLT, WSDL, SOAP
Version control: SVN
IDEs: Eclipse, NetBeans.
Operating Systems: Linux, UNIX, Windows, Mac OS.
PROFESSIONAL EXPERIENCE
Confidential, Tampa
Hadoop Developer
Responsibilities:
- Architected and delivered projects for large customers on big data platforms.
- Designed and built Hadoop solutions for big data problems.
- Developed MapReduce applications using Hadoop MapReduce programming and HBase.
- Developed transformations using custom MapReduce, Pig, and Hive.
- Involved in developing Pig scripts.
- Involved in developing Hive reports.
- Implemented map-side and reduce-side joins in Java MapReduce.
- Developed Sqoop scripts to enable interaction between Pig and the MySQL database.
- Involved in HBase data modeling and row key design.
- Created and configured HBase and Hive tables and loaded data into each.
- Ingested data into HDFS using Sqoop, Flume, and the HDFS client APIs.
- Implemented a proof of concept (POC) using Spark.
- Implemented test scripts to support test-driven development and continuous integration.
- Created Hive external tables, added partitions, and worked to improve Hive performance.
- Configured various big data workflows to run on top of Hadoop; these workflows comprise heterogeneous jobs such as Pig, Hive, Sqoop, and MapReduce.
- Imported and exported data between relational and NoSQL databases and Hadoop using Sqoop.
- Worked on tuning the performance of Hive and Pig queries.
- Wrote Java code for custom Partitioner and Writable implementations.
Environment: Hadoop, MapReduce, Apache Pig, Hive, HBase, Oozie, Flume, Sqoop, UNIX, MySQL, Teradata, Cassandra, Linux/Unix shell scripting, Java, Linux, SQL, Big Data, Spark, Cloudera Hadoop Distribution
Confidential, Schaumburg, IL
Hadoop Developer
Responsibilities:
- Analyzed the Hadoop stack and various big data analytics tools, including Pig, Hive, the HBase database, and Sqoop.
- Designed the high-level ETL architecture for overall data transfer from OLTP to OLAP.
- Created documents such as the source-to-target data mapping document, unit test cases, and the data migration document.
- Developed Pig Latin scripts to extract data from web server output files and load it into HDFS.
- Used ZooKeeper as the cluster coordination service.
- Created mappings using transformations such as Source Qualifier, Aggregator, Expression, Lookup, Router, Normalizer, Filter, Update Strategy, and Joiner.
- Used Hive to expose data for further analysis and to transform files from different analytical formats into text files.
- Implemented best income logic using Pig scripts and UDFs.
- Designed and implemented a Spark test bench application to evaluate the quality of recommendations made by the engine.
- Built a tool that monitored log input from several data centers via Spark Streaming; the logs were analyzed in Apache Storm, and the data was parsed and saved into Cassandra.
- Handled streaming and complex analytics using Spark.
- Implemented test scripts to support test-driven development and continuous integration.
- Worked on tuning the performance of Hive and Pig queries.
- Used Impala for massively parallel processing of Hive queries.
- Streamed data to Hadoop using Kafka.
- Wrote Java code for custom Partitioner and Writable implementations.
- Worked on the Analytics Infrastructure team to develop a stream filtering system on top of Apache Kafka.
- Simplified jobs by building applications on top of Cassandra.
- Ingested data into HBase and Hive using Storm bolts.
- Unit-tested and tuned SQL queries and ETL code for better performance.
- Monitored performance and identified bottlenecks in ETL code.
- Worked on data using a Hadoop, ZooKeeper, and Accumulo stack, aiding in the development of specialized indexes for performant queries on big data implementations.
Environment: Informatica PowerCenter 9.5, Hadoop, HDFS, MapReduce, HBase, Hive, Pig, Sqoop, Oozie, Flume, Cassandra, Linux/Unix shell scripting, Big Data, Java, Tableau, Spark, Kafka.
Confidential - Dallas, TX
Hadoop Developer
Responsibilities:
- Knowledge of real-time message processing systems (Storm, S4).
- Collected business requirements from business partners and experts.
- Involved in installing Hadoop ecosystem components.
- Managed data coming from different sources.
- Used Apache Flume to ingest log data from multiple sources directly into HDFS.
- Customized Flume to enrich data with LDAP lookups and GeoIP lookups.
- Involved in writing MapReduce programs that run on the cluster.
- Involved in HDFS maintenance and loading of structured and unstructured data.
- Installed and configured Pig and wrote Pig Latin scripts.
- Wrote MapReduce jobs using the Java API.
- Wrote MapReduce jobs using Pig Latin.
- Imported data from MySQL into HDFS using Sqoop.
- Developed scripts and batch jobs to schedule various Hadoop programs.
- Wrote Hive queries for data analysis to meet the business requirements and generated reports.
- Created Hive tables using HiveQL and worked with them.
- Wrote Hive UDFs for frequently used HiveQL queries.
- Used the Agile Scrum methodology to help manage and organize a team of 4 developers with regular code review sessions.
- Held regular meetings with technical teams and actively participated in code review sessions with other developers.
- Used Continuum for integration testing and JUnit for unit testing.
Environment: Hadoop, HDFS, MapReduce, Unix, Flume, Python, Pig, MySQL, MySQL Workbench, Hive, Java, HBase, Storm, ZooKeeper, PuTTY, Eclipse, Cloudera, Linux.
Confidential - Princeton, NJ
Java Developer
Responsibilities:
- Collected requirements from the business and updated technical design documents.
- Involved in the development of interfaces for the application using JSP, Servlets, and JavaScript.
- Developed J2EE front-end and back-end components supporting business logic, integration, and persistence.
- Developed the application using the Spring MVC framework, implementing controllers.
- Used Spring MVC to decouple business logic and view components.
- Developed controllers, updated the Spring MVC configuration files, and implemented validation using the Spring MVC Validator framework.
- Developed rich client user interfaces using Ajax and wired them into the Spring MVC framework.
- Developed server-side utilities using J2EE technologies such as JDBC, JSP, and Servlets.
- Developed the JSPs using the Spring MVC framework tag libraries.
- Created database tables and set up session variables depending upon the user privileges.
- Created technical specifications and performed coding, unit testing, and system integration testing for the enhancements; conducted reviews with users.
- Developed and executed the business validation logic in form beans.
- Analyzed the existing project to prepare business logic documents.
- Created and administered JNDI files for the JMS application.
- Used JDBC to pull data from the database.
- Designed session beans to create, update, and delete data in the database.
- Created graphical user interface (GUI) front ends using JSP, HTML, and Ajax functions with JavaScript.
- Used object-oriented JavaScript to develop rich client pages.
- Used SVN as the version management tool.
- Communicated with the end client to support the application; analyzed and fixed issues.
- Developed Spring components for storing patient details and billing work.
Environment: JDBC, MySQL, CSS, HTML, Java/J2EE, jQuery, Servlets, Spring, Spring MVC, JSP, JavaScript, Ajax, XML, Oracle 11g, JUnit, SVN, Eclipse, and Windows
Confidential
Java Developer
Responsibilities:
- Involved in Design, Development and Support phases of Software Development Life Cycle (SDLC).
- Reviewed functional, design, and test specifications and source code.
- Involved in developing the complete front end using JavaScript and CSS.
- Authored functional, design, and test specifications.
- Analyzed, designed, and developed the component.
- Used JDBC for database access.
- Experienced in reading log files and responding quickly to issues.
- Used the Data Transfer Object (DTO) design pattern.
- Followed UML standards; created class and sequence diagrams.
- Performed unit testing and rigorous integration testing of the whole application.
- Prepared and executed test cases.
- Actively involved in system testing.
- Performed unit testing and documented the test results.
- Prepared the installation guide, customer guide, and configuration document, which were delivered to the customer along with the product.
Environment: Java/J2EE, SQL, Oracle 10g, JSP 2.0, EJB, AJAX, JavaScript, WebLogic 8.0, HTML, JDBC 3.0, log4j, JUnit, Servlets, MVC, MyEclipse
