Senior Hadoop Developer Resume

San Jose, CA

SUMMARY

  • Over 8 years of experience as a software developer designing, developing, deploying, and supporting large-scale distributed systems.
  • Over 3.5 years of extensive experience as a Hadoop Developer and Big Data Analyst.
  • Primary technical skills in HDFS, MapReduce, YARN, Pig, Hive, Impala, Sqoop, HBase, Cloudera, and Spark.
  • Good experience extracting data and generating statistical analyses with the business intelligence tool Tableau.
  • Experience importing and exporting data between Hadoop and RDBMSs using Sqoop and SFTP.
  • Excellent understanding of Hadoop architecture and its components, such as HDFS, Application Master, Resource Manager, and Node Manager, and of the MapReduce programming paradigm.
  • Good experience in Core Java, J2EE, JavaScript, Servlets, Struts, Spring, Hibernate, JDBC, EJB, XML, and PL/SQL, and in working with Agile methodologies.
  • Extensive experience with databases such as MySQL and Oracle 11g.
  • Experience in writing SQL queries, Stored Procedures, Triggers, Cursors and Packages.
  • Good experience in writing optimized MapReduce jobs in Java.
  • Experience in implementing User Defined Functions for Pig and Hive.
  • Experience working with web services (REST, JAX-WS, SOAP) and AWS.
  • Very good knowledge of and hands-on experience with Cassandra, Flume, and Spark (on YARN).
  • Good knowledge of the distributed coordination system ZooKeeper and the search platform Solr.
  • Expertise in preparing and documenting test cases and in performing unit and integration testing.
  • In-depth understanding of data structures, algorithms, and optimization.
  • Strong knowledge of the Software Development Life Cycle and expertise in detailed design documentation.
  • Fast learner with good interpersonal skills, strong analytical and communication skills, and an interest in problem solving and troubleshooting.
  • Self-motivated, excellent team player with a positive attitude who adheres to strict deadlines.

TECHNICAL SKILLS

Languages: Java, Scala, C, C++

Big Data Technologies: Hadoop, HDFS, YARN, MapReduce, Hive, Pig, HBase, Storm, Kafka, Impala, Sqoop, Informatica, Tidal, Spark, Cassandra, MapR, HiveQL, Pig Latin, Git

RDBMS: Oracle, MySQL, SQL Server

NoSQL: HBase, Cassandra

Scripting & Query Languages: Python, Shell, SQL, PL/SQL

Web/Application Servers: IBM WebSphere, Tomcat, LDAP

Technologies: J2EE, JDBC, Multi-threading, JSP, Servlets, Struts, JSF, AJAX, SOAP, XSLT, DOM, CSS, DTD and Schema

PROFESSIONAL EXPERIENCE

Senior Hadoop Developer

Confidential, San Jose, CA

Responsibilities:

  • Worked closely with the client, proposed how Hadoop could serve their needs, and gave a high-level picture of what Hadoop is capable of.
  • Worked with the MapR distribution team to create an account for GCS (Global Customer Success) in the Dev, Stage, and Prod environments.
  • Converted the existing business logic, enhanced it with new changes, and prepared the corresponding Hive queries.
  • Connected directly to the different data sources instead of relying on flat files that had been processed and filtered before being sent to the data team through Informatica.
  • Developed code to fetch data from the C3 and CR databases as required and to produce the output in the required format using Sqoop.
  • Wrote Python scripts to convert .xlsx files to .csv so that they could be loaded into Hadoop using Pig.
  • Wrote shell scripts to automate most of the procedure using the Tidal scheduler.
  • Wrote UDFs and UDAFs in Java for custom operations.
  • Created QA test cases based on the defined business rules and performed validation.
  • Pushed the output files to an S3 bucket, from which the Gainsight team downloads the data and loads it into their system to make it available to CuSM and vCuSMs.
  • Distributed work efficiently and assigned tasks to team members to meet the SLA in an Agile environment.
  • Trained other team members on the business overview and the functionality.
  • Ran technical assistance sessions for new team members.

Environment: Hive, Pig, Spark, MapR, Informatica, Sqoop, Java, J2EE, Python scripting, Shell scripting, Amazon S3, Oracle 11g

Senior Hadoop Developer

Confidential, San Jose, CA

Responsibilities:

  • Successfully completed the POC and turned it into a project for data migration and analysis.
  • Used Sqoop to import the required business views from Teradata and loaded them into the respective tables in Hive and MapR DB.
  • Developed and optimized Hive queries for the business use case.
  • Implemented partitioning, including dynamic partitioning, on Hive tables for faster search.
  • Benchmarked Impala against Hive on performance for the respective business use case.
  • Integrated Hive tables with MapR DB tables for OLTP transactions.
  • Created a UI that fetches data from the MapR DB table, writes back any modifications made in the UI, and reflects them in the search.
  • Scheduled Sqoop jobs for incremental loads from Teradata based on the business requirements.
  • Wrote shell scripts to automate the queries up to a certain point in the business use case.
  • Presented the results on a Tableau dashboard connected to the Hive tables and prepared reports for delivery.
  • Wrote QA test cases for data processing and persistence as part of the Agile unit testing process.
  • Trained team members without a Hadoop background to understand the workflow and tools involved.

Environment: Teradata, Sqoop, Hive, MapR DB, Tableau, Platfora, Java, Shell scripting, Impala, Spark SQL

Big Data / Hadoop Developer

Confidential, Rocky Hill, CT

Responsibilities:

  • Set up a 64-node cluster and configured the entire Hadoop platform.
  • Migrated the required data from Oracle and MySQL into HDFS using Sqoop and imported flat files of various formats into HDFS.
  • Worked mainly on Hive queries to categorize data for different claims.
  • Integrated the Hive warehouse with HBase.
  • Wrote custom Hive UDFs in Java where the required functionality was too complex.
  • Implemented partitioning, dynamic partitions, and buckets in Hive.
  • Designed and created Hive external tables with partitioning, dynamic partitioning, and buckets, using a shared metastore instead of Derby.
  • Generated final reporting data in Tableau for testing by connecting to the corresponding Hive tables through the Hive ODBC connector.
  • Maintained system integrity of all Hadoop sub-components (primarily HDFS, MapReduce, HBase, and Hive).
  • Monitored system health and logs and responded to any warning or failure conditions.
  • Presented data and dataflow using Talend for reusability.

Environment: UNIX, Apache Hadoop, HDFS, Hive, Java, Sqoop, Cloudera CDH4, Oracle, MySQL, Tableau, Talend, Elasticsearch, Kibana, SFTP

Senior Java Developer

Confidential, Windsor, CT

Responsibilities:

  • Led a development team of 8, gathering requirements and designing the project flow.
  • Analyzed and documented all test cases, based on the gathered requirements, for unit and integration testing.
  • Designed the portal's user interface, including all components for plan selection.
  • Designed RESTful web services to populate the details of the individual plans available for customers to choose.
  • Programmed the functionality of all user interface components that interact with the database, using Enterprise JavaBeans and MySQL Server.
  • Developed various controller classes and business logic using the Spring libraries, which interact with the middle tier to perform the business operations.
  • Developed custom tools according to client needs.
  • Developed the DTDs finalized by the business.
  • Tested the application with JUnit test cases for both unit and integration testing, and tracked bugs for the entire application.

Environment: Core Java, JDK 1.7, JSP, Struts, EJB, Hibernate, MySQL, SOAP, REST, JUnit, Eclipse, HTML, JavaScript, XML

Senior Java Developer

Confidential

Responsibilities:

  • Prepared the design document covering the flow of each module and its dependencies on other modules.
  • Documented all test cases, based on the gathered requirements, for unit and integration testing.
  • Developed custom user interface components, in addition to the ready-to-use ones, using CSS to maintain uniformity across the application.
  • Developed the front-end interface using JavaScript and Ajax features.
  • Implemented SOAP web services that make total equipment and mechanic details accessible across all departments.
  • Participated in the development and implementation of software deliverables and software configuration management (SCM), delivering various work products.

Environment: Core Java, JDK 1.6, JSP, Spring Framework, EJB, Hibernate, Oracle 10g, JUnit, Eclipse, HTML, CSS, JavaScript, REST, XML, WebSphere Application Server.

Java Developer/Programmer

Confidential, Bentonville

Responsibilities:

  • Interacted with the client to gather requirements and prepared test cases for each requirement.
  • Developed the UI using HTML, JavaScript, CSS, and JSP for interactive, cross-browser functionality and a complex user interface.
  • Implemented the end-to-end functionality of the client requirements during the development phase.
  • Implemented the mapping of entities to the database using Hibernate.
  • Wrote SQL queries, run over JDBC connections, in accordance with the business logic.
  • Performed various levels of unit testing for the entire application using the test cases and prepared detailed documentation of the results.
  • Actively participated in client meetings and gathered input on additional functionality.
  • Suggested improvements to the user interface from the user's perspective.
