Senior Hadoop Developer Resume
San Jose, CA
SUMMARY
- Over 8 years of overall experience as software developer in design, development, deploying and supporting large scale distributed systems.
- Over 3.5 years of extensive experience as Hadoop Developer and Big Data Analyst.
- Primary technical skills in HDFS, MapReduce, YARN, Pig, Hive, Impala, Sqoop, HBase, Cloudera,Spark.
- Have good experience in extracting and generating statistical analysis using Business Intelligence tool Tableau for better analysis of data.
- Experience in importing and exporting data using Sqoop and SFTP for Hadoop to/from RDBMS.
- Excellent understanding of Hadoop architecture and its components such as HDFS, Application Master, Resource Manager, Node Manager and MapReduce programming paradigm.
- Good experience in Core Java, J2EE, JavaScript, Servlets, Struts, spring, Hibernate, JDBC, EJB, XML, PL/SQL and working with Agile methodologies.
- Extensive experience with Databases such as MySQL, Oracle 11G.
- Experience in writing SQL queries, Stored Procedures, Triggers, Cursors and Packages.
- Good experience in writing optimized Map Reduce jobs using Java.
- Experience in implementing User Defined Functions for Pig and Hive.
- Experience with working on web Services REST, JAX - WS, SOAP and AWS.
- Very Good knowledge and Hands-on experience in Cassandra, Flume and Spark (YARN).
- Good knowledge in distributed coordination system ZooKeeper and search platform Solr.
- Expertise in preparing the test cases, documenting and performing unit testing and Integration testing.
- In-depth understanding of Data Structures and Algorithms and Optimization.
- Strong knowledge of Software Development Life Cycle and expertise in detailed design documentation.
- Fast learner with good interpersonal skills, having strong analytical and communication skills and interested in problem solving and troubleshooting.
- Self-motivated, excellent team player, with positive attitude and adhere to strict deadlines.
TECHNICAL SKILLS
Languages: Java, Scala, C, C++
BigData Technologies: Hadoop, HDFS, YARN, MapReduce, Hive, Pig, HBase, Storm, kafka, Impala, Sqoop, Informatica, Tidal, Spark, Cassandra, MapR, HiveQL, PigLatin, Git
RDBMS: Oracle, MySQL, SQL Server.
No SQL: HBase, Cassandra
Scripting & Query Languages: Python, Shell, SQL & PL/SQL.
Web/Application Servers: IBM Web sphere, Tomcat and LDAP.
Technologies: J2EE, JDBC, Multi-threading, JSP, Servlets, Struts, JSF, AJAX, SOAP, XSLT, DOM, CSS, DTD and Schema
PROFESSIONAL EXPERIENCE
Senior Hadoop Developer
Confidential, San Jose, CA
Responsibilities:
- Worked closely with client and proposed how Hadoop can benefit their purpose and gave a high level picture of what hadoop is capable of.
- Contacted the MapR distribution team for the creation of account for GCS(Global Customer Success) for Dev, stage and Prod environments.
- Converted the existing business logic and enhanced it with new changes and prepared the corresponding Hive queries.
- Connected to different sources to obtain the data directly instead of flat files which were processed and filtered before being sent to the data team through Informatica.
- Developed code to fetch data from C3 and CR database based on the requirement and to get the output in a required format using sqoop.
- Written python scripts for conversion of .xlsx files into .csv for compatibility to load into hadoop using PIG.
- Written Shell scripts to automate most of the procedure using Tidal scheduler.
- Written UDF’s, UDAF’s for custom operations in Java.
- Created the QA Test cases based on the business rules defined and performed validation.
- Pushed the output files into S3 bucket from where the Gainsight team downloads the data and loads it into their system for the availability of CuSM and vCuSM’s.
- Distributed the work efficiently and assigned tasks to the team members to meet the SLA as a part of Agile environment.
- Trained the other team members to get business overview and to understand the functionality.
- Took sessions for technical assistance for the new team members.
Environment: Hive, PIG, Spark, MapR, Informatica, sqoop, Java, J2EE, python scripting, Shell Scripting, Amazon S3, Oracle 11g
Senior Hadoop Developer
Confidential, San Jose, CA
Responsibilities:
- Successfully completed the POC and created that into a project for data migration and analysis.
- Sqooped the required business views from Teradata and loaded into respective table into hive and MapR DB.
- Developed Hive queries for the business use case and optimized the queries.
- Implemented partitioning and dynamic partitioning on hive tables for faster search.
- Benchmarked Impala to Hive on performance for the respective business use case.
- Integrated Hive table with MapR DB tables for OLTP transactions.
- Created a UI which fetches data from the MapR DB table and updates it if modified in the UI and reflect back in the search.
- Scheduled sqoop jobs for the incremental load based on the business requirement from Teradata.
- Written shell scripts for automation of the queries to a certain point in the business usecase.
- Projected the results on Tableau Dashboard connected to the Hive tables and prepared reports for delivery.
- Written QA test cases as a part of Agile unit testing process for processing of data and persistence.
- Trained team members with non-hadoop background to understand the work flow and tools involved.
Environment: Teradata, Sqoop, Hive, MapR DB, Tableau, Platfora, Java, Shell scripting, Impala, Spark-sql.
Big Data / Hadoop Developer
Confidential, Rocky Hill, CT
Responsibilities:
- Have setup the 64 node cluster and configured the entire Hadoop platform.
- Migrating the needed data from Oracle, MySQL in to HDFS using Sqoop and importing various formats of flat files in to HDFS.
- Mainly worked on Hive queries to categorize data of different claims.
- Integrated the hive warehouse with HBase
- Written customized Hive UDFs in Java where the functionality is too complex.
- Implemented Partitioning, Dynamic Partitions, Buckets in HIVE.
- Designing and creating Hive external tables using shared meta-store instead of derby with partitioning, dynamic partitioning and buckets.
- Generate final reporting data using Tableau for testing by connecting to the corresponding Hive tables using Hive ODBC connector.
- Maintain System integrity of all sub-components related to Hadoop.
- Maintained System integrity of all sub-components (primarily HDFS, MR, HBase, and Hive).
- Monitored System health and logs and respond accordingly to any warning or failure conditions.
- Presented data and dataflow using Talend for reusability.
Environment: UNIX, Apache Hadoop, HDFS, Hive, Java, Sqoop, Cloudera CDH4, Oracle, MySQL, Tableau, Talend, Elastic search, Kibana, SFTP
Senior Java Developer
Confidential, Windsor, CT
Responsibilities:
- Acted as a lead for development team of 8 gathering the requirements and designing the flow of the project.
- Analyzed all the test cases based on the requirements gathered and documented for unit testing as well as for integration testing.
- Designed the user interface required for the portal with all the components for selection of plan.
- Provided the design using Restful WebServices to populate the individual details of plans available for the customers to pick.
- Programmed functionality for all the components in the user interface interacting with the database using Enterprise Java Beans and MySQL Server.
- Developed various Controller classes and business logic using the spring libraries which interact with the middle tier to perform the business operations.
- Responsible to develop the custom tools as per the client needs.
- Developed the DTDs finalized by the business.
- Tested the application by programming the test cases using JUnit for both unit testing and Integration testing and bug tracking for the entire application.
Environment: Core Java, JDK 1.7, JSP, Struts, EJB, Hibernate, MySQL, SOAP, REST, JUnit, Eclipse, HTML, JavaScript, XML
Senior Java Developer
Confidential
Responsibilities:
- Prepared the design document for the flow of each module and their dependency over other modules.
- Documented all the test cases based on the requirements gathered for unit testing as well as for integration testing.
- Developed the user interface components apart from the ready to use components using CSS to maintain the uniformity across the application.
- Developed the front end interface using JavaScript and Ajax features.
- Implemented SOAP WebServices for total equipment and Mechanic details accessible over all the departments.
- Participated and delivered various work products in the development and implementation of Software deliverables and Software configuration management (SCM)
Environment: Core Java, JDK 1.6, JSP, Spring Framework, EJB, Hibernate, Oracle 10g, JUnit, Eclipse, HTML, CSS, JavaScript, REST, XML, WebSphere Application Server.
Java Developer/Programmer
Confidential, Bentonville
Responsibilities:
- Interacted with client for gathering requirements and preparing the test cases for each and every requirement which are useful in testing.
- Developed UI using HTML, JavaScript, CSS and JSP for interactive cross browser functionality and complex user interface.
- Implemented the end to end functionality of the client requirement during the development phase.
- Implemented the functionality of mapping entities to the database using Hibernate.
- Written SQL queries involved in the JDBC connection in accordance with the business logic.
- Performed various levels of unit testing for the entire application using the test cases which included preparation of detail documentation for the results.
- Actively participated in client meetings and taking the inputs for the additional functionality.
- Suggested some better possible ways of having the user interface from the user perspective.
