Java, Hadoop Developer Resume
SUMMARY
- A creative, results - driven professional with over 9+ years of experience in IT services industry across Engineering, Service Delivery and Customer Relationship Management functions with Executive Leadership, Business Stakeholders to improving process, thinking strategically and building teams with excellent measurable and valuable business results.
- 9+ years of leading development team as a technical lead.
- Unique offshore & onsite management experience.
- Experience in object oriented analysis (OOA) and design (OOD).
- Having good Exp in Banking, Telecom & Retail domains.
- 2 years of experience in Hadoop Ecosystem providing and implementing solutions for Big Data Applications with excellent knowledge of Hadoop architecture.
- Strong experience in building, maintaining multiple Hadoop clusters of different sizes and configuration and setting up the rack topology for large clusters.
- Good understanding of Hadoop architecture and Hands-on experience with Hadoop components such as Job Tracker, Task Tracker, Name Node, Data Node and MapReduce concepts and HDFS Framework.
- Hands on experience in tuning Hadoop jobs, monitoring, and setting up standards and security for Big Data Systems.
- Expertise in optimizing MapReduce algorithms using Mappers, Reducers, Combiners and partitioners to deliver the best results for the large datasets.
- Experience in using Spark API over Hadoop Map Reduce to perform analytics on data.
- Good hands on experience in creating the RDD's, DF's for the required input data and performed the data transformations using Spark Scala.
- Expertise in writing MapReduce jobs using Java native code, Pig, Hive for various business use cases.
- Good experience in data loading from Oracle and MYSQL databases to HDFS system using Sqoop (Structure Data) and Flume (Log Files & XML).
- Experience in developing customized UDF's in java to extend Hive and Pig Latin functionality.
- Experience with NoSQL databases like HBase as well as other ecosystems like Zookeeper, Oozie.
- Extensive experience in middle-tier development using J2EE technologies like JDBC, Servlets, JSP.
- Expertise in RDBMS like Oracle, MS SQL Server, MySQL and DB2.
- Capable of processing large sets of structured, semi-structured and unstructured data and supporting systems application architecture.
- Knowledge on installing, configuring, monitoring, and maintaining HDFS, Yarn, HBase, Flume, Sqoop, Oozie, Pig, Hive.
- Installation of various Hadoop Ecosystems and Hadoop Daemons.
- Experience with web-based UI development using Angular.js, node.js, reac.tjs, Query UI, jQuery, CSS, HTML, HTML5, XHTML and JavaScript.
- Participated in multiple big data POC to evaluate different architectures, tools and vendor products.
TECHNICAL SKILLS
Big Data Ecosystem: Hadoop, MapReduce, YARN, Pig, Hive, HBase, Flume, Storm, Kafka, Sqoop, Impala, Oozie, Zookeeper, Spark, Mongo-DB, Cassandra, Elastic search and Kibana, Avro.
Hadoop Distributions: Cloudera, HDP, Amazon Web Services.
JAVA / J2EE Technologies: Java, Java Beans, Struts, Hibernates, J2EE (JSP, Servlets, EJB), SOA, JDBC, spring. Java, J2EE, Spring,SpringMVC, Rest web service,Struts,Rest and Spring WS (SOAP WebServices ),Hibernate,JSP,Servlet, JDBC XML, HTML, JavaScript.
NO SQL: Cassandra, Mongo DB, HBase.
DB Languages: SQL, MySQL, PL/SQL, Oracle, DB2.
Operating Systems: UNIX / LINUX, MS-DOS, Windows-XP/7/8/Vista, Mac.
Programming/Scripting Languages, Tools: C, C++, .NET in C#, HTML5, JS, JSON, PHP, XML, Visual Studio 2010, D2L (LMS).
Methodologies: Agile, UML, Design Patterns
Application Server: Apache Tomcat 5.x 6.0, GlassFish v3.1.2.2
Analysis and Reporting Tools: Splunk, Tableau
IDE / Testing Tools: NetBeans, Eclipse
Testing x API: Junit
PROFESSIONAL EXPERIENCE
Confidential
Java, Hadoop Developer
Responsibilities:
- Data Platform for the Homes that will accept any data from any device, Store it and compute on it, and allow any consumer (External or Internal) to leverage it.
- The Titan Data Platform will be a source of all customer data,a single point of integration into the system and will provide direct access for other systems and users.
- TDP is to analyze thermostat data using big data analysis which is pumped to AZURE Event HUB from TCC,LCC and CHIL clouds.
- Impletemented CHIL and DAS preprocessor using kafka stream api with spring boot.
- Written the DAS streaming jobs using spark with scala.
- Written unit test cases for preprocessors and DAS streaming service.
Technologies: Kafka Stream, Elassandra, Cassandra, Spark stream, Java, Azure, Java, JDK 1.8, Tomcat and GIT, C#, Kuberneetes, Docker, Kibana, Spring Rest API using Spring boot,Scala
Confidential
Java,Hadoop Developer
Responsibilities:
- Created use cases for the case create service utilization from various systems.
- Provided XML and JSON response format to support various service clients.
- Jackson processor for JSON data binding and JAXB for XML data binding are utilized.
- Designed and developed Customer Event API with all the CRUD capabilities.
- Designed and developed RESTful service interface using Spring MVC to the underlying customer event API.
- Utilized Jackson processor to handle JSON data binding for Request & Response objects.
- Developed rest web services using Spring MVC with rest controller.
- Used App Config files to integrate all the components in the Spring mvc framework.
- Implemented Test cases using JUnit and Tested web services with REST Client.
- Mongo template is used to establish communication with the MongoDB collection.
- Published API's for application services, generate CSV formatted data reports.
- Developed java RESTfull webservices to upload data from local to Amazon S3, listing S3 objects and file manipulation operations.
- Developed MapReduce programs to perform Quality Check, Sequence Alignment, SV/CNV detection on single-end/paired-end data.
- Designed and transmitted a RDBMS(SQL) Database to NOSQL MangoDB Database
- Analyzing Hadoop cluster and different Big Data analytic tools including Pig, Hive, HBase and Sqoop.
- Creating multiple MapReduce jobs in Pig and Hive for data cleaning and pre-processing.
- Successfully loading files to Hive and HDFS from Oracle, SQL Server using SQOOP.
- Writing Hive jobs to parse the logs and structure them in tabular format to facilitate effective querying on the log data.
- Creating Hive tables, loading with data and writing Hive queries.
- Involved in Spark for fast processing of data. Defining job flows.
- Using Hive to analyze the partitioned data and compute various metrics for reporting.
- Managing and reviewing the Hadoop log files.
- Using Pig as ETL tool to do Transformations, even joins and some pre-aggregations.
- Unit testing and delivered Unit test plans and results documents.
- Exporting data from HDFS environment into RDBMS using Sqoop for report generation and visualization purpose. when your hadoop cluster integrated with Kerberos security then authenticated user must exist in the every node where the task runs. Worked on Oozie workflow engine for job scheduling.
- Analyze or transform stored data by writing Mapreduce or Pig jobs based on business requirements.
- Worked with IT in installing CDH production cluster, commissioning & decommissioning of data node, name
Technologies: Hadoop, MapReduce, HBase HDFS, Hive, pig, Impala, Cassandra, spark, Kafka Java, SQL, PIG, Zookeeper, Sqoop, CentOS, Pentaho, Java, Spring MVC 4, Hibernate with Rest, JDK 1.8, No SQL, Tomcat and Oracle, GIT, WebLogic, Maven, Node.js,Angular.js
Confidential
Hadoop Developer
Responsibilities:
- Analyzing Hadoop cluster and different Big Data analytic tools including Pig, Hive, HBase and Sqoop.
- Creating multiple MapReduce jobs in Pig and Hive for data cleaning and pre - processing.
- Successfully loading files to Hive and HDFS from Oracle, SQL Server using SQOOP.
- Writing Hive jobs to parse the logs and structure them in tabular format to facilitate effective querying on the log data.
- Creating Hive tables, loading with data and writing Hive queries.
- Involved in Spark for fast processing of data. Defining job flows.
- Using Hive to analyze the partitioned data and compute various metrics for reporting.
- Managing and reviewing the Hadoop log files.
- Using Pig as ETL tool to do Transformations, even joins and some pre-aggregations.
- Unit testing and delivered Unit test plans and results documents.
- Exporting data from HDFS environment into RDBMS using Sqoop for report generation and visualization purpose.
- Worked on Oozie workflow engine for job scheduling.
Technologies: Hadoop, MapReduce, HDFS, Java, SQL, Zookeeper, Sqoop,, Java, Spring MVC 4 with Node.js, React.js, Angular,Rest, Hadoop,Pig,Hive,Kafta, JDK 1.8, No SQL, Tomcat and MongoDB, GIT, WebLogic, Maven, Cassandra
