Hadoop Developer Resume
Livonia, MI
SUMMARY
- Around 7 years of professional experience in Requirements Analysis, Design, Development and Implementation of Java, J2EE and Big Data technologies.
- 3+ years of exclusive experience in Big Data technologies and Hadoop ecosystem components such as HDFS, MapReduce, Pig, Hive, Sqoop, Spark, Storm, Kafka, and Flume.
- Full-scale knowledge of Hadoop components such as HDFS, JobTracker, NameNode, and DataNode.
- Extensive experience in MapReduce programming, Pig scripting, and analyzing data using HiveQL, Pig Latin, and HBase.
- Experience in designing and developing POCs in Spark using Scala to compare the performance of Spark with Hive and SQL/Oracle.
- Experience in managing and monitoring Hadoop clusters using Cloudera Manager.
- Experience using Hadoop distributions like Cloudera (CDH4, CDH5), Hortonworks (HDP 2.2, 2.4).
- Experience in writing Pig scripts to transform raw data from several data sources into baseline data.
- Experience in working with different NoSQL databases like HBase, MongoDB, and Cassandra.
- Handled different file formats like Parquet, Avro, and RCFile using different SerDes in Hive.
- Experience in importing and exporting data between HDFS/Hive/HBase and relational database systems using Sqoop.
- Experience with job workflow schedulers and monitoring applications such as Oozie and ZooKeeper.
- Experience in dumping shared data into HDFS from MySQL by writing shell scripts.
- Extensive experience in collecting, aggregating, and moving large chunks of data from different sources using Flume and Kafka.
- Used the HBase Java API to ingest processed data into HBase tables.
- Strong experience in working with UNIX/LINUX environments, writing shell scripts.
- Experience in designing and coding web applications using Core Java and J2EE technologies like JSP, Hibernate, Spring, Struts, JavaBeans, Servlets, EJB, RMI, XML, and JDBC.
- Experience in MVC (Model View Controller) architecture, using the Spring, Struts, and Hibernate frameworks with various Java/J2EE design patterns.
- Experience in building and deploying web applications on multiple application servers and middleware platforms, including WebLogic, WebSphere, Apache Tomcat, and JBoss.
- Experience in writing test cases in a Java environment using JUnit.
- Background with traditional databases such as Oracle, SQL Server, MySQL.
- Good understanding of ETL processes and Data warehousing.
- Extensive experience in writing stored procedures and functions using SQL and PL/SQL.
- Experience in Data Mapping, Data Modeling and Data Normalization.
- Hands-on experience with IDEs like Eclipse, NetBeans, and Visual Studio.
TECHNICAL SKILLS
Hadoop/Big Data: HDFS, MapReduce, HBase, Pig, Hive, Sqoop, Spark, ZooKeeper, Impala, Storm, Oozie, Cassandra, MongoDB.
Methodologies: Agile, UML, Design Patterns (Core Java and J2EE)
Programming: Java, J2EE, XML, Unix Shell scripting, HTML.
Java/J2EE Technologies: JDBC, JNDI, JSON, JSTL, RMI, JMS, JavaScript, JSP, Servlets, EJB, JSF, jQuery, AngularJS.
Modelling Tools: Rational Rose, StarUML, Visual Paradigm for UML
Databases: Oracle 11g/10g, MS-SQL Server, MySQL, MS-Access
Operating Systems: Windows 7/8/10, UNIX, Linux, Ubuntu.
PROFESSIONAL EXPERIENCE
Hadoop Developer
Confidential - Livonia, MI
Responsibilities:
- Analyzed the requirements for setting up a cluster.
- Installed and configured Hadoop, MapReduce, and HDFS (Hadoop Distributed File System); developed multiple MapReduce jobs in Java.
- Developed MapReduce programs in Java for parsing the raw data and populating staging tables.
- Developed Spark scripts by using Scala shell commands as per the requirement.
- Used Spark API over Cloudera Hadoop YARN to perform analytics on data in Hive.
- Developed Scala scripts and UDFs using both DataFrames/SQL and RDD/MapReduce in Spark for data aggregation, queries, and writing data back into the OLTP system through Sqoop.
- Imported and exported data to and from HDFS and Hive using Sqoop.
- Analyzed data with Hive and Pig.
- Wrote Pig scripts to process the data.
- Developed a data pipeline using Kafka and Spark to store data in HDFS and performed real-time analytics on the incoming data.
- Developed Pig Latin scripts to extract data from the web server output files and load it into HDFS.
- Integrated bulk data into the Cassandra file system using MapReduce programs.
- Involved in HBase setup and storing data into HBase, which will be used for further analysis.
- Experienced in managing and reviewing Hadoop log files.
- Experienced in defining job flows.
- Involved in creating Hive tables, loading them with data, and writing HiveQL queries, which run internally as MapReduce jobs.
- Extracted the data from MySQL into HDFS using Sqoop.
Environment: Cloudera, Hadoop, MapReduce, HDFS, Hive, Sqoop, HBase, Pig, Oozie, Storm, Kerberos, Java, Linux, Shell Scripting.
Confidential - Wisconsin
Hadoop Developer
Responsibilities:
- Mainly worked on Hive queries to categorize data of different claims.
- Integrated the Hive warehouse with HBase.
- Involved in loading data from the Linux file system to HDFS.
- Wrote customized Hive UDFs in Python where the functionality was too complex.
- Implemented Partitioning, Dynamic Partitions, Buckets in Hive.
- Designed and created Hive external tables using a shared metastore instead of Derby, with partitioning, dynamic partitioning, and buckets.
- Responsible for managing the test data coming from different sources.
- Reviewed peer table creation in Hive, data loading, and queries.
- Gained experience in managing and reviewing Hadoop log files.
- Deployed Hadoop Cluster in Fully Distributed and Pseudo-distributed modes.
- Involved in scheduling the Oozie workflow engine to run multiple Hive and Pig jobs.
- Involved in unit testing, interface testing, system testing, and user acceptance testing of the workflow tool.
- Supported setting up the QA environment and updating configurations for implementing scripts with Pig, Hive, and Sqoop.
Environment: Hortonworks, Hadoop, HDFS, Hive, MapReduce, Storm, Core Java, Pig, Oracle, MySQL.
Confidential
Java/J2EE Developer
Responsibilities:
- Responsible for understanding the business requirement.
- Worked with Business Analyst and helped to represent the business domain details in technical specifications.
- Also helped to develop UML Diagrams: Use Cases, Activity Diagram, Sequence Diagram, Class Diagram.
- Was also actively involved in setting coding standards and writing related documentation.
- Developed the Java Code using Eclipse as IDE.
- Developed JSP and Servlets to dynamically generate HTML and display the data to the client side.
- Developed application on Struts MVC architecture utilizing Action Classes, Action Forms and validations.
- Code Review & Debugging using Eclipse Debugger.
- Involved in the design and decision-making for Hibernate O/R mapping.
- Developed Hibernate mapping files (.hbm.xml) for mapping declarations.
- Configured queues in WebLogic Server to which messages were published using the JMS API.
- Consumed third-party web services (WSDL, SOAP) for authorizing payments to/from customers.
- Wrote and modified database queries and stored procedures for Oracle9i.
Environment: Java/J2EE, Oracle9i, JSP 1.1, HTML, JavaScript, JMS, Servlets, UML, XML, Eclipse, Web Services, WSDL, SOAP, ANT, JUnit, Log4j.
Confidential
Java Developer
Responsibilities:
- Involved in the process design, coding, and testing phases of the software development life cycle.
- Developed Servlets, JSP pages, Beans, JavaScript.
- Involved in developing a module for the transformation of files across the remote systems using JSP and servlets.
- The JSPs used HTML and JSP tags to build the view of the module, while the servlets acted as back-end Java classes that connected to the database, retrieved files and sent them across the systems, and stored files received from other systems.
- Developed various reports using Java and SQL queries.
- Participated in design, issue resolution, and sustainment meetings.
