Big Data Developer Resume
Denver, CO
PROFESSIONAL SUMMARY:
- Software professional over 6+ years of experience in the IT industry including 2.5+ years of experience in implementing Big data solutions using Cloudera Apache Hadoop distributing system and good communication skills.
- Experience in analyzing data using Hadoop Ecosystem, handling wif elements based on Map Reduce, HIVE, HBase, HBase - Hive Integration, PIG, HDFS, Zookeeper, Sqoop, Oozie and Flume.
- In depth noledge of Hadoop Architecture (YARN).
- Involved in full Software Development Life Cycle of projects including design, development, testing and implementation of Client Server and Web applications, experience in understanding Data Quality processes, methods.
- Experience in Object Oriented Analysis, Design (OOAD) and development of software using all of the SDLC Methodologies, good noledge of J2EE design patterns and Core Java design patterns.
- Good Understanding on the MVC Architecture (Struts, spring) and building up the Applications likewise.
- Working experience in developing Enterprise Applications using Java, JSP, Servlets, Hibernate, JBoss, JDBC, EJB, Web Logic, Web Sphere, JQuery, XML, HTML, RMI, Java Script and Ajax.
- Hands on experience in application development using RDBMS, and UNIX Linux shell scripting.
- Extension of Custom User Defined function’s (UDF’s) using PIG and HIVE core functionalities.
- Experience in stacking Unstructured, Structured Data from unique sources into HDFS.
- Robust experience in NOSQL databases such as HBase and MongoDB.
- Experience in debugging Hadoop component, stability, and performance issues.
- Good Knowledge on Apache Spark open-source data analytics cluster computing framework.
- Expertise in importing and exporting data using Sqoop from Relational Database Systems to HDFS and vice-versa.
- Experience in workflow management using Oozie.
- Knowledge on understanding the elements of Data Mining and techniques of Machine Learning.
- Good noledge on integrating the BI tools like Tableau wif the Hadoop stack and extracting the required Data.
- Used HBase wif Hive for real time low latency queries.
- Experience in creating Map Reduce jobs in Java as per the business requirements.
- Good experience in Oozie Framework and Automating daily import jobs.
- Knowledge in Talend open source ETL tool.
- Worked wif Tableau for report, DASHBOARD creation and further analysis from the front end.
- Proficient in Optimization of Sql Queries and concepts of Database.
- Experience in using the version control tools like SVN and CVS.
- Potential to adapt dynamically to the team environment wif the capability to increase productivity and follow administrative governance instructions in creating better value to the project.
- Ability to perform at a high level, meet deadlines, adaptable to ever changing priorities.
- Experience in AGILE Methodology /SCRUM Framework.
TECHNICAL SKILLS:
Big data/Hadoop Ecosystem: Map Reduce, HDFS, YARN, HIVE, PIG, HBase, Sqoop, Oozie, Flume, Kafka
Programming Languages: Linux shell scripts, C,C++, Java, Scala, SQL, PL/SQL
NoSQL Databases: HBase, MongoDB
Relational Database: Mysql, Oracle 11g/10g, MS-SQL Server, Teradata
Web Technologies: JDBC, JSP, JavaScript, AJAX, SOAP, HTML, XML
Web Servers: Web logic 10.3, Web Sphere 6.1, Apache Tomcat 6.0
Frameworks: MVC, Struts 2/1, Hibernate 3, Spring 3/2.5/2
Tools Used: SQL Developer, Eclipse, Putty, cygwin, Win SCP, SVN
Operating System: Ubuntu (Linux), Win 95/98/2000/XP, Red Hat, Mac OS
ETL Tools: Talend, Informatica
Monitoring Tools: Ganglia, Nagios
Reporting Tools: Tableau
Methodologies: UML, Design Patterns, Agile
PROFESSIONAL EXPERIENCE
Big Data Developer
Confidential, Denver, CO
Responsibilities:
- Loaded Structured data using SQOOP, Unstructured data using FLUME from multiple sources ingesting into HDFS.
- Developed Map Reduce programs to parse the raw data, populate staging tables and store the refined data in partitioned tables in the EDW.
- Created Hive queries that helped market analysts spot emerging trends by comparing fresh data wif EDW reference tables and historical metrics.
- Worked on analysing Hadoop cluster using different big data analytic tools including Pig, Hive and Sqoop.
- Worked on Ingestion of data from various RDBMS Systems to HDFS through sqoop and vice-versa.
- Created Hive external tables for the data in HDFS and moved data from archive layer to business layer wif Hive transformations.
- Worked on Pig and hive scripts for data joins and experienced in writing PIG-UDF's for data cleansing.
- Supported in setting up QA environment and updating configurations for implementing scripts wif Pig and Sqoop.
- Used Hbase as NOSQL Data base for faster data access patterns.
- Worked on Talend ETL POC.
Environment: HDFS, CDH 5.3, Impala, Mahout, Map-Reduce, Apache Crunch, Hive, Pig, Sqoop, SQL server, Flume, Spark, Lambda, Zookeeper, Oozie, DB2, Hbase and Pentaho.
Big Data Developer
Confidential, Denver, CO
Responsibilities:
- Provided day to day development and production support in Hadoop platform.
- Manage and monitor Hadoop cluster and platform infrastructure, including recovering from node failures and troubleshooting Hadoop cluster issues.
- Loaded log files using Flume into HDFS and process the data.
- Loaded files from MongoDB into HDFS using Sqoop.
- Load data from Mysql and Oracle RDBMS tables into HDFS using Sqoop.
- Tune HiveQL and Pig scripts for user reporting.
- Used Pig for data processing and data cleansing on the data ingested.
- Involved in pushing the cleansed dataset from HDFS to HBASE using Map Reduce programs.
- Responsible to manage data coming from disparate sources.
- Supported Map Reduce Programs those are running on the cluster.
- Used Oozie Scheduler system to automate the pipeline workflow and orchestrate the map reduces jobs that extract the data on a timely manner.
- Experienced in running Hadoop streaming jobs to process terabytes of xml format data.
- Load and transform large sets of structured, semi structured and unstructured data.
Environment: Hadoop, MapReduce, Hive, HDFS, PIG, Impala, Sqoop, Oozie, lambda, Cloudera, Flume, Hbase, Zookeeper, CDH3, MongoDB, Oracle, NoSQL and Unix/Linux, Kafka.
Big Data Developer
Confidential, Oakland, CA
Responsibilities:
- Created HBase tables to load large sets of structured, semi structured and unstructured data coming from UNIX, NoSQL and a variety of portfolios.
- Created reports for the BI team using Sqoop to export data into HDFS and Hive.
- Developed multiple Map Reduce jobs in Java for data cleaning and preprocessing.
- Worked on analyzing Hadoop cluster using different big data analytic tools including Pig, Hive, and Map Reduce on EC2.
- Collecting and aggregating large amounts of log data using Apache Flume and staging data in HDFS for further analysis.
- Gained experience in managing and reviewing Hadoop log and JSON files.
- Written HIVE UDF for Data Cleaning.
- ExperiencedinmanagingandreviewingHadooplogfiles.
- Responsibletomanagedatacomingfromdifferentsources.
- Assisted in monitoring the hadoop cluster using Ganglia tool.
- Involved in developing PIG Scripts and Hive Reports.
- Created buckets and partitions using hive for data analysis.
- Load data into HDFS from RDBMS using Sqoop.
Environment: Apache Hadoop, RDBMS, HBase, HDFS, Map Reduce, Hive, Pig, Sqoop, Flume, Oozie, Ganglia, CoreJava, JSP, RMI, JNDI, JDBC, Hibernate, Shell Scripting, Webservices.
Java Developer
Confidential
Responsibilities:
- Worked on complete life cycle, Design, development and testing using OOA and OOD.
- Implemented the presentation layer wif HTML, XHTML, JavaScript, JSP, and CSS.
- Used JQUERY for UI and AJAX for reports display.
- Created the web application using MVC pattern to manage runtime content, in the form of POIs.
- Used Web Services to extract customer related product data from machines and servers using WSDL, XML, SOAP using Service Oriented Architecture.
- Used EJBs to develop business logic and coded reusable components in Java Beans.
- Played a crucial role in developing persistence layer.
- Developed and utilized J2EE Services and JMS components for messaging communication in Web Logic.
- Development of database interaction code to JDBC API making extensive use of SQL.
- Query Statements and advanced Prepared Statements.
- Involved in writing complex SQL queries to validate converted date.
- Responsible for Planning & tracking of the enhancement.
- Responsible in deploying the application into System Test Environment.
- Ensured seamless deployment of application into production.
- Developed application using Eclipse and used build and deploy tool as Maven.
- Deployed the applications on multiple Web Logic Server and maintained Load balancing,
- Analyzed heap dumps to fix the memory leaks.
- Implemented Unit Testing Framework Driven Using Junit.
- Prepared the Installation, Customer guide and Configuration document which were delivered to the customer along wif the product Involved in development, and Testing, phases of the project by following Agile methodology.
Environment: Core Java, J2EE, EJB, JSP, Servlets, MVC, WebSphere 7.0, Hibernate, Web Services, ANT, SOAP, WSDL, JavaScript, PL/SQL, CVS, Oracle 10gi, JUnit, Unix.
Java Developer
Confidential
Responsibilities:
- Involved in the analysis, design, and development and testing phases of application using SCRUM AGILE methodology.
- Created Use case, Sequence diagrams, functional specifications and User Interface diagrams using Star UML.
- Involved in complete requirement analysis, design, coding and testing phases of the project.
- Developed the Presentation Layer using Spring MVC Framework and also used Struts Tiles Layout for View Transactions and User Activity Monitoring module.
- Implemented the Web Services functionality in the application to allow external applications to access data.
- Implemented cross cutting concerns like business transactions, security, Logging and monitoring mechanism using Spring AOP.
- Developed user interfaces using JSP, HTML, XML and JavaScript.
- Generated XML Schemas and used XML Beans to parse XML files.
- Developed the code which will create XML files and Flat files wif the data retrieved from Databases and XML files.
- Created Data sources and Helper classes which will be utilized by all the interfaces to access the data and manipulate the data.
- Used JMS to communicate wif the Mainframe.
- Followed TDD Test Driven Development and developed test cases by using Junit for unit testing for each and every module developed.
- Web logic is used as the application production server.
Environment: Java, JSP, Servlets, Spring AOP, Spring IOC, Spring ORM, Spring MVC, Hibernate, DAO, DTO, XML, JDBC, Transactions, JavaScript, JQuery, JSON, Apache Tomcat, WebSphere Application Server, SOAP, WSDL, JMS, Eclipse, Oracle, CVS, JUnit, log4j.
