Sr.hadoop Developer Resume
Jacksonville, FL
SUMMARY:
- Overall 9+ years of experience in design and deployment of Enterprise Application Development, Web Applications, Client - Server Technologies, Web Programming using Java and Big data technologies.
- Expertise on Hadoop architecture and ecosystem such as HDFS, MapReduce, Pig, Hive, Sqoop Flume and Oozie.
- Complete Understanding on Hadoop daemons such as Job Tracker, Task Tracker, Name Node, Data Node and MRV1 and YARN architecture.
- Experience in installation, configuration, Management, supporting and monitoring Hadoop cluster using various distributions such as Apache, Cloudera and AWS.
- Experience in Installation and Configuring Hadoop Stack elements MapReduce, HDFS, Hive, PigSqoop, Flume, Oozieand Zookeeper.
- Experience in data process and analysis using Map Reduce, HiveQL, and Pig Latin.
- Extensive experience in Writing User Defined Functions (UDFs) in Hive and Pig.Experience working withCloudera Distributions of Hadoop
- Converted Map Reduce applications to Spark.
- Handled messaging services using Apache Kafka.
- Excellent communication, interpersonal and analytical skills and a highly motivated team player with the ability to work independently.
- Supported data analysis projects using Elastic Map Reduce on the Amazon Web Services (AWS) cloud. Exporting and importing data into S3.
- Experience in working on AWS, Flume to load the log data from multiple sources directly into HDFS & running Pig and Hive scripts.
- Involved in deployment of Hadoop cluster, addition/removal of nodes, troubleshooting when job is hung/failed, configured and tuned the cluster, find the bottlenecks, monitoring cluster, configured name-node high availability, configured schedulers, tookdata backups.
- Worked with BI team and transform big data requirements into Hadoop centric technologies.
- Worked in performance tuning of Hadoop cluster by gathering and analyzing the existing infrastructure.
- Written MRUnit and unit Test cases and submitted Unit test results as per the quality process.
- Involved in production monitoring using workflow monitor and experience in development and support environments.
- Experience in Agile Methodology, Water Fall Methodology and Test Driven Development framework.
- Experience in software configuration management using VSS, CVS and Subversion.
TECHNICAL SKILLS:
Big Data Technologies: Hadoop, HDFS, Hive, MapReduce, Pig, Sqoop, Flume, Oozie, Hadoop distribution, and HBase,Spark
Programming Languages: Java (5, 6, 7),Python,Scala
MySQL, SQL/PLSQL, MS: SQL Server 2005, Oracle 9i/10g/11g
Scripting/ Web Languages: JavaScript, HTML5, CSS3, XML, SQL, Shell
ETL Tools: Cassandra, HBASE,ELASTIC SEARCH, Alteryx.
Operating Systems: Linux, Windows XP/7/8
Software Life Cycles: SDLC, Waterfall and Agile models
MSOffice,MS: Project and Risk Analysis tools, Visio
Utilities/Tools: Eclipse, Tomcat, NetBeans, JUnit, SQL, SOAP UI, ANT, Maven, Automation and MR-Unit
Cloud Platforms: Amazon EC2
Visualization Tools: Tableau.
PROFESSIONAL EXPERIENCE:
Confidential, Jacksonville FL
Sr.Hadoop Developer
Responsibilities:
- Involved in software detailed design, coding, testing application.
- Involved in importing data from Weblog and Apps log using Flume and Spark Streaming.
- Involved in importing data from Oracle to HDFS using SQOOP and spark streaming.
- Involved in writing Map Reduce program and spark (Scala and Python) program and Hive queries to load and process data in Hadoop File System.
- Involved in creating Hive tables, loading with data and writing hive queries which will run internally in map reduce way.
- Supported Map Reduce Programs those are running on the cluster.
- Involved in loading data from UNIX file system to HDFS.
- Involved in creating Hive tables, loading with data and writing hive queries which will run internally in map reduce way.
- Develop MapReduce jobs for the users. Maintain, update and schedule the periodic jobs which range from updates on periodic MapReduce jobs to creating ad-hoc jobs for the business users.
- Importing and exporting data into HDFS and Hive using Sqoop.
- Experienced in defining job flows.
- Experienced in managing and reviewing Hadoop log files.
- Extracted files from Couch DB through Sqoop and placed in HDFS and processed.
- Experienced in running Hadoop streaming jobs to process terabytes of xml format data.
- Load and transform large sets of structured, semi structured and unstructured data.
- Responsible to manage data coming from different sources.
- Got good experience with NOSQL database.
- Developed a custom File System plug in for Hadoop so it can access files on Data Platform.
- This plugin allows Hadoop MapReduce programs, HBase, Pig and Hive to work unmodified and access files directly.
- Designed and implemented Mapreduce-based large-scale parallel relation-learning system
- Extracted feeds form social media sites such as Facebook, Twitter using Python scripts.
- Setup and benchmarked Hadoop/HBase clusters for internal use.
- Created Hive tables to store the processed results in a tabular format.
- Writing the script files for processing data and loading to HDFS
- Developed the UNIX shell/Python scripts for creating the reports from Hive data.
- Exported the result set from Hive to SQL using Shell scripts.
- Used Spark API over Hortonworks Data Platform, YARN to perform analytics on data in Hive.
- Use Sqoop as a bridge scripts to pull data from MySQL database
- Experienced in performance tuning of Spark Applications for setting right Batch Interval time, correct level of Parallelism and memory tuning.
- Involved in migration of Flume ingestion to AWS Kinesis Firehose.
- Involved in Hadoop installation and Configuration on Dev/IT/TEST Env
- Providing End to End support for the testing activities during System Testing, UAT.
- Maintain source code using SVN.
Environment: Hadoop, Hive, Pig, Spark, Scala, Oozie, Hue, HBase, Flume, AWS, S3, EMR, Kafka, Hdfs, Sqoop,spark (scala, python) Cloudera distribution, Unix Shell scripting
Confidential, TX
Sr.Hadoop Developer
Responsibilities:
- Involved in software detailed design, coding, testing application.
- Performing Code reviews with the team, design reviews with the Architects.
- Developed application using Java/J2EE Technologies, Web services.
- Delivered new and complex high quality solutions to clients using IBM Products/Tools, J2EE, EJB, Servlets, JSP, HTML, JavaScript and XML
- Involved in importing data from Weblog and Apps log using Flume.
- Involved in Map Reduce and Hive Optimization.
- Developed Server side validation checks using Struts validators and Java Script validations.
- With JSP's and Struts custom tags, developed and implemented validations of data.
- Developed applications, which access the database with JDBC to execute queries, prepared statements, and procedures.
- Developed programs to manipulate the data and perform CRUD operations on request to the database.
- Used message driven beans for asynchronous processing alerts to the customer.
- Worked on developing Use Cases, Class Diagrams, Sequence diagrams, and Data Models.
- Developed and Deployed SOAP Based Web Services on Tomcat Server
- Coding of SQL, PL/SQL, and Views using IBMDB2 for the database.
- Working on issues while converting JAVA to AJAX.
- Supported in developing business tier using the stateless session bean.
- Extensively used JDBC to access the database objects.
- Involved in importing data from Oracle to HDFS using SQOOP.
- Involved in writing Map Reduce program and Hive queries to load and process data in Hadoop File System.
- Involved in creating Hive tables, loading with data and writing hive queries which will run internally in map reduce way.
- Preparing builds, deploy and Co-ordinate with the release management team to ensure that the proper process is followed during the release.
- Involved in Hadoop installation and Configuration on Dev/IT/TEST Env
- Providing End to End support for the testing activities during System Testing, UAT.
- Maintain source code using SVN.
Environment: CDH, Java(JDK1.7), Hadoop, MapReduce, HDFS, Hive, Sqoop, Flume, HBase, Cassandra, Pig, Oozie, Kerberos, Scala, Spark, SparkSQL, Spark Streaming, Kafka, Linux, AWS, Shell Scripting, MySQL Oracle 11g, PL/SQL, SQL*PLUS
Confidential, Chicago, IL
Hadoop Developer
Responsibilities:
- Installed and configured Hadoop Mapreduce, HDFS, Developed multiple MapReduce jobs in java for data cleaning and pre-processing.
- Supported Map Reduce Programs those are running on the cluster.
- Involved in loading data from UNIX file system to HDFS.
- Involved in creating Hive tables, loading with data and writing hive queries which will run internally in map reduce way.
- Develop MapReduce jobs for the users. Maintain, update and schedule the periodic jobs which range from updates on periodic MapReduce jobs to creating ad-hoc jobs for the business users.
- Importing and exporting data into HDFS and Hive using Sqoop.
- Experienced in defining job flows.
- Experienced in managing and reviewing Hadoop log files.
- Extracted files from Couch DB through Sqoop and placed in HDFS and processed.
- Experienced in running Hadoop streaming jobs to process terabytes of xml format data.
- Load and transform large sets of structured, semi structured and unstructured data.
- Responsible to manage data coming from different sources.
- Got good experience with NOSQL database.
- Developed a custom File System plug in for Hadoop so it can access files on Data Platform.
- This plugin allows Hadoop MapReduce programs, HBase, Pig and Hive to work unmodified and access files directly.
- Designed and implemented Mapreduce-based large-scale parallel relation-learning system
- Extracted feeds form social media sites such as Facebook, Twitter using Python scripts.
- Setup and benchmarked Hadoop/HBase clusters for internal use.
- Gained very good business knowledge on health insurance, claim processing, fraud suspect identification, appeals process etc.
Environment: Hadoop, MapReduce, HDFS, Hive, Oracle, Java, Struts, Servlets, HTML, XML, SQL, J2EE, JUnit, Tomcat.
Confidential, Tampa, FL
Hadoop Developer
Responsibilities:
- Developed application using Java/J2EE Technologies, Web services.
- Provided End to End support for the testing activities during System Testing, UAT.
- Delivered new and complex high quality solutions to clients using IBM Products/Tools, J2EE, EJB, Servlets, JSP, HTML, JavaScript and XML.
- Developed the web applications using Struts, Spring Framework.
- Applied Spring Framework for transaction Management and Spring JDBC, and for AOP and Dependency Injection.
- Developed application using Java/J2EE Technologies, Web services.
- Provided End to End support for the testing activities during System Testing, UAT.
- Delivered new and complex high quality solutions to clients using IBM Products/Tools, J2EE, EJB, Servlets, JSP, HTML, JavaScript and XML.
- Developed the web applications using Struts, Spring Framework.
- Implemented Log4j framework for logging mechanism.
- Developed Spring DAO to integrate with Hibernate ORM.
- Developed Hibernate mapping with Spring, Hibernate queries (HQL) and criteria queries.
- Developed the Web Services using Jersey framework based on REST Architecture, which are used by other Applications
- Wrote SQL queries and Stored procedures for the application.
- Wrote Test cases using JUnit.
- Tested raw data and executed performance scripts.
- Optimized Map/Reduce Jobs to use HDFS efficiently by using various compression mechanisms
- Exported the result set from Hive to Netezza using Shell scripts.
- Involved in converting Hive/SQL queries into Spark transformations using Spark RDDs
- Implemented Log4j framework for logging mechanism.
Environment: Core Java, EJB, Hibernate, AWS, JSF, Struts, Spring, JPA, REST, JBoss, DB2, Oracle, XML, JUnit, HTML4, CSS, JavaScript, Apache Tomcat 5x, Log4j .
Confidential . San Jose, CA
Hadoop Developer
Responsibilities:
- Analysis the Requirements preparing estimate and preparing the documentation (Impact Analysis, detail design document, Implementation plan).
- Applied Spring Framework for transaction Management and Spring JDBC, and for AOP and Dependency Injection.
- Developed Spring DAO to integrate with Hibernate ORM.
- Developed Hibernate mapping with spring, Hibernate queries (HQL) and criteria queries.
- Developed the Web Services using Jersey framework based on REST Architecture, which are used by other Applications
- Used Jenkins continuous integration tool to do the deployments.
- Worked on JDBC for database connections.
- Worked on multithreaded middleware using socket programming to introduce whole set of new business rules implementing OOPS design and principles.
- Involved in implementing Java multithreading concepts.
- Developed several REST web services supporting both XML and JSON to perform task such as demand response management.
- Used Servlet, Java and Spring for server side business logic.
- Implemented the log functionality by using Log4j and internal logging API's.
- Used Junit for server side testing.
- Wrote SQL queries and Stored procedures for the application.
- Wrote Test cases using JUnit.
- Developed code base to stream data from topic to Storm Spout Bolt and Database.
- Identifying the errors in the logs and rescheduling/resuming the job, killing the topology and start again.
- Configured, deployed and maintained a single node storm cluster in DEV environment.
- Write queries to retrieve the data from MarkLogic database.
- Work on XML configuration for the MarkLogic and Production boxes.
- Provide Post deployment support
- Preparing the Test plan, Test cases (unit testing, System Testing) and also involving in unit testing of all development and modifications and support for UAT
- Involved in bug fixing by debugging the existing codes
- Worked on production tickets of the application.
Environment: Java, Eclipse, Oracle, Sub Version, Hadoop, Hive, HBase, Linux, MapReduce, HDFS, Hive, Java (JDK), Hadoop Distribution of Horton Works, Cloudera, MapReduce, DataStax, IBM DataStage, Oracle, PL/SQL, SQL*PLUS, UNIX Shell Scripting.
