We provide IT Staff Augmentation Services!

Sr.hadoop Developer Resume

4.00/5 (Submit Your Rating)

Jacksonville, FL

SUMMARY:

  • Overall 9+ years of experience in design and deployment of Enterprise Application Development, Web Applications, Client - Server Technologies, Web Programming using Java and Big data technologies.
  • Expertise on Hadoop architecture and ecosystem such as HDFS, MapReduce, Pig, Hive, Sqoop Flume and Oozie.
  • Complete Understanding on Hadoop daemons such as Job Tracker, Task Tracker, Name Node, Data Node and MRV1 and YARN architecture.
  • Experience in installation, configuration, Management, supporting and monitoring Hadoop cluster using various distributions such as Apache, Cloudera and AWS.
  • Experience in Installation and Configuring Hadoop Stack elements MapReduce, HDFS, Hive, PigSqoop, Flume, Oozieand Zookeeper.
  • Experience in data process and analysis using Map Reduce, HiveQL, and Pig Latin.
  • Extensive experience in Writing User Defined Functions (UDFs) in Hive and Pig.Experience working withCloudera Distributions of Hadoop
  • Converted Map Reduce applications to Spark.
  • Handled messaging services using Apache Kafka.
  • Excellent communication, interpersonal and analytical skills and a highly motivated team player with the ability to work independently.
  • Supported data analysis projects using Elastic Map Reduce on the Amazon Web Services (AWS) cloud. Exporting and importing data into S3.
  • Experience in working on AWS, Flume to load the log data from multiple sources directly into HDFS & running Pig and Hive scripts.
  • Involved in deployment of Hadoop cluster, addition/removal of nodes, troubleshooting when job is hung/failed, configured and tuned the cluster, find the bottlenecks, monitoring cluster, configured name-node high availability, configured schedulers, tookdata backups.
  • Worked with BI team and transform big data requirements into Hadoop centric technologies.
  • Worked in performance tuning of Hadoop cluster by gathering and analyzing the existing infrastructure.
  • Written MRUnit and unit Test cases and submitted Unit test results as per the quality process.
  • Involved in production monitoring using workflow monitor and experience in development and support environments.
  • Experience in Agile Methodology, Water Fall Methodology and Test Driven Development framework.
  • Experience in software configuration management using VSS, CVS and Subversion.

TECHNICAL SKILLS:

Big Data Technologies: Hadoop, HDFS, Hive, MapReduce, Pig, Sqoop, Flume, Oozie, Hadoop distribution, and HBase,Spark

Programming Languages: Java (5, 6, 7),Python,Scala

MySQL, SQL/PLSQL, MS: SQL Server 2005, Oracle 9i/10g/11g

Scripting/ Web Languages: JavaScript, HTML5, CSS3, XML, SQL, Shell

ETL Tools: Cassandra, HBASE,ELASTIC SEARCH, Alteryx.

Operating Systems: Linux, Windows XP/7/8

Software Life Cycles: SDLC, Waterfall and Agile models

MSOffice,MS: Project and Risk Analysis tools, Visio

Utilities/Tools: Eclipse, Tomcat, NetBeans, JUnit, SQL, SOAP UI, ANT, Maven, Automation and MR-Unit

Cloud Platforms: Amazon EC2

Visualization Tools: Tableau.

PROFESSIONAL EXPERIENCE:

Confidential, Jacksonville FL

Sr.Hadoop Developer

Responsibilities:

  • Involved in software detailed design, coding, testing application.
  • Involved in importing data from Weblog and Apps log using Flume and Spark Streaming.
  • Involved in importing data from Oracle to HDFS using SQOOP and spark streaming.
  • Involved in writing Map Reduce program and spark (Scala and Python) program and Hive queries to load and process data in Hadoop File System.
  • Involved in creating Hive tables, loading with data and writing hive queries which will run internally in map reduce way.
  • Supported Map Reduce Programs those are running on the cluster.
  • Involved in loading data from UNIX file system to HDFS.
  • Involved in creating Hive tables, loading with data and writing hive queries which will run internally in map reduce way.
  • Develop MapReduce jobs for the users. Maintain, update and schedule the periodic jobs which range from updates on periodic MapReduce jobs to creating ad-hoc jobs for the business users.
  • Importing and exporting data into HDFS and Hive using Sqoop.
  • Experienced in defining job flows.
  • Experienced in managing and reviewing Hadoop log files.
  • Extracted files from Couch DB through Sqoop and placed in HDFS and processed.
  • Experienced in running Hadoop streaming jobs to process terabytes of xml format data.
  • Load and transform large sets of structured, semi structured and unstructured data.
  • Responsible to manage data coming from different sources.
  • Got good experience with NOSQL database.
  • Developed a custom File System plug in for Hadoop so it can access files on Data Platform.
  • This plugin allows Hadoop MapReduce programs, HBase, Pig and Hive to work unmodified and access files directly.
  • Designed and implemented Mapreduce-based large-scale parallel relation-learning system
  • Extracted feeds form social media sites such as Facebook, Twitter using Python scripts.
  • Setup and benchmarked Hadoop/HBase clusters for internal use.
  • Created Hive tables to store the processed results in a tabular format.
  • Writing the script files for processing data and loading to HDFS
  • Developed the UNIX shell/Python scripts for creating the reports from Hive data.
  • Exported the result set from Hive to SQL using Shell scripts.
  • Used Spark API over Hortonworks Data Platform, YARN to perform analytics on data in Hive.
  • Use Sqoop as a bridge scripts to pull data from MySQL database
  • Experienced in performance tuning of Spark Applications for setting right Batch Interval time, correct level of Parallelism and memory tuning.
  • Involved in migration of Flume ingestion to AWS Kinesis Firehose.
  • Involved in Hadoop installation and Configuration on Dev/IT/TEST Env
  • Providing End to End support for the testing activities during System Testing, UAT.
  • Maintain source code using SVN.

Environment: Hadoop, Hive, Pig, Spark, Scala, Oozie, Hue, HBase, Flume, AWS, S3, EMR, Kafka, Hdfs, Sqoop,spark (scala, python) Cloudera distribution, Unix Shell scripting

Confidential, TX

Sr.Hadoop Developer

Responsibilities:

  • Involved in software detailed design, coding, testing application.
  • Performing Code reviews with the team, design reviews with the Architects.
  • Developed application using Java/J2EE Technologies, Web services.
  • Delivered new and complex high quality solutions to clients using IBM Products/Tools, J2EE, EJB, Servlets, JSP, HTML, JavaScript and XML
  • Involved in importing data from Weblog and Apps log using Flume.
  • Involved in Map Reduce and Hive Optimization.
  • Developed Server side validation checks using Struts validators and Java Script validations.
  • With JSP's and Struts custom tags, developed and implemented validations of data.
  • Developed applications, which access the database with JDBC to execute queries, prepared statements, and procedures.
  • Developed programs to manipulate the data and perform CRUD operations on request to the database.
  • Used message driven beans for asynchronous processing alerts to the customer.
  • Worked on developing Use Cases, Class Diagrams, Sequence diagrams, and Data Models.
  • Developed and Deployed SOAP Based Web Services on Tomcat Server
  • Coding of SQL, PL/SQL, and Views using IBMDB2 for the database.
  • Working on issues while converting JAVA to AJAX.
  • Supported in developing business tier using the stateless session bean.
  • Extensively used JDBC to access the database objects.
  • Involved in importing data from Oracle to HDFS using SQOOP.
  • Involved in writing Map Reduce program and Hive queries to load and process data in Hadoop File System.
  • Involved in creating Hive tables, loading with data and writing hive queries which will run internally in map reduce way.
  • Preparing builds, deploy and Co-ordinate with the release management team to ensure that the proper process is followed during the release.
  • Involved in Hadoop installation and Configuration on Dev/IT/TEST Env
  • Providing End to End support for the testing activities during System Testing, UAT.
  • Maintain source code using SVN.

Environment: CDH, Java(JDK1.7), Hadoop, MapReduce, HDFS, Hive, Sqoop, Flume, HBase, Cassandra, Pig, Oozie, Kerberos, Scala, Spark, SparkSQL, Spark Streaming, Kafka, Linux, AWS, Shell Scripting, MySQL Oracle 11g, PL/SQL, SQL*PLUS

Confidential, Chicago, IL

Hadoop Developer

Responsibilities:

  • Installed and configured Hadoop Mapreduce, HDFS, Developed multiple MapReduce jobs in java for data cleaning and pre-processing.
  • Supported Map Reduce Programs those are running on the cluster.
  • Involved in loading data from UNIX file system to HDFS.
  • Involved in creating Hive tables, loading with data and writing hive queries which will run internally in map reduce way.
  • Develop MapReduce jobs for the users. Maintain, update and schedule the periodic jobs which range from updates on periodic MapReduce jobs to creating ad-hoc jobs for the business users.
  • Importing and exporting data into HDFS and Hive using Sqoop.
  • Experienced in defining job flows.
  • Experienced in managing and reviewing Hadoop log files.
  • Extracted files from Couch DB through Sqoop and placed in HDFS and processed.
  • Experienced in running Hadoop streaming jobs to process terabytes of xml format data.
  • Load and transform large sets of structured, semi structured and unstructured data.
  • Responsible to manage data coming from different sources.
  • Got good experience with NOSQL database.
  • Developed a custom File System plug in for Hadoop so it can access files on Data Platform.
  • This plugin allows Hadoop MapReduce programs, HBase, Pig and Hive to work unmodified and access files directly.
  • Designed and implemented Mapreduce-based large-scale parallel relation-learning system
  • Extracted feeds form social media sites such as Facebook, Twitter using Python scripts.
  • Setup and benchmarked Hadoop/HBase clusters for internal use.
  • Gained very good business knowledge on health insurance, claim processing, fraud suspect identification, appeals process etc.

Environment: Hadoop, MapReduce, HDFS, Hive, Oracle, Java, Struts, Servlets, HTML, XML, SQL, J2EE, JUnit, Tomcat.

Confidential, Tampa, FL

Hadoop Developer

Responsibilities:

  • Developed application using Java/J2EE Technologies, Web services.
  • Provided End to End support for the testing activities during System Testing, UAT.
  • Delivered new and complex high quality solutions to clients using IBM Products/Tools, J2EE, EJB, Servlets, JSP, HTML, JavaScript and XML.
  • Developed the web applications using Struts, Spring Framework.
  • Applied Spring Framework for transaction Management and Spring JDBC, and for AOP and Dependency Injection.
  • Developed application using Java/J2EE Technologies, Web services.
  • Provided End to End support for the testing activities during System Testing, UAT.
  • Delivered new and complex high quality solutions to clients using IBM Products/Tools, J2EE, EJB, Servlets, JSP, HTML, JavaScript and XML.
  • Developed the web applications using Struts, Spring Framework.
  • Implemented Log4j framework for logging mechanism.
  • Developed Spring DAO to integrate with Hibernate ORM.
  • Developed Hibernate mapping with Spring, Hibernate queries (HQL) and criteria queries.
  • Developed the Web Services using Jersey framework based on REST Architecture, which are used by other Applications
  • Wrote SQL queries and Stored procedures for the application.
  • Wrote Test cases using JUnit.
  • Tested raw data and executed performance scripts.
  • Optimized Map/Reduce Jobs to use HDFS efficiently by using various compression mechanisms
  • Exported the result set from Hive to Netezza using Shell scripts.
  • Involved in converting Hive/SQL queries into Spark transformations using Spark RDDs
  • Implemented Log4j framework for logging mechanism.

Environment: Core Java, EJB, Hibernate, AWS, JSF, Struts, Spring, JPA, REST, JBoss, DB2, Oracle, XML, JUnit, HTML4, CSS, JavaScript, Apache Tomcat 5x, Log4j .

Confidential . San Jose, CA

Hadoop Developer

Responsibilities:

  • Analysis the Requirements preparing estimate and preparing the documentation (Impact Analysis, detail design document, Implementation plan).
  • Applied Spring Framework for transaction Management and Spring JDBC, and for AOP and Dependency Injection.
  • Developed Spring DAO to integrate with Hibernate ORM.
  • Developed Hibernate mapping with spring, Hibernate queries (HQL) and criteria queries.
  • Developed the Web Services using Jersey framework based on REST Architecture, which are used by other Applications
  • Used Jenkins continuous integration tool to do the deployments.
  • Worked on JDBC for database connections.
  • Worked on multithreaded middleware using socket programming to introduce whole set of new business rules implementing OOPS design and principles.
  • Involved in implementing Java multithreading concepts.
  • Developed several REST web services supporting both XML and JSON to perform task such as demand response management.
  • Used Servlet, Java and Spring for server side business logic.
  • Implemented the log functionality by using Log4j and internal logging API's.
  • Used Junit for server side testing.
  • Wrote SQL queries and Stored procedures for the application.
  • Wrote Test cases using JUnit.
  • Developed code base to stream data from topic to Storm Spout Bolt and Database.
  • Identifying the errors in the logs and rescheduling/resuming the job, killing the topology and start again.
  • Configured, deployed and maintained a single node storm cluster in DEV environment.
  • Write queries to retrieve the data from MarkLogic database.
  • Work on XML configuration for the MarkLogic and Production boxes.
  • Provide Post deployment support
  • Preparing the Test plan, Test cases (unit testing, System Testing) and also involving in unit testing of all development and modifications and support for UAT
  • Involved in bug fixing by debugging the existing codes
  • Worked on production tickets of the application.

Environment: Java, Eclipse, Oracle, Sub Version, Hadoop, Hive, HBase, Linux, MapReduce, HDFS, Hive, Java (JDK), Hadoop Distribution of Horton Works, Cloudera, MapReduce, DataStax, IBM DataStage, Oracle, PL/SQL, SQL*PLUS, UNIX Shell Scripting.

We'd love your feedback!