We provide IT Staff Augmentation Services!

Hadoop Developer Resume

0/5 (Submit Your Rating)

Piscataway, NJ

SUMMARY:

  • Total of 7+ years of IT experience with expertise in Distributed processing capability (HADOOP).
  • Around 2.5 years of work experience in Big Data domain, having excellent knowledge on HADOOP Administration/Development and processing/analyzing huge volume of data.
  • 4.5 years of Experience with multiple SDLC cycles developing projects on Java, Lotus Domino, Lotus script and Web development with Lotus Notes, Java, PHP, MySQL, and DB2. Have extensively used Web development aids like Java Script and DHTML. Have worked with DB2 with JDBC connectivity.
  • Experience in working with MapReduce programs using Apache Hadoop for working with Big Data.
  • Experience in analyzing data using HiveQL, Pig Latin, HBase and custom MapReduce programs in Java. experience with Hadoop/Hbase, Hotonworks or Cloudera.
  • Conversant with Web/application Servers - Tomcat, Websphere, Weblogic and Jboss servers.
  • Worked on NoSQL databases including HBase. Knowledge in job workflow scheduling and monitoring tools like oozie and Zookeeper.Familiar with Java virtual machine (JVM) and multi-threaded processing.
  • Up to date on evaluating new analytical tools and projects coming up big data space like Apache Spark and Apache Shark, Datameer, Platfora etc.
  • In-depth understanding of Data Structures and Algorithms.
  • Experience in writing Shell Scripts (bash, SSH, Perl).
  • Proficient inJava/J2EEtechnologies likeJSP, Hibernate, spring, Struts, Java Servlets, AJAX, Java Beans, JDBC, JNDI, XML, web service usingIDEs likeEclipse.
  • Worked on multi-platform environment UNIX as well as Windows NT.
  • Software development skills in SCALA.
  • Implemented Unit Testing using JUNIT and Load Runner during the projects
  • Used Nutch search engine for data reading from websites
  • Used pig and Wukong to understand network graphs

TECHNICAL SKILLS:

Hadoop /Big Data: HDFS, MapReduce, Hive, Pig, HBase, Sqoop, Flume, Oozie,scala,spark,Nutch

ETL Tools: Informatica 8.6.1/8.5.1/7.1/6.2/5.1 (Power Center/Power Mart)

Data Modeling: Erwin 4.0/3.5, Star Schema Modelling, Snow Flake Modelling.

Databases: Oracle 11i/10g/9i/8i, MS SQL Server 2005/2000, DB2, Teradata v2r6/v2r5

SAP Tools: ECC 6.0, 4.X, BW3.X

OLAP Tools: Business Objects6.5/XI/R1/R2

Programming Lang: C,C++, JAVA, ASP, C# and .NET 3.5

Languages: SQL, PL/SQL, Unix Shell Script, Visual Basic

Tools: Toad, SQL* Loader

Operating Systems: Windows 2003/2000/NT, AIX, Sun Solaris, Linux

PROFESSIONAL EXPERIENCE:

Confidential, Piscataway, NJ

Hadoop developer

Responsibilities:

  • Responsible for building scalable distributed data solutions using Hadoop.
  • Responsible for cluster maintenance, adding and removing cluster nodes, cluster monitoring and troubleshooting, manage and review data backups, manage and review Hadoop log files.
  • Worked hands on with ETL process.
  • Worked on SCALA for managing collection and concurrency over java
  • Upgrading the Hadoop Cluster from CDH3 to CDH4 and setup High availability Cluster Integrate the HIVE with existing applications
  • Configured Ethernet bonding for all Nodes to double the network bandwidth
  • Handled importing of data from various data sources, performed transformations using Hive, MapReduce, loaded data into HDFS and Extracted the data from Teradata into HDFS using Sqoop.
  • Analyzed the data by performing Hive queries and running Pig scripts to know user behavior.
  • Continuous monitoring and managing the Hadoop cluster through Cloudera Manager.
  • Refactored Cassandra-access code, to allow either Hector or Thrift access, replacing the original thrift code interspersed throughout the application
  • Designed Hadoop jobs to verify chain-of-custody and look for fraud indications.
  • Prepared multi-cluster test harness on EC2 to exercise the system for performance and failover.
  • Used Hadoop Streaming to write jobs in a Python scripting language.
  • Expertise in writing Shell scripts to monitor Hadoop job.
  • Involved in ETL environment to push complex data into Hadoop and analysis.
  • Used Nutch search engine for data reading from websites
  • Used pig and Wukong to understand network graphs.

Environment: Hadoop, MapReduce, HDFS, Hive, Java, SQL, Cloudera Manager, Pig, Sqoop, Oozie, Storm, Flume,spark,scala,Thrift,zookeeper,SCALA, Cassandra

Confidential, Portland, OR

Hadoop Developer

Responsibilities:

  • Involved in business analysis and technical design sessions with business and technical staff to develop requirements document, and ETL specifications.
  • Installed and configured Hadoop MapReduce, HDFS and developed multiple MapReduce jobs in Java for data cleansing and preprocessing.
  • Involved in loading data from UNIX file system to HDFS.
  • Installed and configured Hive and also written Hive UDFs.
  • Wrote Hive Queries and PIG UDF’s.
  • Wrote MapReduce jobs.
  • Evaluated business requirements and prepared detailed specifications that follow project guidelines required to develop written programs.
  • Devised procedures that solve complex business problems with due considerations for hardware/software capacity and limitations, operating times and desired results.
  • Analyzed large amounts of data sets to determine optimal way to aggregate and report on it.
  • Provided quick response to ad hoc internal and external client requests for data and experienced in creating ad hoc reports.
  • Expert knowledge developing and debugging in Java/J2EE
  • Automated all the jobs starting from pulling the Data from different Data Sources like MySQL to pushing the result set Data to Hadoop Distributed File System using Sqoop.
  • Implemented Partitioning, Dynamic Partitions, Buckets in HIVE.

Environment: Hadoop MapReduce, HBase, Zookeeper, Hive, Pig, HDFS, Sqoop, Cassandra, Java, Web Services, HTML, Java Script, XML, XSL, XSLT, Storm, Flume.

Confidential, San Jose, CA

Java/J2EE Developer, Tester, Implementer

Responsibilities:

  • Enhance/maintain/develop the application built in Perl and Java according to the requirement by the customers.
  • Implementedcross cuttingconcerns as aspects at Service layer usingSpring AOP.
  • Involved in the implementation of DAO objects using spring - ORM.
  • Involved in the JMS Connection Pool and the implementation of publish and subscribe usingSpring JMS. Used JMS Template to publish andMessage Driven POJO (MDP)to subscribe from theJMSprovider.
  • Involved in creating theHibernate POJO’s and developedHibernate mapping Files.
  • UsedHibernate, object/relational-mapping (ORM) solution, technique of mapping data representation from MVC model to Oracle Relational data model with a SQL-based schema.
  • Used Ajax and JavaScript to handle asynchronous request, CSS to handle look and feel of the application.
  • Java also includes connection with the database.
  • A package is built of the modifications and is build, tested and then deployed on the production server.
  • The application is tested on the clustered environment of the test server before moving it to the production server (Win 2003).
  • Prepare the test cases for the new releases.
  • Unit testing and bug fixing. Discuss and fix production defects.

Environment: Java, MS SQL, OVSD, OVSC, REMEDY.

Confidential

Java Developer

Responsibilities:

  • I joined the project as a developer and worked my way up to being the Lead developer.
  • I have performed the role of single point of contact for the project for the clients.
  • Receive high level requirement and discuss requirements with client.
  • Propose best/alternative solutions in business terms to customer.
  • Discuss Technical feasibility for solutions proposed by customer.
  • Prepare and review design documents.
  • Coding and reviewing.
  • Wrote custom JavaScript functions for field validations.
  • Analyzed business requirements and submitted timely deliverables.
  • Implemented SAX Parser.

Environment: Java, JSP, Servlets.

Confidential

Java/SQL Programmer

Responsibilities:

  • Involved and Designed the System Development Life Cycle, i.e. such as conducting the preliminary investigation and technical specs.
  • Involved in technical and user level documentation and .
  • Involved in the generation of Forms & Reports.
  • Involved in creation of tables, writing stored procedures, Triggers, PL/SQL Libraries.
  • Performed Normalization and Logical Database Design Unit Testing of the modules.
  • Developed user-friendly forms for easy data entry.

Environment: java,Oracle 9i, SQL, PL/SQL, Erwin, Windows NT/XP/2000

We'd love your feedback!