We provide IT Staff Augmentation Services!

Hadoop Developer Resume

0/5 (Submit Your Rating)

Hartford, CT

SUMMARY

  • Over 7 years of strong experience in the IT industry that includes 3+ years as a Hadoop Developer in domains like financial services and Insurance. Maintained positive communications and working relationship at all levels. An enthusiastic and goal - oriented team player possessing excellent communication, interpersonal skills with good work ethics
  • Expertise inHadoopeco system components HDFS, Map Reduce, Yarn, HBase, Pig, Sqoop and Hive for scalability, distributed computing and high performance computing.
  • Experience in using Hive Query Language for data Analytics.
  • Experienced in Installing, Maintaining and Configuring Hadoop Cluster.
  • Strong knowledge on creating and monitoring Hadoop clusters on EC2, VM, CDH3, CDH4 Cloudera Manager on Linux, Ubuntu OS etc.
  • Capable of processing large sets of structured, semi-structured and unstructured data and supporting systems application architecture.
  • Having Good knowledge on Single node and Multinode Cluster Configurations.
  • Strong knowledge in NOSQL column oriented databases like HBase and its integration with Hadoop cluster.
  • Very Good understanding of SQL, ETL and Data Warehousing Technologies
  • Knowledge of MS SQL Server 2012/2008/2005 and Oracle 11g/10g/9i and E-Business Suite.
  • Expert in TSQL, creating and using Stored Procedures, Views, User Defined Functions, implementing Business Intelligence solutions using SQL Server 2000/2005/2008.
  • Developed Web-Services module for integration using SOAP and REST.
  • NoSQL database experience on Hbase, Cassandra
  • Flexible with Unix/Linux and Windows Environments working with Operating Systems like Centos 5/6, Ubuntu 13/14, Cosmos.
  • Knowledge of java virtual machines (JVM) and multithreaded processing.
  • Strong programming skills in designing and implementation of applications using Core Java, J2EE, JDBC, JSP, HTML, Spring Framework, Spring batch framework, Spring AOP, Struts, JavaScript, Servlets.
  • Java Developer with extensive experience on various Java Libraries, API's and frameworks.
  • Hands on development experience with RDBMS, including writing complex SQL queries, Stored procedure and triggers.
  • Have sound knowledge on designing data warehousing applications with using Tools like Teradata, Oracle and SQL Server.
  • Experience in working with job scheduler like Autosys and Maestro.
  • Strong in databases like Sybase, DB2, Oracle, MS SQL.
  • Strong understanding of Agile Scrum and Waterfall SDLC methodologies.
  • Strong communication, collaboration & team building skills with proficiency at grasping new Technical concepts quickly and utilizing them in a productive manner.
  • Adept in analyzing information system needs, evaluating end-user requirements, custom designing solutions and troubleshooting information systems.
  • Team player, hardworking and committed individual. Ability to handle multiple tasks.

TECHNICAL SKILLS

Big Data: HDFS, Hive, Pig, HBase, Sqoop,mahout, Hadoop components (JT, TT, ZK)

Languages: Java, C/C++, Python, XML Shell scripting, Python, COBOL

Web Technologies: HTML, XML, JavaScript, J query, AJAX

Frame works: Spring, Hibernate, Struts

Servers: IBM WebSphere, WebLogic, Tomcat, and Redhat Satellite Server

Version Control: CVS, Tortoise SVN

Database: Oracle, DB2, MS-SQL Server, MySQL, MS-Access

Operating Systems: Windows 95/98/2000/XP/Vista/7/9, Macintosh, Unix.

PROFESSIONAL EXPERIENCE

Confidential, Hartford, CT

Hadoop Developer

Responsibilities:

  • Installed and configured Hadoop MapReduce, HDFS, Developed multiple MapReduce jobs in java for data cleaning and preprocessing.
  • Written MapReduce code to process and parsing the data from various sources and storing parsed data into HBase and Hive using HBase-Hive Integration.
  • Worked on moving all log files generated from various sources to HDFS for further processing.
  • Developed workflows using custom MapReduce, Pig, Hive and Sqoop.
  • Tuned the cluster for optimal performance to process these large data sets.
  • Worked hands on with ETL process. Handled importing data from various data sources, performed transformations
  • Developed Cloud computing using Storm framework.
  • Built reusable Hive UDF libraries for business requirements which enabled users to use these UDF's in Hive Querying.
  • Written Hive UDF to sort Structure fields and return complex data type.
  • Responsible for loading data from UNIX file system to HDFS.
  • Developed suit of Unit Test Cases for Mapper, Reducer and Driver classes using MR Testing library.
  • Designed and developed a distributed processing system running to process binary files in parallel and crunch the analysis metrics into a Data Warehousing platform for reporting.
  • Rewrite the hadoop map reduce code using the apache crunch framework
  • Developed workflow in Control M to automate tasks of loading data into HDFS and preprocessing with PIG.
  • Used Maven extensively for building jar files of MapReduce programs and deployed to Cluster.
  • Modelled Hive partitions extensively for data separation and faster data processing and followed Pig and Hive best practices for tuning.

Environment: Hive QL, MySQL, HBase, HDFS, HIVE, Eclipse (Kepler), Hadoop, Oracle 11g, PL/SQL, SQL*PLUS, Toad 9.6, Flume, PIG, Sqoop, Unix, Cosmos.

Confidential, IL

Hadoop Developer /Admin

Responsibilities:

  • Solid Understanding of Hadoop HDFS, Map-Reduce and other Eco-System Projects
  • Installation and Configuration of Hadoop Cluster
  • Working with Cloudera Support Team to Fine tune Cluster
  • Working Closely with SA Team to make sure all hardware and software is properly setup for Optimum usage of resources
  • Developed a custom File System plugin for Hadoop so it can access files on Hitachi Data Platform
  • Plugin allows Hadoop MapReduce programs, HBase, Pig and Hive to work unmodified and access files directly.
  • The plugin also provided data locality for Hadoop across host nodes and virtual machines wrote data ingesters and map reduce programs
  • Developed map Reduce jobs to analyze data and provide heuristics reports
  • Good experience in writing data ingesters and complex MapReduce jobs in java for data cleaning and preprocessing and fine tuning them as per data sets
  • Extensive data validation using HIVE and also written Hive UDFs
  • Involved in creating Hive tables loading with data and writing hive queries which will run internally in map reduce way lots of scripting (python and shell) to provision and spin up virtualized Hadoop clusters
  • Adding, Decommissioning and rebalancing nodes
  • Created POC to store Server Log data into Cassandra to identify System Alert Metrics
  • Rack Aware Configuration
  • Configuring Client Machines
  • Configuring, Monitoring and Management Tools
  • HDFS Support and Maintenance
  • Cluster HA Setup
  • Applying Patches and Perform Version Upgrades
  • Incident Management, Problem Management and Change Management
  • Performance Management and Reporting
  • Recover from Name Node failures
  • Schedule Map Reduce Jobs - FIFO and FAIR share
  • Installation and Configuration of other Open Source Software like Pig, Hive, HBASE, Flume and Sqoop
  • Integration with RDBMS using sqoop and JDBC Connectors
  • Working with Dev Team to tune Job Knowledge of Writing Hive Jobs

ENVIRONMENT: Windows 7, UNIX, Linux Java, Apache HDFS Map Reduce, Pig, Hive, HBase, Flume, Sqoop, Cassandra, NOSQL

Confidential, CA

Hadoop Developer

Responsibilities:

  • Loading the data from the different Data sources like (Teradata and DB2) into HDFS using sqoop and load into Hive tables, which are partitioned.
  • Developed Hive UDF’s to bring all the customers email id into a structured format.
  • Developed bash scripts to bring the Tlog files from ftp server and then processing it to load into hive tables.
  • All the bash scripts are scheduled using Resource Manager Scheduler.
  • Developed Oozie Workflows for daily incremental loads, which gets data from Teradata and then imported into hive tables.
  • Developed pig scripts to transform the data into structured format and it are automated through oozie coordinators.
  • Developed Hive queries for Analysis across different banners.

Environment: Windows 7, Hadoop, HDFS, MapReduce, Sqoop, Hive, pig, Teradata, DB2, Oozie, MySQL, Eclipse

Confidential, Santa Clara, CA

Senior Software Engineer

Responsibilities:

  • Participate in project planning sessions with business analysts and team members to analyze business IT Requirements and translate business requirements into working model.
  • Involved in Planning, Defining and Designing data base on business requirement and provided documentation.
  • Involve in Initial designing and creating Use case diagrams, Sequence Diagrams and class diagrams using the MS Visio Tool.
  • Develop Java application using Java Spring framework.
  • Developed dynamic ASPX web pages for application using ASP.NET, C#, XML, HTML and JavaScript.
  • Wrote complex SQL queries using joins, sub queries and correlated sub queries to retrieve data from the database.
  • Created/Updated database objects like tables, views, stored procedures, function, packages
  • Used JavaScript and struts validation framework for performing front end validations.
  • Develop DAO design pattern for hiding the access to data source objects.
  • Use Hibernate framework for the backend persistence
  • Use Eclipse as IDE tool to develop the application and JIRA for bug and issue tracking
  • Use CVS for software configuration management and version control
  • Optimized and modified the triggers, complex store functions, procedures, user base data type etc.
  • Added methods for performing CRUD operations in applications using JDBC and wrote several SQL queries.
  • Responsible for the dealing with the problems, bug fixing and troubleshooting.
  • Developing Oracle PL/SQL stored procedures, Functions, Packages, SQL scripts to facilitate the functionality for various modules.

Environment: Java, J2EE, EJB 1.1, JSF, XML, JDBC, Oracle 9i, Log4J 1.2.,PL/SQL Developer, REST framework, C#, .NET Framework 3.0, Spring framework.

Confidential

UI Designer, Web Designer

Responsibilities:

  • Developed the application under JEE architecture, developed, Designed dynamic and browser compatible user interfaces using JSP, Custom Tags, HTML, CSS, and JavaScript.
  • Developed Application Servers persistence layer using JDBC and SQL.
  • Used JDBC to connect the web applications to Databases.
  • Implemented Test First unit testing framework driven using Junit.
  • Developed and utilized J2EE Services and JMS components for messaging in Web Logic.
  • Configured development environment using Web logic application server for developers integration testing.

Environment: Windows XP, Java/J2EE, SQL, Oracle 10g, JSP 2.0, AJAX, Java Script, Web Logic 8.0, HTML, JDBC

Confidential

Jr Developer

Responsibilities:

  • Involved in understanding the functional specifications of the project.
  • Involved in developing JSP pages for the web tier and validating the client data using JavaScript.
  • Developed connection components using JDBC.
  • Designed Screens using HTML and images.
  • Cascading Style Sheet (CSS) was used to maintain uniform look across different pages.
  • Involved in creating Unit Test plans and executing the same.
  • Deployed web modules in Tomcat web server.

Environment: Java, JSP, J2EE, Servlets, Java Beans, HTML, JavaScript, JDeveloper, Apache Tomcat, Webserver, Oracle, JDBC, XML.

We'd love your feedback!