Hadoop Developer Resume
Dublin, OH
SUMMARY
- Software Analysis, Design, Development, debugging and implementation which includes 2+ experience of using Hadoop Eco - System
- In-depth knowledge of Hadoop Architecture, HDFS, Job Tracker, Task Tracker, Name Node, Data Node and Map-Reduce programming paradigm.
- Experience in installing, configuring and using ecosystem components like Hadoop MapReduce, HDFS, HBase, Zoo Keeper, Oozie, Hive, Cassandra, Sqoop, Pig, Flume, and Avro.
- Analyzing data using Pig Latin, HiveQL, HBase and custom MapReduce programs in Java.
- Extending Hive and Pig core functionality by writing custom UDFs.
- Good knowledge in using job scheduling and monitoring tools like Oozie and ZooKeeper.
- Background with traditional databases such as Oracle, SQL Server, and ETL tools / processes.
- Hands on experience in application development using Java, RDBMS, and Linux shell scripting.
- Fluent with the core Java concepts like I/O, Multi-threading, Exceptions, Regular Expressions, Collections, Data-structures and Serialization.
- Experience in Java, JSP, Servlets, JDBC, RMI, Java Script, JQuery, XML, and HTML.
- Knowledge of Agile Software Development using Scrum process.
- Co-ordination with software development team, project implementation, analysis, technical support, data Conversion and deployment.
- Worked extensively on Hardware capacity management and procurement process.
- Skilled technologist with deep expertise in aligning solutions against strategic roadmaps to support business goals.
- Expert communicator, influencer, and motivator who speaks the language of business and technology, resulting in excellent rapport with fellow employees, peers and executive management.
- Excellent leadership, interpersonal, problem solving and time management skills.
- Excellent communication skills, both written (documentation) and verbal (presentation).
- Very responsible and good team player. Can work independently with minimal supervision.
TECHNICAL SKILLS
Hadoop Ecosystem: Hadoop, MapReduce, HDFS, HBase, Zookeeper, Hive, Pig, Sqoop, Cassandra, Oozie, Flume, Tableau, Hue, Cloudera Manager.
Database: Oracle 10g, DB2, MySQL, Oracle, MS SQL Server 2005, Derby, MS Access.
Languages: C, C++, SQL, PL/SQL, JAVA, Shell, Mat lab.
Web Tools: HTML, Java Script, XML, Schemas, VMware.
Tools: SQL developer, Eclipse, JUnit.
IDE / Testing Tools: NetBeans, Eclipse.
Operating System: Windows, Linux, UNIX.
PROFESSIONAL EXPERIENCE
Confidential, Dublin OH
Hadoop Developer
Responsibilities:
- Designed and developed MapReduce programs and tested using MRUnit.
- Managing and scheduling Jobs on a Hadoop cluster using Oozie.
- Involved in moving all log files generated from various sources to HDFS for further processing through Flume.
- Involved in loading data from UNIX file system to HDFS.
- Worked on Hue interface for querying the data.
- Created Hive tables to store the processed results in a tabular format.
- Involved in designing the Hive table partitioning (Yearly partitioning, Monthly partitioning).
- Created HBase tables to store variable data formats of data coming from different portfolios.
- Involved in transforming data from Mainframe tables to HDFS, and HBASE tables using Sqoop.
- Implemented best income logic using Pig scripts.
- Designed and Developed UDF for Pig Scripts to organize and join the invoice and product data.
- Designed and developed UDF for Hive Scripts to handle the data and business logic.
- Implemented test scripts to support test driven development and continuous integration.
- Responsible to manage data coming from different sources.
- Experienced on loading and transforming of large sets of structured and semi structured data.
- Exported the analyzed data to the relational databases using Sqoop for visualization and to generate reports for the BI team.
- Analyzed large amounts of data sets to determine optimal way to aggregate and report on it.
- Participate in requirement gathering and analysis phase of the project in documenting the business requirements by conducting workshops/meetings with various business users.
Environment: Hadoop, MapReduce, HDFS, Hive, Pig, Hue, Oozie, Core Java, Eclipse, HBase, Flume, Cloudera Manager, Toad, Putty, Windows NT, UNIX Shell Scripting.
Confidential, Philadelphia, PA
Hadoop Developer
Responsibilities:
- Responsible for developing efficient MapReduce programs for more than 20 years’ worth of claim data to detect and separate fraudulent claims.
- Uploaded and processed more than 30 terabytes of data from various structured and unstructured sources into HDFS using Sqoop and Flume.
- Played a key-role is setting up a 40 node Hadoop cluster utilizing Apache Hadoop by working closely with the Hadoop Administration team.
- Worked with the advanced analytics team to design fraud detection algorithms and then developed MapReduce programs to efficiently run the algorithm on the huge datasets.
- Developed MapReduce programs to perform data scrubbing for unstructured data.
- Responsible for designing and managing the Sqoop jobs that uploaded the data from Oracle to HDFS and Hive.
- Helped in troubleshooting Hive problems while working with Micro Strategy to produce illustrative reports and dashboards along with ad-hoc analysis.
- Used Flume to collect the logs data with error messages across the cluster.
- Designed and Maintained Oozie workflows to manage the flow of jobs in the cluster.
- Played a key role in installation and configuration of the various Hadoop ecosystem tools such as Sqoop, Flume, Pig and HBase.
- Actively updated the upper management with daily updates on the progress of project that include the classification levels that were achieved on the data.
Environment: Java, Hadoop, Hive, Pig, Sqoop, Flume, HBase, Oozie, Tableau, Oracle 10g
Confidential, Waltham, MA
JAVA/J2EE Developer
Responsibilities:
- Study and analysis of business and functional requirements
- Worked as a Component Developer to code in Java and J2EE technologies
- Created UI using JSPs and JSF based framework.
- Used XML Beans to build Java objects from XML Schema
- Designed and developed the Application based on J2EE Architecture for server side on Spring MVC Framework
- Developed XML files for Object Relation Mapping (ORM) and Hibernate Configuration.
- Used SQL Queries and Stored Procedures extensively in retrieving the contents from MySQL
- Maintained documentation using Javadoc
- Provided extensive pre-delivery support using Bug Fixing and Code Reviews
- Unit testing was done using JUnit
- Extensive usage of XML throughout modules
- Extensively used the log4j for logging
- Used CVS for code repository and as version control mechanism
- Used Oracle 10gas a DB Server and involved in writing stored procedures and SQL statements for data retrieve, Created test cases and Involved in Unit Testing
Environment: Java, J2EE (JSP, Java beans, DAO),Eclipse, Apache Tomcat Server, Spring MVC, MySQL.
Confidential, Baltimore, MD
J2EE Consultant
Responsibilities:
- Integrated with external systems throughWeb Servicesthat includes a connector module, which acts as an abstract layer to the web services layer.
- Involved in different Web module components and use cases like User Enrolment and Profile Management, Login and Authentication, Accounts, Statements, Payment, Preferences.
- Mainly worked in presenting data using eBillingreporting Moduleby configuring the report in the report XML.
- Coding of UI components likeServlets, jsp, Java Script and Strutscomponents like Action classes, Action forms, XML configuration files.
- Designed and developed lightweightPOJObased application components of One source application usingSpring Framework.
- Vast experience designing and developing Enterprise Applications usingEJB, SERVLETS, JSP’s, XML, STRUTSand Spring.
- Successfully able to validate the customer credentials and other information withStruts Validator Frameworkdisplayed the error messages if any.
- Implemented theXMLbased Digital signatures to improve the web servicessecurity.
- Involved in testing test suite class using JUnit.
- Created JMS Resource, Module, Queue for e-Billing.
- Configured Log4j using the Property file or XML file externally.
Environment: Java (JDK 1.5.x), HTML,XML, JavaScript, Oracle SQL Developer, Apache Log4J, Apache Struts, Spring Framework, JUnit.
Confidential
Java/J2EE Developer
Responsibilities:
- Involved in analyzing the client requirements and convert them into technical specifications
- Worked in Analysis, Design and Coding for client developmentusingJ2EE stack using Eclipse platform.
- Involved in creating web-based java components like client Applets and client side UI using JFC in Eclipse
- Developed PL/SQL stored procedures to perform complex database operations.
- Designed and developed SQL queries in the application
- Developed Design documents for various components identified in the system.
- Generated the Hibernate XML and Java Mappings for the schemas
- Extensively used Core Java, Servlets, JSP and XML.
- Generated the Hibernate XML and Java Mappings for the schemas.
- Used Subversion as the version control system
- Transactional Data Analysis in Matlab.
Environment: Java/J2EE, Oracle 10g, SQL, PL/SQL, JSP, EJB, Struts, Hibernate, WebLogic 8.0, HTML, Java Script, JDBC, XML, JUnit, log4j.