Sr. Hadoop Developer Resume

NY

SUMMARY

  • 16+ years of experience in the design and development of software applications across the full software development life cycle, including system analysis, functional design, program design, debugging, testing, implementation, maintenance, production support, and documentation.
  • 5+ years of extensive experience in Big Data Analytics and Hadoop ecosystems.
  • Experience designing and implementing dimensional data models that scale across an enterprise business.
  • Experience developing, implementing, and refining data engineering solutions for large volumes of data.
  • Experience building scalable ELT/ETL workflows to transform and integrate data into structures conducive to reporting and analytics in Hadoop.
  • Professional experience in Java, the Hadoop ecosystem (Spark, Scala, Hive, Impala, MapReduce, Pig, Sqoop, HBase, and Oozie), UNIX, shell scripting, and NoSQL.
  • Expertise in building MapReduce algorithms using design patterns.
  • Extensive experience designing and developing Big Data projects from the ground up, as well as migrating existing applications into the Big Data space.
  • Proven ability to write Hive queries in Hive Query Language (HiveQL) to generate reports.
  • Experience with Spark and Scala.
  • Working knowledge of Oozie, a workflow scheduler for managing Apache Hadoop jobs.
  • Experience with Pig Latin, the data-flow scripting language of Apache Pig, for processing data stored in the Hadoop Distributed File System (HDFS).
  • Experience with the NoSQL database HBase (a column-family store).
  • Extended Hive and Pig core functionality with custom User Defined Functions (UDFs) and User Defined Aggregate Functions (UDAFs).
  • Hands-on development experience with RDBMS, including writing complex SQL queries, views, and stored procedures.
  • Knowledge of the Kafka messaging system.
  • Basic knowledge of UNIX, shell scripting, and R.
  • Experience with software development planning, estimation, data mapping for migration, implementation, testing, and documentation of technical specifications and requirements.
  • Hands-on experience implementing a data warehouse on a production Hadoop cluster.
  • Experience working in all phases of the software engineering cycle (i.e., analysis, design, coding, testing, implementation, and maintenance).
  • Strong coding, debugging, design and analysis skills.
  • Experience in developing CICS web services programs for web applications.
  • Solid experience in migration and conversion, maintenance, and development projects in the healthcare and security domains.
  • Excellent communication, interpersonal, and problem-solving skills.

TECHNICAL SKILLS

Big Data: Hadoop 1.0, Hadoop 1.2.x, Hive, Pig, Sqoop, HBase, Oozie, Zookeeper, Design Patterns

Analytics & Machine Learning: R, EDA and Machine Learning Techniques

Languages: Scala, R, JAVA, SQL, FORTRAN, 1.x, J2EE 2.1, JSP 1.4, Servlets 2.1, EJB 2.0, JSF, WebSphere 6.X, JSR 168 Portlets, COBOL, JCL, Easytrieve

Databases: VSAM, ISAM, DB2, IMS DB

Middleware: MQ Series, Kafka

Operating Systems: Unix, Linux, MS-DOS, Windows 2000/NT, Windows 98/XP

Tools/Utilities: Eclipse, Git, Jenkins, Nexus, Nolio, TOAD, CHANGEMAN, Control-M, Infopac, CA7, NDM, FTP, SPUFI, QMF, TSO/ISPF, FILE-AID, XPEDITOR, SyncSort, IEBGENER, IEFBR14, BMP, LIBRARIAN, INSYNC, StarTool, SDF, PkZip, RAD 7.0, WID 6.X, SVN, Cloudera.

PROFESSIONAL EXPERIENCE

Confidential, NY

Sr. Hadoop Developer

Environment: Hadoop 1.2.x, Spark, Scala, Hive, Impala, HBase, Java, Linux, SQL, Eclipse, Git, Jenkins, Nexus, Teradata, Zookeeper, Cloudera cluster, Hue.

Responsibilities:

  • Involved in business discussions with data owners and business users.
  • Participated in data model design.
  • Led Hadoop platform ETL design.
  • Led Hadoop ETL development, including data acquisition from golden sources and processing and transforming the data into the analytical layer.
  • Designed horizontal frameworks that could be used across all Hadoop clusters.
  • Point of contact for SIT/UAT defects.

Confidential

Develop scalable ELT/ETL

Environment: Hadoop 1.2.x, Hive, Oozie, Java, XML, Linux, SQL, DB2, MySQL, Eclipse, Git, Jenkins, Nexus, Nolio, Teradata, Zookeeper, Cloudera cluster, Hue.

Responsibilities:

  • Participate in all aspects of Big Data solution delivery life cycle including analysis, design, development, testing, production deployment, and support.
  • Play a leading role in the development of ETL in Hadoop.
  • Create high level design document for ETL pipeline.
  • Develop Landing/Staging/Enterprise layer and Analytical layer in Hadoop.
  • Develop scalable ETL workflows to transform and integrate data into structures conducive to reporting and analytics.
  • Develop surrogate keys and Slowly Changing Dimension Type 2 (SCD2) handling for dimension tables.
  • Export data from Hadoop to Teradata using Sqoop.
  • Develop a UDF to validate seller IDs.
  • Set up a Git repository to maintain versions of the code.
  • Prepare deployment documentation for production release.
  • Conduct process/workflow demos to SIT, UAT and Application Support teams.

Confidential, NJ

Sr. Hadoop Developer

Environment: Hadoop 1.0, Hadoop 1.2.x, Hive, Pig, Oozie, R, RHIPE, RHADOOP, Design Patterns, Map Reduce, Java, XML, Linux, SQL, DB2, MySQL, Eclipse

Responsibilities:

  • Provided accurate and timely estimates for business requirements.
  • Created high-level design documents for approved business requirements.
  • Led walkthroughs of design documents with stakeholders.
  • Imported data from DB2 into HDFS using Sqoop to analyze customer and order entry data with Hive and Pig.
  • Created Hive partitioned tables based on LOB (Line of Business) and year, and loaded these tables daily in the batch process.
  • Created Pig and Hive scripts to generate reports for users.
  • Developed an HBase table with column families for customer details and invoice details, and loaded the invoice data.
  • Developed MapReduce functions to encrypt and decrypt sensitive customer data.
  • Created UDFs to calculate the pending payment for a given Residential or Small Business customer, and used them in Pig and Hive scripts.
  • Developed UDFs to calculate credits and pending payments for Commercial and National accounts, and used them in Pig and Hive scripts.
  • Performed text mining and sentiment analysis on social media data to understand customer emotion and polarity.
  • Clustered social media comments to identify frequently discussed topics about the organization, providing insights to management for better decision making.
  • Wrote MapReduce jobs using RHIPE to process large volumes of social media data and run computations on it.

Confidential, NJ

Sr. Programmer Analyst

Environment: IBM WebSphere Portal Server 6.0/7.0, WebSphere Process Server 6.1/7.0, WebSphere Message Broker 6/7.0, RAD 7/8, Integration Developer 6/7, IBM DB2 Content Manager 8.4, IBM DB2 9.

Responsibilities:

  • Identified and analyzed user requirements and participated in design, development, testing, and deployment.
  • Responsible for creating artifacts such as class diagrams, sequence diagrams, and use case diagrams for all the use cases.
  • Involved in lifecycle development of the business tier of a large-scale, complex SOA-based application.
  • Used IBM RAD in developing components.
  • Integrated the J2EE enterprise application with the legacy mainframe using WebSphere Message Broker.
  • Participated in the design and development of Message Flows, Message Sets, and Definitions for legacy integration based on WSDL.
  • Involved in development of a custom validation framework for global and field-level validations, which relieves individual developers of handling validation themselves.
  • Worked closely with deployment teams during testing and production deployments of the application.

Confidential, GA

Sr. Programmer Analyst

Environment: OS/390, VS COBOL II, CICS, JCL, VSAM, MQSeries, CHANGEMAN, XPEDITOR, EASYTRIEVE, INFOPAC, ABEND-AID.

Responsibilities:

  • Gathered requirements and prepared high-level specs for system requests received from the business and end users.
  • Analyzed business and system requirements, including impact analysis on existing systems, translated them into business system design and technical design documents, and provided estimates for completing each task from design through implementation into production.
  • Scheduled weekly meetings with key users and business analysts to provide status.
  • Participated in meetings with other functional teams to understand requirements and develop the external design document.
  • Unit tested the external interface input file process to verify the data flow between programs and the database.
  • Covered all stages of the software development life cycle (SDLC): design, specification, coding, debugging, testing (test plan and test execution), integration and system testing, documentation, and maintenance of the design documents and programs.
  • Managed version control of deliverables from the offshore team, tested the code by preparing test plans and test cases, and delivered it to Services (Business Analysts).
  • Fixed daily production and test problems faced by users (Regions) and business analysts.
  • Supported the test cycle and parallel cycles daily.
  • Investigated production issues and provided permanent solutions.
  • Completed assigned tasks and on-request jobs in a timely manner.
  • Answered questions raised by the offshore team on service requests and other matters.
  • Attended meetings with business analysts to gather further input on system requests and report the status of ongoing requests.
  • Prepared weekly and monthly status reports for the manager.
  • Participated actively in the knowledge transfer phase on healthcare systems.
  • Prepared PowerPoint presentations on the acquired knowledge and shared them with the onsite and offshore teams.

Confidential, GA

Programmer Analyst

Environment: OS/390, VS COBOL II, CICS, JCL, VSAM, DB2, CHANGEMAN, XPEDITOR, EASYTRIEVE, INFOPAC, ABEND-AID.

Responsibilities:

  • Responsible for development and maintenance of COBOL, DB2, and VSAM programs.
  • Responsible for developing and supporting batch programs and applications.
  • Responsible for enhancement of batch programs for new business expansion.
  • Responsible for on-call support of batch applications for the Q/Care application.
  • Loading data from production to test environments to test COBOL II, DB2, and VSAM programs.
  • Attending to day-to-day operational problems.
  • Attending teleconferences with the onsite team and client.
  • Holding discussions with the team and team leader to resolve their issues so that project deliverables and schedules met client expectations.
  • Preparation of weekly and monthly status reports.
  • Collecting metrics, analyzing defects, and taking necessary actions.
  • Preparation of Project Plan, SQA Plan, and SCM Plan.
  • Preparation of monthly PowerPoint project status presentations for the project head.
  • Review of tasks and change requests for all modules before delivery to onsite.
  • Participating in code reviews and code walkthroughs of new and modified programs.
  • Preparation of requirement documents for enhancement activities.
  • Preparation of approach documents, impact analysis, LLD, UTP, and UTR for major enhancements.
  • Preparation of Unit Test Plans and Unit Test Reports for the above applications.
  • Review of code walkthroughs, Unit Test Plans, and Unit Test Results.
  • Preparation of integration and regression
  • Conducting unit, integration, and regression testing and documenting the test cases with test results for the above applications.
  • Testing applications and delivering them to the client.
