Big Data/Hadoop Developer Resume
Pasadena, CA
SUMMARY:
- Results-driven software professional with 13 years of IT experience, including 2 years in Big Data ecosystem technologies.
- Experience with Hadoop and Hadoop ecosystem components such as HDFS, MapReduce, Pig, Hive, HBase, Sqoop, ZooKeeper, and Oozie.
- Extensive hands-on experience developing Hadoop architecture on Windows and Linux platforms.
- In-depth understanding of Hadoop architecture and its components, such as HDFS, JobTracker, TaskTracker, NameNode, DataNode, and MapReduce concepts.
- Experience writing Java MapReduce jobs and HiveQL queries for data architects; good knowledge of Python scripting.
- Good experience optimizing MapReduce algorithms using mappers, reducers, combiners, and partitioners to deliver the best results on large datasets.
- Set up standards and processes for Hadoop based application design and implementation.
- Performed data analytics using Pig and Hive for data scientists within the team. Extended Hive and Pig core functionality with custom UDFs.
- Good experience loading data from mainframe, Oracle, and MongoDB into HDFS using Sqoop (structured data) and Flume (log files and XML).
- Scheduled MapReduce jobs in the production environment using the Oozie scheduler.
- Worked with Hadoop administrators to set up Hadoop in various modes (pseudo-distributed and fully distributed) and to integrate Hadoop ecosystem components with Hadoop.
- Hands-on experience in application development using Java, RDBMS, and Linux shell scripting.
- Experience developing automation tools using JavaScript and HTML.
- Knowledge of Spark, Scala and Kafka.
- Worked with the product owner, Scrum master, and architect to develop use cases. Also worked with the Hadoop QA team to develop test plans, test scripts, and test environments, and to understand and resolve defects.
- Participate in code reviews, software design sessions, and architectural reviews of Hadoop ecosystems.
- Design and implement map reduce jobs to support distributed processing using Java, Hive and Pig.
- Build libraries, user defined functions, and frameworks around Hadoop Ecosystems.
- Research, evaluate, and utilize new technologies, tools, and frameworks around the Hadoop ecosystem.
- Defined and developed best practices in Hadoop.
- Experience working with software methodologies such as Agile Unified Process (AUP), Scrum, Rational Unified Process (RUP), and Waterfall.
- Domain experience in Energy & Utility (Advanced Meter Infrastructure, Smart Meter and Smart Grid), Banking (Credit Cards)/Financial, Retail and Telecom industries.
TECHNICAL SKILLS:
O/S: UNIX, Linux, OS/390, MVS, Windows 95/98/2000/NT/XP
Hadoop: Cloudera Hadoop CDH3/4, HDFS, MapReduce, Hive, Pig, Sqoop, Flume, Oozie, ZooKeeper
Languages: C/C++, Core Java, COBOL, SQL, and PL/SQL 2.X.
Scripting Languages: Python, JavaScript, UNIX Shell Script, VBScript
Web Technologies: HTML, Java, ASP, XML
Database: HBase, Oracle, NoSQL, DB2, Cassandra, MongoDB, SQL Server 2000
Packages: MS Office (Word, Excel, and PowerPoint)
Protocol: TCP/IP, HTTP
Other Tools: Rally, ALM, Bugzilla, Eclipse IDE
PROFESSIONAL EXPERIENCE:
Confidential, Pasadena, CA
Big Data/Hadoop Developer
Responsibilities:
- Gathered the business requirements from the Business Partners and Subject Matter Experts.
- Worked with Data Modeler and DBAs to build the data model and table structures.
- Actively participated in discussion sessions to design the ETL job flow.
- Worked with 10+ source systems and received batch files from heterogeneous systems such as UNIX, Windows, Oracle, mainframe, and DB2.
- Handled 20 TB of data on a 10-node cluster in the test environment.
- Held weekly meetings with technical collaborators and actively participated in code review sessions with senior and junior developers.
- Managed and reviewed Hadoop log files.
- Supported HBase architecture design with the Hadoop architect team to develop a database design in HDFS.
- Supported MapReduce programs running on the cluster and wrote MapReduce jobs using the Java API.
- Involved in HDFS maintenance and loading of structured and unstructured data.
- Imported data from mainframe datasets into HDFS using Sqoop. Also imported data from various sources (Oracle, DB2, Cassandra, and MongoDB) into Hadoop and performed transformations using Hive and MapReduce.
- Wrote Hive queries for data analysis to meet the business requirements.
- Wrote Pig Latin scripts and also developed UDFs for Pig Data Analysis.
- Developed scripts and batch jobs to schedule various Hadoop programs.
- Utilized Agile Scrum Methodology to help manage and organize a team of 4 developers with regular code review sessions.
- Upgraded the Hadoop cluster from CDH4 to CDH5 and set up a high-availability cluster to integrate Hive with existing applications.
- Analyzed data by running Hive queries and Pig scripts to understand user behavior.
- Continuously monitored and managed the Hadoop cluster through Cloudera Manager.
- Developed Hive queries to process the data and generate data cubes for visualization.
- Optimized the mappings using various optimization techniques and used the Debugger to test and fix existing mappings.
- Updated maps, sessions, and workflows as part of ETL changes; modified existing ETL code and documented the changes.
- Installed Oozie workflow engine to run multiple Hive and Pig jobs.
- Familiar with Scala, including closures, higher-order functions, and monads.
Environment: Hadoop, Java, MapReduce, HDFS, Hive, Pig, Linux, XML, Eclipse, Cloudera, CDH4/5 Distribution, DB2, SQL Server, Oracle 11i, MySQL
Confidential, San Dimas, CA
Sr. Analyst
Responsibilities:
- Interact with the Business team, Development team and other vendors to understand the Business Requirements, System Requirements etc.
- Provide the Estimation for all the phases of testing life cycle based on the requirements.
- Design the test scripts covering the requirements, negative testing and regression testing, and get approvals from the concerned groups.
- Create input data in the GSIS web page built using Java and XML.
- Perform the System Testing and Debug the Oracle (ORA) errors as part of the testing.
- Analyze the Oracle stored procedures and packages, complex SQL queries as part of the debugging activities.
- Execute shell scripts on the UNIX server to transfer files to the Oracle DB.
- Propose enhancements to improve the performance and reliability of the system, and test the enhancements after they are implemented.
- Analyze data in the XML file and edit the data to match the input need.
- Developed an automation tool using JavaScript to automate the manual buy-off process for the SOD application.
- Test the web services and the formatting of messages using SoapUI.
- Analyze the test results, verify database tables for valid data entries and validate the records as part of Back End Testing using TOAD.
- Review and analyze the QTP scripts and results, and analyze the JavaScript/HTML code.
- Perform security/proxy testing on different browsers using Burp Suite and validate the blacklist and encoding characters.
- Analyze and troubleshoot the database performance issues using complex SQL queries and recommend configuration or code changes to address the issues.
- Interact with offshore resources (ADPI) and review their deliverables.
- Conduct the Defect Meeting, and maintain the Defect tracker to have the updated status of the defects.
- Provide detailed snapshot and additional information to development team for the defects raised in Quality Center (QC), and help identify and resolve the design/development errors.
- Manage testers’ work throughout test plan development and test execution to ensure that testing is on time and within budget.
- Generate weekly progress reports, compare progress against the Confidential completion to identify schedule slippages, and present the consolidated status of execution during weekly project meetings.
- Prepare Documentation on possible database errors and steps to resolve.
Environment: Java, HTML, XML, WebSphere, Mainframe, Shell script, UNIX, Oracle 9i, TOAD, Quality Center V 10.0, QTP V 9.5, Burp Suite V 1.3
Confidential
Lead Analyst
Responsibilities:
- Review the Requirements for their Clarity, Conciseness, Completeness and Testability, and work with business team for clarifications.
- Define testing standards and procedures, and provide inputs for the Test Plan.
- Prepare Test Condition and Test script documents, conduct internal and external quality audits, facilitate walk-through meetings, and obtain approvals to baseline the test condition and test script documents.
- Ensure that the test team follows the testing standards, guidelines, processes, and testing methodology.
- Perform Smoke testing to ensure system test environment readiness for testing.
- Test Customer Service System (CSS) application to validate various functionalities like Provisioning, Commissioning, Interval data Validation, Estimation and Editing, Framing, Billing validations, Re-billing etc.
- Handle various testing requirements covering the database, files, screens, strings, etc.
- Execute the JCL Jobs in mainframe to receive the input feed from CSS.
- Execute the Shell scripts on UNIX server manually to perform the ETL Process as part of testing.
- Execute the SQL queries using PL/SQL Developer, validate the tables, perform the Database testing manually and analyze the XML file.
- Perform Regression testing to ensure existing functionality is not affected between builds or releases.
- Review and validate test results, raise defects, and assign defects to the appropriate vendor or internal development teams.
- Document and report testing progress and metrics to the Test Manager.
- Use Rational ClearCase for artifact management and version control to implement Confidential’s Software Configuration Management (SCM).
- Use JIRA and Rational ClearQuest for Defect management.
Environment: Siebel, SQL Server 2000, Shell script, UNIX, PL/SQL Developer, Mainframe, Rational ReqPro, Rational ClearCase, Rational ClearQuest, JIRA.
Confidential, Elk Grove Village, IL
Lead/Onsite Coordinator for UAT - Mainframe Applications
Responsibilities:
- Attend weekly project meetings starting from kick-off meetings to implementation.
- Analyze user requirement document and system requirement specification document and use cases for developing Test plan, Test cases and Test scenarios.
- Assign the functionalities (called ‘skill-blocks’ in FDR terminology) to the onsite and offshore resources based on their experience and availability, and update the TCAP (capacity planner) to show the resources’ engagement on the projects.
- Attend design walk-through with development teams.
- Prepare Test plan document, based on the requirements and designs with the inputs from all the impacted skill-block testers, and review the Test Plan with project team.
- Prepare the test plan in Rational Manual Tester and give testers guidelines for test-case preparation, such as standard naming conventions, requirement traceability, the level of detail needed for execution steps, and the maximum number of verification points per test case.
- Review the test-cases to make sure that the test coverage includes functional requirements, negative testing and regression testing.
- Arrange Test-case walk-through, update the issue-log with the comments from project team, follow-up with developers to close the issues, and get the approvals from the concerned groups.
- Review the Program Change Requests (PCR) and collect the impacts on effort and schedules and communicate the consolidated impact to the project manager.
- Gather the test data requirements and coordinate the effort with the appropriate skill-block testers.
- Promote the packages to the QA Environment using ChangeMan.
- Design, execute, and record results for manual test cases in SAT (System Acceptance Testing) and UAT (User Acceptance Testing).
- Execute queries and create reports using QMF.
- Prioritize the defects and raise the defects in Rational ClearQuest, retest the fixes and track the defects to closure.
- Organize the defect calls during test execution, and expedite the resolution.
- Provide inputs for lessons learned to the project manager and document best practices.
Environment: COBOL, CICS, JCL, DB2, QMF, VSAM, File-AID, ChangeMan, Rational Manual Tester, Rational ClearQuest, Rumba RoboTester.
Confidential, Denver, CO
Quality Analyst in Mainframe
Responsibilities:
- Attend weekly project meetings starting from kick-off meetings to implementation.
- Attend Requirement and Design document walk-throughs and work with business and development teams for clarifications in case of questions.
- Design the test scripts covering the requirements, negative testing and regression testing, and get approvals from the concerned groups.
- Review the PCRs (Program Change Requests) and make sure test scripts are built for all PCRs impacting the requirements.
- Perform the Regression testing to ensure the functioning of existing requirements is not affected by the new requirements.
- Prepare the User acceptance Test Data by creating the Test accounts and conditioning the accounts in the CICS Screens as per the test script input conditions.
- Test the customer service screens in CICS to verify the account number qualifies based on the conditions in the test scripts.
- Use SPUFI, an online query tool, to query the data based on the input conditions.
- Work with DB2, a relational database management system, and use SQL to retrieve data from DB2 databases.
- Build the Automation tool using MS Excel to condition the accounts for repeated scenarios.
- Use File-AID, a test data management tool, to perform file operations such as editing, comparing, and loading data.
- Perform system testing according to the prioritized test scripts so that major errors are caught before UAT begins.
- Perform UAT (User Acceptance Testing) and report the execution status using the Reporting tool WISS (Web Integrated Statistics System).
- Raise the defects using Rational ClearQuest, retest the fixes and track the defects to closure.
Environment: COBOL, CICS, JCL, DB2, SQL, SPUFI, QMF, VSAM, File-AID, Rational Manual Tester, Rational ClearQuest, Rumba RoboTester.
Confidential
Programmer Analyst in Mainframe
Responsibilities:
- Provide first on-call support for production batch jobs, including resolving issue tickets per the Service Level Agreement and following the escalation processes.
- Provide permanent solutions for resolved issue tickets to prevent their recurrence.
- Perform Impact analysis and provide estimates for the System Change Requests.
- Prepare program specifications and unit test plans; check out application programs and update JCL and COBOL programs per the requirements.
- Analyze data mappings using File-AID.
- Use ChangeMan to edit components and to compile and link-edit programs.
- Use CA7 Scheduler to schedule the Jobs by giving the Dependencies based on the Requirements.
- Use Abend-AID to debug abends.
- Interact with clients for clarifications to ensure quality deliverables on time.
- Move the code to the UAT region using ENDEVOR.
- Prepare standards, guidelines, Induction manuals for new team members and technical documentation that support the development and QA phases of SDLC.
- Perform Peer review, external quality assurance reviews and final inspection of the deliverables.
- Participate in enhancements and maintenance of COBOL programs.
- Analyze the existing system and propose changes for improving the existing processes and the system performance.
- Help others and provide on-the-job training or guidance.
- Conduct Quality reviews, Technical Documentation and User manual preparation.
Environment: COBOL, JCL, DB2, CA7, QMF, VSAM, File-AID, ENDEVOR, XPEDITER.
