Etl Developer Resume
Phoenix, AZ
SUMMARY
- 5 years of experience working in ETL methodologies for supporting data extraction, transformation, loading using IBM Infosphere DataStage and Quality Stage.
- Worked on designing Parallel jobs using various stages like Join, Merge, Lookup, remove duplicates, Filter, Dataset, Lookup file set, Complex flat file, Modify, Aggregator, XML.
- Worked with Data Stage Manager, Designer, Administrator, and Director.
- Experience in analyzing the data generated by the business process, defining the granularity, source to target mapping of the data elements, creating Indexes and staging tables for the data warehouse design and development.
- Troubleshooting of Data Stage jobs and addressing production issues for performance tuning and enhancements.
- Have experience in designing and scheduling jobs using TWSd workload scheduler, Control - M scheduling tool
- Proficient in writing, implementation and testing of triggers, procedures and functions in PL/SQL and Oracle.
- Expertise in UNIX shell scripts for the automation of processes and scheduling the Data Stage jobs using wrappers.
- Experience in using software configuration management tools like Git Hub for version control and using CI/CD pipelines for Deploying the DataStage, Oracle and Unix objects.
- Expert in unit testing, system integration testing, implementation and maintenance of databases jobs.
- Worked in Agile - Scrum project environment.
- Good exposure to Informatica Power Center and Ab-Initio ETL tools
TECHNICAL SKILLS
Data Warehousing: IBM Infosphere DataStage 11.7/11.5/8.7/8.5, Informatica PowerCenter 10/9.6/9.1
Data Modelling: Dimensional Data Modeling, Physical and Logical Data Modelling
Reporting Tools: Tableau
Programming: Scala, SQL, PL/SQL
Databases: Oracle 10g/9i/8i/8/7.x, IBM DB2, Teradata, Hive, HBASE, Data Lake
Project Management Tools: Jira, Microsoft Project, CA Agile Rally
Cloud Technologies: Snowflake, AWS Analytics Services - Kinesis, S3, Dynamo DB, Lambda, GlueEMR, Athena, Redshift
PROFESSIONAL EXPERIENCE
Confidential, Phoenix, AZ
ETL Developer
Responsibilities:
- Involved as primary on-site ETL Developer during the analysis, planning, design, development, and implementation stages of projects using IBM Infosphere DataStage.
- Developed highly configurable/reusable parameterized DataStage jobs using DataStage Designer to receive the Medicaid members enrollment data from multiple States departments and load into the Facets claims processing platforms core back-end tables with required business transformations.
- Build DataStage jobs to prepare daily change/full outbound member eligibility extracts to multiple vendors (Vision, Dental, Pharmacy, Lab etc.,)
- Involved in all phases of SDLC, created Analysis documents with Source to target mappings.
- Worked with EDI X12 834 Enrollment files to understand the loops and segments by referring to the respective state companion guides.
- Built DataStage parallel jobs using different stages like Join, Lookup, Transformer, Funnel, Sequential File, Aggregator, sort, Remove Duplicates, Filter, Data Set.
- Designed and Deployed Work/stage tables to load the raw data to use them while performing data transformations.
- Created shell script to run data stage jobs from UNIX and then schedule this script to run data stage jobs through scheduling tool.
- Created Job sequences and scheduled them using TWSd workload scheduler by gathering the file frequency from the business.
- Experienced in importing and exporting data between HDFS and Relational Database Management systems using Sqoop.
- Good exposure to Big Data Eco system - HDFS, MAPR and Spark, Scala, Hive for big data querying and HBase for control table updates.
- Performed performance tuning of the jobs by interpreting performance statistics of the jobs developed.
- Documented ETL test plans, test cases and validations based on design specifications for unit testing, system testing, functional testing, prepared test data for unit testing, error handling and analysis.
- Provided technical support to both business and user departments during the UAT testing.
Environment: Facets, IBM Infosphere DataStage 11.7/11.3(Designer, Director) SQL, Oracle 11g, Hive, HBase, PL/SQL, Unix, Shell Scripting, TWSd Scheduler, Jenkins, Git Hub, Air Flow, XL Deploy, Sqoop, Spark, Scala, HDFS, MapR
Confidential, Plano, TX
ETL Developer
Responsibilities:
- Prepared lineage and mapping documents for the existing ETL processes for data mapping.
- Worked with Informatica Tools- Mapping Designer, Workflow Designer, Workflow Manager, Repository Manager.
- Created DataStage jobs using different stages like Transformer, Aggregator, Sort, Join, Merge, Lookup, Data Set, Funnel, Remove Duplicates, Copy, Modify, Filter, Change Data Capture, Change Apply, Sample, Surrogate Key, Column Generator, Row Generator, Etc.
- Extensively worked with Join, Look up and Merge stages.
- Extensively worked with sequential file, dataset, file set and look up file set stages.
- Integrated data sources like flat file, excel file, XML file with relational databases like Oracle.
- Validated the data and made test cases, maintained the mapping documents and prepared run book.
- Created and scheduled the jobs using Control-m workload automation tool.
- Performed code review to validate all the mappings and workflows in the project to ensure the code quality.
- Production Migration and Deployment handled UAT and production failures.
- Tested End to End to verify the failures in the mappings.
Environment: IBM Infosphere DataStage 11.3, SQL, Oracle 11g, Teradata, Unix (Putty), Control-m, Git Hub, Microsoft Visio
Confidential, Detroit, MI
ETL Developer
Responsibilities:
- Used the DataStage Designer to develop processes for extracting, cleansing, transforming, integrating and loading data into staging tables.
- Extensively used ETL to load data from IBM DB2 database, XML & Flat files Sources
- Involved in analysis, planning, design, development, and implementation phases of projects using IBM Web Sphere software.
- Developed complex jobs using various stages like Lookup, Join, Transformer, Dataset, Row Generator, Column Generator, Datasets, Sequential File, Aggregator and Modify Stages.
- Used the DataStage Director for testing and debugging.
- Created shared container to in corporate complex business logic in job.
- Created and modified batch scripts to ftp files from different servers to data stage server.
- Create parameter set to assign a value to job at run time.
- Implemented PL/SQL scripts in accordance with the necessary Business rules and procedures.
- Developed PL/SQL procedures & functions to support the reports by retrieving the data from the data warehousing application.
- Modified existing mappings based on the user requirements and did testing on the same.
- Coordinated with QA People to follow up the issues and fixing them before migrating to the production.
- Experience in debugging mappings, error handling strategies.
- Prepared SQL Queries to Validate the data both in sources and targets.
Environment: IBM Infosphere DataStage 8.7.1, SQL, IBM DB2, PL/SQL, UNIX, Control - m
Confidential
ETL Engineer
Responsibilities:
- Used IBM DataStage Designer to develop jobs for extracting, cleaning, transforming and loading data into data marts/data warehouse.
- Developed several jobs to improve performance by reducing runtime using different partitioning techniques.
- Used different stages of DataStage Designer like Lookup, Join, Merge, Funnel, Filter, Copy, Aggregator, and Sort etc.
- Use the EXPLAIN PLAN statement to determine the execution plan Oracle Database.
- Involved in writing Test Plans, Test Scenarios, Test Cases and Test Scripts and performed the Unit, Integration, system testing and User Acceptance Testing.
- Strong knowledge in creating procedures, functions, sequences, triggers.
- Expertise in PLSQL/SQL.
- Performed debugging and unit testing and System Integrated testing of the jobs.
- Wrote UNIX shell script according to the business requirements.
- Designed strategies for archiving of legacy data.
- Created shell scripts to perform validations and run jobs on different instances (DEV, TEST and PROD).
- Wrote mapping documents for all the ETL Jobs (interfaces, Data Warehouse and Data Conversion activities)
Environment: IBM Infosphere DataStage 8.5.1, SQL, Oracle 11/10g, PL/SQL, UNIX