Sr. Datastage Developer Resume
Professional Summary:
- Having 7+ years of IT industry experience in Analysis, Design, Development, Testing, Implementation and Support of Data Warehouses, Database Technologies using Extraction Transforamtion and Load (ETL) tools like IBM Web Sphere Datastage 8.0/7.5/6.x.
- Extensive domain knowledge of Healthcare and Life Insurance, Financial Industry.
- Developed industry standard solutions with Datastage ETL jobs based on business requirements using various Datastage stages like Sort, Column Import, Modify, Aggregator, Filter, Funnel, Join, Lookup, Merge, Change Capture, Datasets, Sequential files and Transformer etc.
- Extensive experience in design and implementation of Star Schema, Snowflake Schema and Multi Dimensional Modeling.
- Experience in Data integration of various data source like DB2, TeraData, SQL Server, Oracle, MS Access and Flat Files into staging area and eventually to warehouse.
- Experience in troubleshooting of DataStage jobs and addressing production issues like performance tuning and enhancement.
- Using CMM methodology to ensure compliance with project processes and quality of deliverables from the development team.
- Proficient in designing DS Jobs and Job Sequences using Datastage Enterprise Edition.
- Worked in designing and developing the Logical and Physical model using Erwin.
- Excellent working knowledge on multiple platforms like Windows NT/2000, UNIX (Sun Solaris, IBM-AIX, HP).
- Proficiency in data warehousing techniques for Slowly Changing Dimensions, Surrogate key assignment and Change Data Capture.
- Experience with Onsite Offshore Working Model.
- Gained a good experience in UNIX Shell Scripting.
- Strong problem solving capabilities with Good communication skills with a proven track record of success.
- Providing support in Development, QA and Production environments
- An excellent team member with
- An ability to perform individually as well as ability to work in-group
- Excellent problem solving with good analytical and programming skills
- Good time management skills and strong communication skills
- Hard working and high level of motivation.
- Quick learner and initiative to learn new technology and tools quickly.
ETL Tools : IBM Datastage 8.0/7.5/7.1/6.0
Databases : Oracle 11g/10g, MS SQL Server 2005, DB2, Teradata V2R5,
IBM DB2 UDB 7.2/8.1.
Data Modeling Tools Erwin 4.2/3.5.2
Operating Systems : MS-DOS, Windows NT/2000,UNIX,AIX, Sun Solaris, LINUX
Languages : SQL, PL/SQL, C, C++
Scripting : UNIX Shell Scripting, DOS Scripting.
Other Tools : MS Office,Visio,SQL*Plus, TOAD, Sql Developer.
Confidential,Washington DC
Role: Sr. DataStage Developer Jul 2008 to Till date
FLEXX – NASCO/FACETS 4.5 Migration:
This project scope includes populating the Datawarehouse from different data feeds and other operational data sources. This application is specifically designed for Facets 4.5 Dental and FLEXX Medical Claims Processing Systems of CareFirst BCBS.
Responsibilities:
- Created DataStage Parallel Jobs to create Fact and Dimension Tables.
- Modified the DS jobs to improve performance.
- Created standards for using different types of Stages such as FTP, Hashed File, Sequential File, Sort, Aggregator, Transformer and ODBC to develop different jobs.
- Created re-usable components using shared containers for local use or shared use.
- Used Parallel Extender for splitting the data into subsets and flowing of data concurrently across all available processors to achieve job performance by invoking partition parallelism.
- Worked with SCD stage for implementing slowly changing dimensions.
- Worked with Orchestrate environment for parallel processing at Job & lookup stage.
- Generated surrogate ID’s for the dimensions in the fact table for indexed and faster access of data.
- Excessive usage of DS Director for monitoring Job logs to resolve issues.
- Designed and developed jobs using Data stage Components.
- Used various stage like Sort, Merge, Sequence, Transformer to process the data.
- Used the Datastage Designer to develop processes for extracting, clean, transforming, integrating, and loading data into data warehouse database.
- Developed user defined Routines and Transformations.
- Used the DataStage Director and its run-time engine to schedule running the solution, testing and debugging its components, and monitoring the resulting executable versions.
- Designed, developed, and deployed DataStage mappings and associated functionality in an Item Costing and Product Inventory Data Mart.
- Participated in weekly status meetings.
- Configured File for Parallel extender as per number of processors available to process the data by using DataStage Director.
Technical Environment:
IBM Information Server/ WebSphere Datastage 8.0.1(Datastage, QualityStage), Oracle 10g,
Windows XP, UNIX.
Confidential,North Haven, CT
Role:Programmer Analyst Sep 2006 to June 2008
WellPoint is the largest and most experienced health insurance company in the state, providing more than 6.5 million members with comprehensive and affordable health plans This will be achieved by providing with a timely Enterprise view and single source of truth for transactional activity across the Enterprise including our subsidiaries This will be accomplished by creating and maintaining an enterprise operational data store, and access mechanisms, which are reflective of the transactional enterprise information in a common model.
Responsibilities:
- Involved in design and development of parallel jobs, sequences using the Designer.
- Designed several parallel jobs using Sequential File, Dataset, Join, Merge, Lookup, Change Apply, Change Capture, Remove duplicates, Funnel, Filter, Copy, Column Generator, Peek, Modify, Compare, Oracle Enterprise, Surrogate Key, Aggregator, Transformer, Row Generator stages.
- Used Environment Variables, Stage Variables and Routines for developing Parameter Driven Jobs and debugging them.
- Used both Pipeline and Partition Parallelism for improving performance.
- Used lookup stage with reference to Oracle tables for insert/update strategy and updating of slowly changing dimensions.
- Involved in Jobs and analyzing scope of application, defining relationship within & between groups of data, Star Schema etc.
- Created the ETL routines for enterprise Operational Data Source.
- Performed the system test and resolved the functional and technical issues.
- Developing Shell Scripts to automate file manipulation and data loading procedures.
- Assist the development team to address the issues raised by testing team
- Along with designing mappings from scratch, re-wrote existing code to enhance performance and trouble-shoot errors in both Oracle8I and DataStage.
- Used Parallel Extender for distributing load among different processors by implementing pipeline and partitioning of data in Parallel Extender.
Technical Environment:
Environment: Ascential DataStage XE/7.5, Parallel Extender, Orchestrate, Erwin, Oracle9i/8i,XML Scripting, DB2 UDB, PL/SQL, DB2 z/OS,AIX and Windows NT.
Confidential,Minneapolis, MN
Role: DataStage Developer Aug 2005 to Aug 2006
U.S. Bancorp is a leading financial services company, with international offices serving clients in many countries. U.S. Bancorp provides individuals, small businesses and commercial, corporate and institutional clients across the United States to manage their financial lives. In this project I was involved in the analysis, design, testing and deploying the data from the source system to the Data warehouse system according to the business requirements of the users by using the DataStage ETL tool.
Responsibilities:
- Involved in meetings to gather information and requirements from the clients.
- Involved in Designing the ETL process to Extract translates and load data from OLTP Oracle database system to Teradata data warehouse.
- Gathered information from different data warehouse systems and loaded into One Sprint Financial Information System Consolidated model using Fast Load, Fast Export, Multi Load, Bteq and UNIX Shell Scripts.
- Used the Ascential DataStage Designer to develop processes for extracting, cleansing, transforming, integrating, and loading data into data warehouse database.
- Technical expert in the areas of relational database logical design, physical design, and performance tuning of the RDBMS.
- Also worked as an SME in reviewing and approving ETL optimization techniques.
- Worked extensively on different types of stages like Sequential file, ODBC, Hashed File, Aggregator, Transformer, Merge, Join, Lookup, Sort and Containers (Server, Parallel) for developing job.
- Create master controlling sequencer jobs using the DataStage Job Sequencer.
- Effectively used DataStage Manager to Import/Export projects from development server to production server. Parameterized jobs for changing environments.
- Extensively used ETL to load data from Oracle9i, XML files and Complex Flat files.
- Responsible for trouble shooting, identifying and resolving data problems, Worked with analysts to determine data requirements and identify data sources, provide estimates for task duration.
- Optimized performance of Mappings and sessions by identifying bottlenecks and eliminating them.
- Created Fast Load, Fast Export, Multi Load, Tpump, BTEQ scripts for One Sprint Financial Information System.
- Used the DataStage Director and its run-time engine to schedule running the solution, testing and debugging its components, and monitoring the resulting executable versions (on an ad hoc or scheduled basis).
- Scheduled jobs dependencies using Control-M Scheduler.
- Implemented Unit, Functionality, Performance and Stress testing on Mappings and created Testing Documents.
- Involved in unit testing, systems testing, integrated testing and user acceptance testing.
Technical Environment:
Ascential DataStage 7.5.2 (Administrator, Manager, Designer, Director, Parallel Extender), Teradata, Teradata SQL, Tools & Utilities (BTEQ, Fast Export, Multi Load, Fast load, TPUMP, Queryman), Oracle 9i, AutoSys, Windows 2000/Unix.
Confidential,Dearborn, MI
Data Warehouse Analyst/Developer May 2004 to Aug 2005
The project in Ford Motor Company is a warehousing of financial related data. The primary objective of the project is to estimate the actual accounting of the vehicles. The objective is to pull Vehicle related financial data based on unique vehicle identification number (VIN) from different sources and load into Oracle ODS and then into Teradata data warehouse.
- Involved in the entire life cycle from design, developing, testing using DataStage Designer to develop Parallel Jobs (Orchestrate environment) for Extracting, Cleansing, Transforming, Integrating and Loading data into Oracle database
- Created Various Parallel Jobs Utilizing Different Partitioning techniques running on Multiple nodes
- Developed Shell Scripts for automation, for file validation and for data loading procedures
- Developed Graphical Job Sequencers using activity stages, which specifies a sequence of parallel jobs to run.
- Used Autosys for scheduling the executable versions of the DataStage as it was the project wide used scheduler
- Used debugging stages like Peek, head and tail for developing and testing the jobs.
- Worked on Oracle OCI, Oracle Enterprise stages
- Used debugging stages like Peek, head and tail for developing and testing the jobs.
- Join stage, Modify stage, Funnel stage, Lookup stage, Transformer stage, Merge stages are used for validating and transforming the data
- Created Parallel Shared Containers for Re-usability of common jobs.
- Used Before/After Job-Subroutines in Job Properties.
Technical Environment:
Ascential DataStage 7.5.2 (Administrator, Manager, Designer, Director, Parallel Extender), Oracle Loader, Oracle9i/8i, DB2 UDB, PL/SQL, AutoSys, UNIX and Windows NT.
Confidential,India
Role: Programmer Analyst Jan 2003 to April 2004
GE Healthcare is the world\'s leading manufacturer of medical equipments; GE Healthcare provides services to their own systems and also to the third party systems.
The primary objective of the project is to develop GEHC system making extensive use of Data marts. The objective is to extract data stored in different databases and load into Oracle system.
- Involved in the creation of jobs using Data Stage Designer and used Director to Validate, Schedule, Run and Monitor the Data Stage jobs.
- Involved in designing the procedures for getting the data from all systems to Data Warehousing system.
- Information is stored in table definitions in the repository and is entered using the Data stage manager.
- Extensively used ETL to load data Oracle database.
- Exporting the universe to the Repository to make resources available to the users.
- Analysis, coding / change requests / data extraction, Testing.
- Used DataStage Manager for importing metadata from repository, new job categories and creating new data elements.
- On call Support
- Attending meetings with Client frequently
- Overseeing daily activities, coordinating tasks and mentoring new trainees.
- Used Parallel Extender for parallel processing for improving performance when extracting the data from the sources.
- Designed complex job control processes to manage a large job network.
Technical Environment:
Ascential Datastage 6.0, Oracle 9i, SQL, PL/SQL, SQL* Plus, Windows NT, UNIX.
