We provide IT Staff Augmentation Services!

Datastage Developer Resume

3.00/5 (Submit Your Rating)

Eden Prairie, MN

SUMMARY:

  • IT professional, around Seven years’ experience in Design and Development and Five years of experience in IBM DataStage v 8.x/7.x/6.x/5.2 (Server and Enterprise Edition) using Components like Administrator, Manager, Designer, and Director
  • Five years in the fields of data Warehousing, Data Integration, Data Migration using IBM Websphere DataStage, Teradata, Oracle, PL/SQL, DB2, UDB, SQL Server 2000/2005/2008. SQL procedural language and Shell Scripts
  • Used tools like Infosphere DataStage Designer and Director for developing Jobs.
  • Experience in Information Analyzer, QualityStage, Metadata Workbench, Business Glossary, FastTrack etc products in IBM InfoSphere Information Server IIS 8.0/8.1 Suites and WebSphere Application Server
  • Over Five years of experience in ETL methodologies in all the phases of the Data warehousing life cycle
  • In Depth knowledge in Data Warehousing & Business Intelligence concepts with emphasis on ETL and full Life Cycle Development including requirement analysis, design, development, testing and implementation
  • 2+ years of experience in the SAP Analysis, Testing and Implementation.
  • Expertise in all the phases of System development life Cycle (SDLC) using different methodologies like Agile, Waterfall.
  • Expertise in OLTP/OLAP System Study, Analysis and Dimensional Modeling, E - R modeling. Involved in ODS/Designing Dimensional Model (Star Schema and Snowflake Schema) Designing logical and physical design data modeling with star schema using Erwin.
  • Extensively used SQL coding for overriding the generated SQL in DataStage and also tested the data loaded into the data base.
  • Extensively used DataStage- Designer to design and develop Server and PX jobs to migrate data from transactional systems (Sybase, DB2UDB) into the Data Warehouse.
  • Extensively used DataStage- Manager to Export/Import DataStage Job components and Import Plug in Table Definitions from DB2UDB, Oracle and Sybase databases.
  • Designed Server jobs, Job Sequencers, Batch Jobs and Parallel jobs.
  • Designed Parallel jobs using various stages like join, merge, lookup, remove duplicates, filter, dataset, lookup file set, modify, aggregator, CFF, Transformer.
  • File Stages - Lookup File Set, Dataset, Sequential File, Database Stages - DB2 Enterprise, and Sybase OCI, Surrogate Key
  • Good Experience in Extraction Transformation and Loading (ETL) processes using Data Stage ETL Tool, Parallel Extender, Metastage, Quality Stage, Profile Stage
  • Developed Server jobs using various types of stages like Sequential file, ODBC, Hashed File, Aggregator, Transformer, Sort, Link Partitioner and Link Collector .
  • Experience in integration of various data sources like Oracle, Teradata, DB2, SQL Server, MS Access and Flat files into the Staging Area. Extensively worked with materialized views and TOAD
  • Proven track record in troubleshooting of DataStage Jobs and addressing production issues such as performance tuning and enhancement
  • Excellent knowledge of studying the data dependencies using Metadata stored in the Repository and preparing batches for the existing sessions to facilitate scheduling of multiple sessions

TECHNICAL SKILLS:

Data Warehousing:  IBM DataStage 8.7/8.5/8.1/8.0/7.5.3/7.5.2/7.5.1/7.0 (Designer, Director, manager, Administrator), Parallel extender, Server Edition, Quality Stage 7.5, Infosphere & InfoAnalyser, ETL, Metadata, OLAP, OLTP, SQL*Plus

Databases: Oracle11g/10g/9i/8i/8.0/7.0,DB2UDB7.2/8.1/9.0,Mainframe,TeradataV2R5, DB2UDB,MSAccess 7.0/’97/2000, MS SQL Server 2010/2008/2005/2000

Programming:  SQL, Unix Shell Scripting, SQL loader ERP Oracle Applications 11i/11.0.x (Accounts Payables (AP), Accounts Receivables (AR))

Others:  MS Office

Environment:  IBM AIX 5.3/5.2/4.2, MS DOS 6.22, Win 3.x/95/98/2000, Win NT 4.0,Win XP ERP SAP R/3 4.6B, 4,6C, 4.7, ECC 5.0, 6.0 in MM, SD, PP, ABAP

PROFESSIONAL EXPERIENCE:

Confidential, Eden Prairie, MN

DataStage Developer

Responsibilities:

  • Monitoring production cycle and solving Tivoli job failures.
  • Reporting the batch run status to downstream and upper level management at definite intervals.
  • Understand the application thoroughly in order to solve production issues.
  • Working on code changes and unit testing while working on Service requests.
  • Understanding the technical & functional specifications of the system.
  • Doing RCA (Root cause analysis) of production abends and giving solution for the same.
  • Performing analysis for RTI (Run time improvement) of the production jobs.
  • Strict adherence to SLA while performing the above task.
  • Involved in updating/modifying UNIX scripts which generated emails to notify the incoming data to the Tango DA team.
  • Worked closely with Project lead/Manager, system admins to understand the business process and functional requirements
  • Did unit testing of the jobs developed before taking them for Test and Production.
  • Modified DataStage wrapper scripts written in UNIX (BASH)
  • Used UNIX Shell Scripts for extracting and parsing data from files into the database tables.
  • Involved in writing Shell scripts for reading parameters from files, invoking DataStage jobs, and FTP files to specific locations.
  • Scheduled the jobs with Automated Scheduler, Event Coordinator for batch transfers of data.
  • Using the stages like Modify, Look-up, Remove Duplicate, Aggregator in modifying the jobs already developed to improve the performance and to meet the requirements.
  • Agile methodology (2 week scrums) was followed to produce the results and provide support to the ETL jobs.
  • Extensively used SQL coding for overriding the generated SQL in DataStage and also tested the data loaded into Hadoop.
  • Involved in writing SQL scripts for data validation of files received from various sources.
  • Implemented complex logics in Transformer stage like date validation, use of stage variables
  • Migrated DataStage jobs from 8.1 earlier versions to DataStage 9.2

Environment: ETL, DataStage EE 8.5/9.2, Information Analyser 8.5, Oracle 11g, SQL Server 2010, Teradata Client 12.0, Tiwoli Job scheduler and HPSM.

Confidential, SanJose, CA

DataStage Developer

Responsibilities:

  • Involved in building a prototype which comprised of multiple jobs without mapping documents.
  • Imported a routine to convert a decimal value to over punch character.
  • Gathering the ETL requirements for implementing the business rule and mapping corresponding data.
  • Designed ETL jobs which populated the tables using column generator, surrogate key, join, Oracle enterprise stage
  • Involved in modifying UNIX scripts which generated emails to notify the incoming file to the RITS team.
  • Worked closely with Project lead/Manager, Architects, and Data Modelers, System Analyst to understand the business process and functional requirements
  • Did unit testing of the jobs developed before taking them for UAT and finally Production.
  • Used UNIX Shell Scripts for extracting and parsing data from files into the database tables.
  • Involved in writing Shell scripts for reading parameters from files, invoking DataStage jobs, and FTP files to specific locations.
  • Designed server jobs using various stages like Sequential file, Transformer, ODBC, Aggregator, Hash file, Link Partitioner
  • sScheduling the jobs with Automated Scheduler, Event Coordinator for batch transfers of data.
  • Using the stages like Modify, Look-up, Remove Duplicate, Aggregator in modifying the jobs already developed to improve the performance and to meet the requirements.
  • Developed job control routines for batch processing in DataStage BASIC
  • Agile methodology was followed to produce the results and to perform the ETL jobs.
  • Extensively used SQL coding for overriding the generated SQL in DataStage and also tested the data loaded into the data base.
  • Developed Data stage basic routines for automation ETL batch process.
  • Involved in writing SQL scripts for populating tables.

Environment: ETL, DataStage EE 8.7/8.5, Information Analyser 8.5, Oracle 11g, SQL Server 2010, SQL Navigator 6.5, SQL/PLSQL,, Control M, Windows XP.

Confidential, Orlando, FL

DataStage Developer

Responsibilities:

  • Gathered the ETL requirements for implementing the business rule and mapping corresponding data.
  • Modified the current jobs in the time-tracking system, to incorporate the business logic.
  • Identified source systems connectivity, related tables and fields and ensure data suitably for mapping.
  • Developed new Parallel jobs to populate new tables required with the DataStage Enterprise Edition.
  • Used the stages like Change-Capture, Modify, Look-up, Remove Duplicate in modifying the jobs already developed to improve the performance and to meet the requirements.
  • Developed custom Routines and use of stage variables in certain jobs
  • Developed various one-shot jobs to fix the issues and inconsistent data in production. Also, comparing the two data warehouses and minimizing the differences with comparison.
  • Gathered requirements for Data Integration needs for external data for BI
  • Written SQL scripts to populate new fields added to tables on a one-shot basis.
  • Unit Testing and Integration testing the individual and extract-transform-load jobs in sequence respectively.
  • Extensively analyzed the Data Sources in identifying data anomalies, patterns, value ranges. Wrote SQL scripts for accomplishing the same.
  • Designed and Developed validation ETL processes using Data Stage, Oracle SQL loader and Unix shell programming for Alignment, Customer, Promotions, and mainframe flat files Mapping Data Items from Source System to the Target System.
  • Compiled and debugged the Jobs based on the Errors.
  • Designed & Developed the DataStage jobs (using join, lookup, sort, remove duplicates, Transformer, Teradata MultiLoad stages etc.) for handling complex transformations.
  • Scheduled jobs using Autosys.
  • Written shell scripts for scheduling the ETL process.
  • Used Both Pipeline Parallelism and Partition Parallelism for improving performance.
  • Extensively used Parallel Extender to load data into data warehouse with different techniques like Pipeline and Partition in MPP environment.
  • Developed DataStage job sequences used the User Activity Variables, Job Activity, Wait for File stages
  • Used FastTrack to automate creation of ETL DataStage jobs, to leverage published profiling results from Information Analyzer and to create and link Business Glossary terms and their relationships.
  • Extensively used DataStage Tools like Infosphere DataStage Designer, Infosphere DataStage Director for developing jobs and to view log files for execution errors.
  • Migrated the code required for the project to QA and Model using Version Control .

Environment: ETL DataStage EE 8.5/8.1.2/8.0 , Oracle 10g, Teradata, SQL/PLSQL, Teradata, BTEQ, IBM InfoSphere 8.5   Quality stage 8.5/8.1.2/8.0 , Informix 9/8,Autosys,SQl * loader, Meta Stage, Korn Shell Scripts, UNIX, Windows XP.

Confidential, Quincy, MA

Data warehouse/DataStage Developer

Responsibilities:

  • Involved in requirements gathering and creating logical and physical design based on the business requirements
  • Used Teradata utilities for bulk loading of records which had count often going up to 30 million records.
  • Implemented logical and physical data modeling with Star and Snowflake techniques using Erwin in Data warehouse
  • Involved in creating entity relational and dimensional relational data models using Data modeling tool Erwin.
  • Worked closely with Project lead/Manager, Architects, and Data Modelers to understand the business process and functional requirements
  • Gathered requirements for Data Integration needs for external data for BI
  • Documented Technical Specifications for ETL Job Streams, Sequencers, Services
  • Extracted data from Facets application to pull it into the Staging area.
  • Updated data into the Facets application.
  • Used Facets application to open, add generations, enter and save information.
  • Analyzed Facets data like claims, billing to resolve related subject areas issues.
  • Extract and transform source data from DB2 Database
  • Used Teradata Enterprise stage to extract data from teradata database and integrated it with the source data from Oracle Database in staging area and loaded the data into Teradata database.
  • Good working knowledge in teradata database and other teradata utilities like Teradata FastLoad, MultiLoad, TPump used for extracting large volume of data and updating number of tables in a single pass.
  • Documented Technical Specifications for ETL Job Streams, Sequencers, Services
  • Documented Unit Test Cases for ETL Job Streams, Sequencers, Services and System Integration test cases also Develop and Build ETL Job Streams, Sequencers, Services
  • Used the DataStage Designer to develop jobs for data loads into the Sales Data mart.
  • Used DataStage Director to schedule running the solution, validate, monitor and analyze the job stages
  • Used DataStage Manager to import Metadata Table Definition.
  • Warehouse was implemented using sequential files from Mainframe System, Oracle, Excel utilized Data Stage to process into Operational Data.
  • Designed and Developed Extract, Transform and Load (ETL) functions from a variety of operational systems including legacy systems utilizing SQL, DataStage (ETL tool), Unix shell Scripts, bulk loads (Oracle SQL Loader), etc.
  • Always delivered the results on time and helped group in meeting the deadlines.
  • Have become a subject matter expert (SME) in a short time, proving my ability to grasp complex business models in relatively short time.
  • Development of processes for enterprise data cleansing and de-duplication using QualityStage
  • Used Before/After Job-Subroutines in Job Properties. Defined Constraints and Stage Variables.
  • Involved in testing of Stored Procedures and Functions written in PL/SQL, Unit and integration testing of DataStage jobs.
  • Used Autosys for job scheduling.
  • Extensively used SQL coding for overriding the generated SQL in DataStage and also tested the data loaded into the data base.
  • DataStage Performance Tuning.
  • Tuning SQL statements using Explain Plan Tool and Optimizer Hints for best response time.
  • Writing SQL statements to improve the performance of the jobs and tuning the SQL statements
  • Defined the program specifications for the data migration programs, as well as the necessary test plans used to ensure the successful execution of the data loading processes.
  • Used quality stage jobs to validate the quality of output data generated.
  • Worked on Control-M jobs for scheduling and automating the month end job flow.
  • Created shell scripts for production and the scripts for small changes using before-after subroutine. Used Korn Shell scripts for scheduling DS jobs.
  • Knowledge of configuration files, user defined variables.
  • Used sequencer activities and triggers to ensure appropriate completion of the jobs.

Environment: IBM Ascential DataStage 7.5.1(Designer, Manager, Director), Parallel Extender, Quality stage 7.1,Erwin 4.1, PL/SQL, Autosys, Teradata V2R5/V2R4, XML,IBM Mainframe, SQl Server 2000 Oracle 9i, Sun Solaris 2.6, Windows 2000, DB2 QMF V9, BTEQ, Korn Shell Scripting, DB2 UDB, SQL * Loader, MS DOS 6.22.

Confidential, Chicago, IL

ETL Consultant/DataStage Developer

Responsibilities:

  • Worked on DataStage parallel extender (DataStage 7.0) Designer to develop processes for extracting, cleaning, transforming, integrating, and loading data into data warehouse database
  • Performed data manipulations using various stages like Aggregator, Lookup, Join, Merge, Remove Duplicates and Transformer Stages
  • Worked on ETL DataStage Manager for importing and exporting metadata and dsx files.
  • Extensively used the DataStage Manager for importing Metadata from different Sources
  • Extensively worked on performance tuning of DataStage jobs by eliminating back to back to transformers and by proper partitioning, sorting data etc.
  • Configured the DataStage projects - created projects, enabled project properties and set the user accounts using DataStage Administrator.
  • Designed and developed validation routines to validate the data in warehouse and data marts after loading process balancing with source data.
  • Wrote in Data stage Routine and Transforms.
  • Involved in Warehouse Load Testing, System Testing, Integration Testing on Data stage Job’s and performed performance tuning on Data stage Job’s and SQL query tuning.
  • Developed UNIX shell scripts.
  • Provide Technical Guidance of Scheduling & Job Deployment
  • Creation of Shared containers so that it can be used by other modules of the plan.
  • Exporting the jobs to Testing Environment from Development and then to Production
  • Performed debugging on these jobs using Peek stage by outputting the data to Job Log or a stage
  • Performed testing of these Parallel jobs using Row Generator and Column Generator stages
  • Created Shell Scripts, which will in turn, call the DataStage jobs with all paths to Source and targets and even with connection information.
  • Used Shell Script to run and schedule DataStage jobs on UNIX server. Scheduler used Control-M on Unix
  • Performed Import and Export of DataStage Components and Table Definitions using DataStage Manager.
  • Used DataStage- Director to Run and Monitor the Jobs after successful compilation.
  • Generated reports using Cognos .

Environment: Ascential DataStage 7.0, Oracle 9i, UNIX Shell Scripts, Toad, Windows 2000/XP, Cognos 7.0, Erwin 3.5.

We'd love your feedback!