Senior Datastage Developer Resume
Concord, CA
SUMMARY
- Confidential is an IBM Certified Professional with 7+ years of IT experience, with special expertise in Development, Analysis and Design of ETL methodologies in all the phases of the Data Warehousing life cycle. He has extensive experience IBM Web Sphere 8.5/8.1.0/8.01 (Data stage, Information Server, Quality Stage, Information Analyzer, Metadata Server) and experience in Ascential DataStage 7.5/6.0XE/5.0 using components like DataStage Designer, DataStage Manager, DataStage Director, DataStage Administrator and Parallel Extender. Expert in all the stages of Software Development Life Cycle.Skills Inventory
- Strong Decision Support Systems experience using Business Intelligence tool Business Objects.
- Expert in developing DataStage jobs using star schema and snowflake schema modeling and conversant with Data Warehouse concepts like fact tables and dimension tables.
- Created and implemented common UNIX shell scripts to FTP the files from/to different servers, validate, process the Flat Files for different projects.
- Strong understanding of the principles of Data Warehouse using Fact Tables, Dimension Tables, Star Schema modeling and Snowflake Schema modeling.
- Extensive experience in using highly scalable parallel processing infrastructure using DataStage Enterprise Edition.
- Expert in writing, testing and implementing triggers, functions and procedures at database level using PL/SQL.
- Experienced in incorporating various data sources like Oracle, MS SQL Server, DB2, Teradata, XML and flat files into the staging area.
- Experienced in troubleshooting of DataStage jobs and addressing production issues like performance tuning and enhancement.
- Having good experience in the preparation of JIL scripts to schedule the jobs through Autosys.
- Expertise in data migration projects with (target - source) same database as well as with heterogeneous databases.
- Strong analytical, organizational, presentation and problem solving skills.
TECHNICAL SKILLS
- Tools/Methods
- Web Sphere 8.7/8.5/8.1/8.01 Data stage
- Information Server Quality stage, Metadata Server Ascential DataStage7.5/6.0XE/5
- Quality Stage 7.x, 8.0
- Oracle 11g/10g/9i/8i, DB2
- Tera Data, PEGA
- SQL Server2000/2005, PL SQL
- Shell Scripts, C Shell
- K - Shell Scripts, AWK, VI-editor
- Perl Scripting
- COBOL, C, C++, JAVA
- SQL, PL/SQL, Linux/Unix
- IBM AIX 4.3
- Win XP/NT 6.0/5.0/4.0
- FrontPage 98/2000
- Excel, Power Point
- Visual Interdev 6.5/6.0
- Visual Age 3.5
- Cognos
- AUTOSYS
PROFESSIONAL EXPERIENCE
Senior DataStage Developer
Confidential, Concord, CA
Responsibilities:
- Communicated with business users and management to get business requirements and translate to ETL specifications.
- Developed Full Software Development Life Cycle (SDLC) including requirement analysis, design, development, testing, implementation and Project documentation.
- Designed Mappings between sources to operational staging targets, using Star Schema, Implemented logic for Slowly Changing Dimensions (SCD).
- Developed jobs for handling different data transformations as per specified requirements using stages like Join, Merge, Lookup, Transformer and Aggregator etc.
- Extensively used SCD, Sort, Merge, Aggregator, Peek, Dataset, Funnel and Remove Duplicates stages.
- Designed the Unit testing and integrated testing process and necessary documentation.
- Used DataStage Manager for importing and exporting jobs into different environments and projects.
- Used UNIX scripts to execute jobs and also used the DataStage Director for scheduling, executing and monitoring jobs.
- Used shared containers to reuse the specific business logic in various jobs to eliminate redevelopment.
- Extensively worked on DataStage Job Sequencer to Schedule Jobs to run jobs in Sequence.
- Experienced in troubleshooting of DataStage jobs and addressing production issues like performance tuning and enhancement.
- Experience with Scheduling tool Autosys for automating the ETL process.
Environment: IBM Information Server 8.7 (Designer, Director, Manager and Administrator), QualityStage, AutoSys, K-Shell Scripts, Win SCP, Oracle 10g, PL/SQL, Toad 7, SQL Developer.
Senior DataStage Developer
Confidential -Charlotte-NC
Responsibilities:
- Involved in understanding business processes and coordinated with business analysts to get specific user requirements to build Data Marts.
- Designed and developed the DataStage server as well as parallel jobs for extracting, cleansing, transforming, integrating and loading data using DataStage designer.
- Developed DataStage jobs to load the data into tables using fast load utility.
- Worked with DataStage director to schedule, monitor and analyze performance of individual stages and run DataStage jobs.
- Imported and exported repositories across project using DataStage manager.
- Set various environment variables and parameters in DataStage administrator and also used them in managing the projects.
- Used several stages like sequential file, hash file, aggregator, funnel, change capture, change apply, transformer, merge, join and lookup during the development process of the DataStage jobs.
- Involved in the preparation of ETL documentation by following the business rule, procedures and naming conventions.
- Used shard containers for reusability and reducing job complexity.
- Responsible for unit, system and integration testing. Development test scripts, test plan and test data. Participated in UAT (User Acceptance Testing).
- Analyzed data with discrepancies through error files and log files for further data processing and cleansing.
- Responded to user request on queries on the enterprise data warehouse and creation of reports using cognos 8.
- Created Autosys jils for scheduling and running DataStage jobs.
- Used UNIX commands like awk, sed, and cat.
Environment: Data Stage 8.5 (Designer, Director and Administrator), Oracle8i/10g, IBM AIX, MS SQL Server, Flat files, Oracle 10g, TOAD, Windows XP, MS Word, Excel, Autosys.
Senior DataStage Developer
Confidential, Auburn Hills, MI
Responsibilities:
- Experienced with full life Cycle and Methodology for implementing logical and physical models of Data warehouse.
- Designed & Developed jobs according business requirements.
- Involved in design and development of parallel jobs, sequences using the Designer.
- Designed several parallel jobs using Sequential File, Dataset, Join, Merge, Lookup, Change Apply, Change Capture, Remove duplicates, Funnel, Filter, Copy, Column Generator, Peek, Modify, Compare, Oracle Enterprise, Surrogate Key, Aggregator, Transformer, Row Generator stages.
- Enhanced the Job Performance by using proper partitioning methods and analyzing the resources utilized using Job Monitor.
- Experience in UNIX shell scripting, for various functions such as maintenance, backup, and configuration.
- Tuned DataStage jobs for better performance by creating DataStage Lookup files for staging the data and lookups.
- Implemented DataStage Application using Change Data Capture (CDC).
- Implemented the Slowly Changing Dimension (SCD) strategy in DataStage, for Dimension tables for storing history data.
- Created job sequences and job schedules to automate the ETL process. Enhanced the job properties for performance tuning.
Environment: IBM Web Sphere DataStage 8.5, Oracle 9i, DB2, PL/SQL, Win SCP, TOAD, UNIX, Windows XP, MS Word, Excel, Autosys.
Senior DataStage Developer
Confidential, SFO, CA
Responsibilities:
- Involved in understanding Business Process and coordinated with Business Analysts to get specific user requirements to build the Data Marts.
- Involved in preparing the source to target data mapping for ETL design and development.
- Developed DataStage ETL jobs and Data Loader definitions in the Enterprise Warehouse based on the client’s requirements.
- Used Oracle PL/SQL for writing stored procedures, packages for extracting the data.
- Developed DataStage jobs using star schema and snowflake schema modeling and conversant with Data Warehouse concepts like fact tables and dimension tables.
- Created Shared Containers to increase Object Code Reusability and to increase throughput of the system.
- Developed jobs in Parallel Extender PX using different stages like Transformer, Aggregator, Lookup, Join, Merge, Modify, Remove Duplicate, Oracle Stage, Sort, Peek, Row Generator, Column Generator, Sequential File, and Data Set.
- Performed the Unit testing for jobs, sequencers and services, developed to ensure that it meets the requirements.
- Highly involved in Performance enhancement of the process as the batch window to load the data was very small. Using various performance improvement techniques to achieve the target.
- Extensively worked with Job sequences using Job Activity, Email Notification, Wait for File activities to control and execute the DataStage Parallel jobs.
- Created Autosys jils for scheduling and running DataStage jobs.
Environment: IBM Websphere Information Server 8.1/7.5.2(Designer, Quality Stage, Director, Administrator) Teradata, Oracle 9i/10g, TOAD 9.5, Win SCP, Windows XP,UNIX- AIX, XML, MS Word, MS VISIO, Excel, Autosys.
Confidential, Sunnyvale, CA
DataStage Developer
Responsibilities:
- Involved in Analysis, Requirements gathering, function/technical specification, development and testing.
- Involved in Design and Data Modeling using Star schema.
- Created snapshot for the transactional tables in distributed databases.
- Utilized Data stage Parallel Extender PX, to optimize performance in this very large database environment.
- Involved in critical multiple instance DataStage jobs which will send the outbound files for different Lobs (line of business) at the same time and monitored the jobs accordingly.
- Scheduled the parallel jobs using DataStage Director, which is controlled by DataStage engine and also for monitoring and performance statistics of each stage.
- Created Error Tables containing data with discrepancies to analyze, re-process the data.
- Extensively used Parallel Stages like Join, Transformer, Merge, Lookup, Filter, Aggregator, Modify, Copy, Sort, Funnel, Remove Duplicates, Surrogate key Generator, Row Generator, Column Generator, and Peek for development and de-bugging purposes
- Used DataStage Manager to Import and Export DataStage components and Import table definitions from source databases.
- Developed Fact Tables, Dimension Tables, Star Schema modeling and Snowflake Schema modeling.
- Used Teradata Bulk Load stages to Load the data into Tearadata Database.
- Involved in the design, development and testing of the PL/SQL stored procedures, packages and triggers for the ETL processes.
- Written SQL scripts to create and drop the indexes those are passes as parameters in the pre & post sessions.
- Developed shell scripts for job scheduling and logging.
- Involved in performance tuning of SQL queries by providing hints, obtaining Explain Plan, Analyzing tables and adding indexes.
- Involved in full integration test and code reviews of all jobs within each sequence before migrating the jobs and sequencers from the Development environment (Dev) to the QA environment, and from the QA environment to the Production environment.
Software Environment: IBM Information Server 8.1/8.0.1/7.5, Oracle 10g/11g, SQL, PL/SQL, SQL Loader, Teradata, UNIX Shell Script, TOAD, ERWIN 4.2, Win XP and AIX UNIX, Autosys.
DataStage Developer
Confidential - Phoenix-AZ
Responsibilities:
- Developed DataStage Parallel & Server jobs for extraction transformation and loading of the data from source systems.
- Used Environment Variables, Stage Variables and Routines for developing Parameter Driven Jobs.
- Extensively used all activities like Routine Activity, Wait for File Activity, Exception Handler and Notification Activity.
- Worked extensively on different types of stages like Sequential file, Lookup File set, ODBC Stage, Aggregator, Transformer, Copy, Merge, Join, Filter, Column Generator, Funnel, Peek, Change Capture, Change Apply Stage, and several other stages for developing Parallel Jobs.
- Implemented the Slowly Changing Dimension, SCD Type-II strategy in DataStage, for the OLAP Dimension tables storing history data.
- Used DataStage Director to Run and Monitor the Jobs performed, automation of Job Control using Batch logic to execute and schedule various DataStage jobs.
- Created job sequencers using DS parameters.
- Coding Data Correction Utilities on the Corporate Data Warehouse (CDW)
- Created batch files to archives the production files and cleans from production once it’s archived.
- Created Test Plans and Test Scripts for Unit and Integration Testing for designed jobs.
- Loading data into warehouse environment, monitoring certain automatic loads, correction reloads, all in support of an existing environment.
- Worked with others- onsite team and offshore team, in a supporting role, to ensure ongoing quality of and availability of data warehouse environment.
Environment: DataStage 7.5 Parallel Extender (Designer, Manager, Director), DataStage BASIC language Expressions, Windows NT 4.0, PL/SQL, Oracle, MS SQL Server, UNIX, Sequential files, MS Access.
Confidential
Responsibilities:
- Managed the Datastage repository using Datastage Manager.
- Suggested various changes in the physical model to support the business requirements.
- Developed server jobs to load the data from flat files, text files and MS SQL.
- Set various parameters in Datastage Administrator for performance tuning.
- Used Datastage Designer to develop various jobs.
- Utilized shared containers for code reusability and for implementing the predefined business logic.
- Created and scheduled the job sequences by checking job dependencies.
- Wrote complex SQL queries using joins, sub queries and correlated sub queries.
- Wrote PL/SQL stored procedures, functions and packages and triggers to implement business rules into the application.
- Developed shell scripts to invoke back end SQL and PL/SQL programs
- Performed Unit Testing to check the validity of the data at each stage.
- Used Datastage Director to debug the jobs, to view the error log to check for errors.
- Implemented best practices in the development environment (code standards, code migration).
Software Environment: Datastage 6.0/5.0, Windows NT 4.0, Oracle 8i, UNIX, Sequential files, MS Access 2000.
