Datastage Developer Resume
Valley City, OH
SUMMARY
- 6+ Years of extensive experience in Analyzing, Designing, Developing, Testing, Implementing and Maintaining Data Warehouse business systems.
- Experience in ETL (Data Extraction, Transformation and Loading) using IBM InfoSphere Information Server 8.5/8.1 (DataStage, QualityStage), IBM DataStage Enterprise and Ascential DataStage 7.5/7.0 using DataStage Designer, Director, Administrator, Manager, and Parallel Extender to implement ETL Solutions and Data Warehousing Projects.
- Extensive experience in Analysis, Design, Data Extraction, Cleansing, Transformation and Loading into Data Marts.
- Involved in Dimensional Data Modeling (Star Schema, Snowflake Schema), Data Architecture, Business and Data Analysis.
- Designed Technical Design Specifications and Mapping Documents with Transformation Rules.
- Extensively worked on DataStage Parallel Extender and Server Edition.
- Used both Pipeline and Partition Parallelism for improving performance.
- Experience in developing Parallel jobs using various stages like Join, Merge, Lookup, Surrogate key, Funnel, Sort, Transformer, Copy, Remove Duplicate, Filter, Pivot and Aggregator stages for grouping and summarizing on key performance indicators used in decision support systems.
- Frequently used Peek, Row Generator and Column Generator Stages to debug.
- Expertise in Software Development Life Cycle (SDLC) of Projects - System study, Analysis, Physical and Logical design, Coding and implementing business applications.
- Expertise in performing Data Migration from various legacy systems to target database.
- Expertise in Data Modeling, OLAP/OLTP Systems, generation of Surrogate Keys.
- In-depth knowledge of Star Schema, Snowflake Schema, Dimensional Data Modeling, Fact and Dimension tables.
- Experience in Data Warehouse development, worked with Data Migration, Data Conversion, and ETL using Ascential DataStage with DB2 UDB, Oracle, SQL Server.
- Extensive experience in development, debugging, troubleshooting, monitoring and performance tuning using DataStage Designer, Director, and Administrator.
- Prepared job sequences and job schedules to automate the ETL processes.
- Experience in handling multiple relational databases like Oracle, MS SQL Server, Complex Flat Files, Delimited Files, Netezza, Teradata and DB2/UDB for Extraction, Staging and Production data warehouse environments.
- Experienced in using MQ Stage for WebSphere MQ enterprise messaging system.
- Configured MQ Connector for client connection mode on Unix and Windows.
- Experience with UNIX Shell Scripting for Data Validations and Scheduling the DataStage Jobs.
- Working knowledge of WebSphere Information Services Director (WISD).
- Extensive Experience in implementing AUTOSYS jobs in UNIX Environments.
- Used DataStage Version Control to promote DataStage jobs from Development to Testing and then to Production Environment.
- Strong knowledge of RDBMS concepts, SQL, PL/SQL and SQL Performance Optimization.
- Strong analytical, problem-solving and leadership skills, with the ability to interact with various levels of management to understand requests and validate job requirements.
- Team player with strong ability to quickly adapt to any dynamic developments in projects and capable of working in groups as well as independently.
TECHNICAL SKILLS
ETL Tools: IBM InfoSphere DataStage 8.5/8.1/8.0 (Designer, Director, Administrator), Ascential DataStage 7.5.1/7.5.0 (Manager, Designer, Administrator, Director) and Quality Stage 8.0/7.5.1.
Databases: Teradata 13, IBM DB2/UDB, Oracle 10g/9i/8i, MS SQL Server.
Languages: SQL, PL/SQL, UNIX Shell Script.
Data Modeling: Erwin 4.0/3.5, Oracle Designer
Reporting Tools: MicroStrategy 9i/8.1.1 (MicroStrategy Desktop, MicroStrategy Web, MicroStrategy Intelligence Server, Report Services).
Operating Systems: Windows (2008/2003/7/XP/NT/98/95), UNIX (AIX, Solaris, Linux).
Database Tools: SQL*Plus, SQL*Loader, TOAD, Autosys, Control-M, Teradata SQL Assistant.
PROFESSIONAL EXPERIENCE
Confidential, Valley City, OH
DataStage Developer
Responsibilities:
- Involved in gathering requirements from business for DataStage application development.
- Imported metadata from the repository and created new job categories, routines and data elements using DataStage Designer.
- Involved in creating Functional and Technical Scope design documents for ETL Processes.
- Worked with the Business Analysts for requirements gathering, business analysis, testing and Project coordination.
- Applied best practices and organizational coding standards in DataStage jobs.
- Worked with DataStage Designer and DataStage Director to develop various jobs.
- Performed data profiling on various source systems to identify and accommodate the required data.
- Implemented Audit and Logging functionality in batch jobs.
- Actively participated in developing implementation plans for Go Live projects.
- Used DataStage Designer to develop processes for extracting, cleansing, transforming, integrating and loading data into the data warehouse.
- Extensively used Sequential File Stage, Dataset Stage, File Set Stage, Lookup, Transformer and other database plug-ins to transform and load the data.
- Worked on different stages for creating the jobs based upon business application.
- Involved in creation of a new database to monitor the growth and performance of different instances.
- Used different Stages like Join, Lookup, Sparse Lookup, Sequential File, Dataset, Transformer, Sort, Aggregator, Merge, Funnel, Filter, Copy, Modify, Remove Duplicate, Change data Capture, Stored Procedure for developing different Jobs.
- Designed DataStage Sequencer Jobs and Shared Containers to implement the business requirements and design specifications.
- Extensively worked with Job Sequences, using Job Activity, Sequencer and Wait for File activities to control and execute DataStage Parallel jobs, and the Terminate activity to stop jobs on failure conditions.
- Imported metadata from the repository and created new job categories and data elements.
- Modified the configuration file according to space constraints, especially when using active stages.
- Performed debugging, troubleshooting, monitoring and performance tuning using DataStage 8.5.
- Used Sort and Filter stages to de-duplicate the data and generated surrogate keys using the Transformer stage in DataStage 8.5.
- Involved in performance tuning of the applications using IBM InfoSphere DataStage 8.5.
- Used DataStage Director and its run-time engine to test solutions, debug their components, and monitor the resulting executable versions.
- Developed UNIX Shell Scripts and updated the log for the backups.
- Involved in unit testing of the jobs and of the data loaded into the target database.
Environment: IBM InfoSphere DataStage 8.7/8.5 (Manager, Designer, Director, and Administrator), Parallel Extender, Oracle 10g, DB2, IBM Netezza 100, Erwin 4.5, Microsoft Visio, IBM AIX 4.2, UNIX, Windows.
Confidential, TX
DataStage Developer
Responsibilities:
- Documented user requirements, translated requirements into system solutions and developed implementation plans and schedules.
- Involved in designing DataStage Mapping and the Technical Documentation.
- Identified and documented Data Sources and Transformation Rules required to populate and maintain the data warehouse.
- Involved in Performance Tuning Activities, worked with DBAs to tune the queries for best performances.
- Involved in creating jobs and analyzing scope of application, defining relationship within and between groups of data, star schema, etc.
- Used DataStage Designer to develop processes for extracting, cleansing, transforming, integrating and loading data into Data Warehouse database.
- Created DataStage jobs using different stages like Transformer, Aggregator, Sort, Join, Merge, Lookup, Data Set, Funnel, Remove Duplicates, Copy, Modify, Filter, Change Data Capture, Change Apply, Surrogate Key, Column Generator, and Row Generator.
- Involved with scheduling team for scheduling the DS nightly jobs.
- Performed Import and Export of DataStage components and table definitions using DataStage.
- Used MetaStage for managing and collecting metadata from various other tools through the use of MetaBrokers.
- Extensively used Flat File Stage, Hashed File Stage, DB2 UDB Stage, FTP Plug-in Stage and Aggregator Stage during ETL development.
- Used shared containers for reusability and reducing job complexity.
- Used DataStage Director to run and monitor jobs; automated job control using batch logic to execute and schedule various DataStage jobs.
Environment: IBM InfoSphere DataStage 8.5/8.7 (Manager, Designer, Director, and Administrator), Parallel Extender, Oracle 10g, IBM DB2, Teradata 13, and Windows.
Confidential, MI
ETL Developer
Responsibilities:
- Reviewed the TTDs provided, developed the code and unit tested the jobs as per the requirements.
- Designed and developed DataStage jobs to extract data from heterogeneous sources, apply transformation logic and load the data into the data warehouse.
- Worked regularly with the XML extract stage, MQ Series, Complex Flat Files, Datasets, flat files, the XML stage, Lookup and Join stages, and FTP of files to the mainframe.
- Worked on various DataStage jobs belonging to Vendor, Comp Parts, MRC Receipts, Demand & Demand PO, General Ledger, BOM, Service Building indicator, Order Acknowledgement, Change Master, Order Completion, QC Clearance, etc.
- Monitored and scheduled Jobs in DataStage Director.
- Used MQ Stage to read messages from and write messages to the WebSphere MQ enterprise messaging system.
- Used BMC Remedy to create tickets for migration issues while on support and for disk space issues in DEV, QA, Pre-Prod and Prod.
- Used Citrix for secured processing of DataStage Designer jobs in test, pre-prod and prod.
- Involved in Modifying existing DataStage Jobs.
- Interacted with scheduling team for scheduling DS nightly jobs.
- Used DataStage Designer to develop various jobs to extract, cleanse, transform, integrate and load data into the Data Warehouse.
- Used DataStage Director to schedule and run jobs, monitor schedules and validate job components.
- Used Erwin for Data modeling.
- Frequently used ClearCase for version control.
- Ran and monitored jobs using DataStage Director and checked logs.
- Performed extensive back-end testing by writing Oracle SQL and PL/SQL queries to extract data from the database.
- Involved in Unit testing of the DataStage Jobs.
- Monitored all data loads and fixed errors.
- Used Primavera in accordance with DataStage work requirements.
Environment: IBM InfoSphere DataStage 8.5/8.7 (Manager, Designer, Director, and Administrator), Parallel Extender, IBM InfoSphere QualityStage 8.5, Teradata V2R5, Oracle 10g, DB2, Erwin 4.5, Windows 2K3.
Confidential, NJ
DataStage Developer
Responsibilities:
- Involved in design and development of data warehouse environment. Translated business processes into DataStage jobs for building data marts.
- Used different databases such as Oracle, SQL Server and Excel and flat files as source.
- Involved with Scheduling Team for scheduling the DataStage nightly batch jobs.
- Used DataStage Administrator to create repository, user groups, and managed users by setting up their privileges and profile.
- Identified source systems, their connectivity, related tables and fields and ensured data consistency for mapping.
- Implemented the Audit and logging functionality in batch jobs.
- Worked with DBAs while implementing the DataStage best practices for performance tuning.
- Used Parallel Extender for parallel processing to improve job performance while working with bulk data sources.
- Analyzed the transactional data model and data elements.
- Interacted with business analyst, SME (subject matter experts) on day-to-day basis to create technical specifications for data conversion programs.
- Developed DataStage jobs to load data into Teradata tables using the FastLoad utility.
- Created Source Table definitions in DataStage Repository by analyzing data sources.
- Designed DataStage ETL jobs using DataStage Designer to extract data from heterogeneous source systems, transform it and load it into the Data Marts.
- Created re-usable components using shared containers for local or shared use.
- Imported and exported repositories across projects.
- Created error files and log tables containing data with discrepancies to analyze and re-process the data. Created job schedules to automate the ETL process.
Environment: IBM Ascential DataStage 7.5.1/8.1 (Parallel Extender), Oracle 9i, SQL Server, SQL, PL/SQL, AIX, Teradata, Windows XP, Control-M, Unix.
Confidential, WI
DataStage Developer
Responsibilities:
- Involved in system analysis and design of the data warehouse.
- Involved in creating Functional and scope documents for the ETL process.
- Worked with the Business analysts for requirements gathering, business analysis, testing, and project coordination.
- Used Information Analyzer to provide complete analysis of source and target systems, assessing the structure, content and quality of data.
- Used the QualityStage Standardize stage to reformat data from multiple systems for effective matching and output formatting.
- Used the QualityStage Investigate stage to create input for the cleansing process.
- Involved in the design of an efficient, reliable ETL mechanism for updating the Data Warehouse, Data Marts and downstream systems.
- Cleansed, purged and optimized the data in the warehouse. Developed and implemented all ETL (Extract, Transform and Load) components based on the filter rules to obtain the needed data from different source systems for calculating required metrics using PL/SQL.
- Used Resource Registry extensively to improve system portability.
- Developed parameterized, reusable DataStage jobs that can be run as multiple instances.
- Used Parallel Jobs to split the data into subsets and process it concurrently across all available processors to improve job performance.
- Wrote SQL in DB2 for use in DataStage and for testing the data.
- Worked extensively on different types of stages like Transformer, Sorter, Aggregator, Lookup, folder, Joiner, OCI, Flat file and Oracle Enterprise.
- Created/Modified Universes by Adding objects, Tables/Derived tables with Oracle and as Backend Databases.
- Involved in performance tuning of Business Objects reports, using hints, aggregate awareness, index awareness and other optimization techniques to improve performance.
Environment: IBM InfoSphere DataStage 7.5/8.0 (Manager, Designer, Director, and Administrator), Parallel Extender, Oracle 10g, IBM DB2, IBM Netezza 100, Erwin 4.5.
Confidential, FL
DataStage Developer
Responsibilities:
- Involved in complete Data Warehouse Life Cycle from Requirements gathering to end user support.
- Involved in Designing, Testing and Supporting DataStage jobs.
- Developed Parallel jobs using various stages like Join, Merge, Lookup, Surrogate Key, SCD, Funnel, Sort, Transformer, Copy, Remove Duplicate, Filter, Pivot and Aggregator stages for grouping and summarizing on key performance indicators used in decision support systems.
- Worked in an onsite-offshore environment: assigned technical tasks, monitored the process flow, conducted status meetings and ensured business needs were met.
- Met with clients on a weekly basis to provide better services and maintain SLAs.
- Redesigned, modified existing jobs and shell scripts in production environment to fix the daily aborts.
- Used Control-M to schedule jobs by defining the required parameters and monitored the flow of jobs.
- Automated the process of generating daily and monthly status reports for the processing jobs.
- Created Teradata Stored Procedures to generate automated testing SQLs.
- Summed key performance indicators using Aggregator Stages as an aid to Decision Support Systems.
- Provided day-to-day and month-end production support for various applications like Business Intelligence Center, and Management Data Warehouse by monitoring Servers, jobs on UNIX.
- Dropped indexes, removed duplicates, rebuilt indexes and reran jobs that had failed due to incorrect source data.
Environment: IBM Ascential DataStage 8.1/7.5, IBM InfoSphere QualityStage 7.5.1, Teradata V2R5, Oracle 9i, MS SQL Server, UNIX and Windows.