Senior Datastage Developer Resume
Albany-, NY
SUMMARY:
- 10+ years of extensive career experience in Development, Analysis and Design of ETL methodologies in all the phases of the Data Warehousing life cycle and Expert in all the stages of Software Development Life Cycle to large clients in various industries such as Healthcare, Restaurant, Education, Banking, Insurance and Storage.
- Extensive experience IBM Web Sphere 11.5/11.3/9.1/8.5/8.1.0/8.0.1 (Data stage, Quality Stage, Information Analyzer, Metadata Server) and experience in Ascential DataStage 7.5 using components like DataStage Designer, DataStage Manager, DataStage Director, DataStage Administrator and Parallel Extender
- Strong Decision Support Systems experience using Business Intelligence Tool Business Objects.
- Expert in developing DataStage jobs using star schema and snowflake schema modeling and conversant with Data Warehouse concepts like fact tables and dimension tables.
- Created and implemented common UNIX shell scripts to FTP the files from/to different servers, validate, process the Flat Files for different projects.
- Strong understanding of the principles of Data Warehouse using Fact Tables, Dimension Tables, Star Schema modeling and Snowflake Schema modeling.
- Extensive experience in using highly scalable parallel processing infrastructure using DataStage Enterprise Edition.
- Expert in writing, testing and implementing triggers, functions and procedures at database level using PL/SQL.
- Worked extensively on Erwin and ER Studio in several projects in both OLAP and OLTP applications.
- Experienced in incorporating various data sources like Oracle, MS SQL Server, DB2, Teradata, XML and flat files into the staging area.
- Experienced in troubleshooting of DataStage jobs and addressing production issues like performance tuning and enhancement.
- Having good experience in the preparation of JIL scripts to schedule the jobs through Autosys.
- Expertise in data migration projects with (target - source) same database as well as with heterogeneous databases.
- Involved in creating Test plan, UAT, Integration and System testing.
- Expert in using Standardization, Matching and Survivorship rules using Quality Stage.
- Used Quality Stage to coordinate the delivery, consistency, removing data anomalies and spelling errors of the source information.
- Strong analytical, organizational, presentation and problem-solving skills.
TECHNICAL SKILLS:
ETL Tool: IBM Web Sphere DataStage 11.5/11.3/ 9.1/8.7/8.5/8.1/8.0.1, Information Server, Quality stage, Metadata Server, Ascential DataStage 7.5, Talend Big Data Enterprise Edition 6.x/5.x
RDBMS: Oracle 11g/10g/9i/8i, DB2, Teradata, SQL Server 2000/2005, Netezza, AS400, PL SQL Scripts, Star & Snow Flake Schema.
Programming Languages: COBOL
Operating Systems: Linux/Unix, IBM AIX, Win XP
Tools: Excel, Power Point, MS VISIO, Scrum (Agile), AUTOSYS
PROFESSIONAL EXPERIENCE
Confidential, Albany- NY
Senior DataStage DeveloperResponsibilities:
- Interacted with Business Analysts to finalize the requirements and documented the technical design document for Coding.
- Developed DataStage Parallel Jobs where in using required stages, data from different sources formatted, Cleansed, summarized, aggregated and transformed and loaded into data warehouse.
- Involved in all Phases of project as a senior ETL Resource, which includes Analysis, Design, Development, coding, testing, Implementation, support and Documentation.
- Interacted with end users in finalizing the requirements and documented the Program Specifications for the ETL jobs
- Designed Data Stage Jobs to extract data from various source systems, Transformation and conversion of data, Load data to data ware house.
- Developed Sequence Jobs Using Data stage Sequencer, dependency analysis (which indicates weather Job to run sequentially or in Parallel.
- Have addressed production, UAT issues, proper action was taken accordingly based on priority and requirement.
- Created ETL jobs for different line of business Eligibility, Enrollment, Effectuation and Application.
- Created Enrollment, Coverage, Assistor, Individual New Enrollment and Comparison jobs to identify the change of activity (Cancellation, Termination, Open/Close, and Aid Categories) based selected Dimensional values.
- Associated with other team workers for implementation of data stage best practices for better performance.
- Provided support to the QA team to run all the respective objects in DEV, QA and PRD environments during scheduled Cut-over’s at various phases of the project.
- Extensively worked with DataStage Designer for developing various Parlal jobs using Sequential file, Lookup, Join, Merge, Funnel, Filter, CDC, Row, Column generators and Transformer stages.
- Automation of ETL processes using DataStage Job Sequencer, Job Control routines and Transform functions and worked on Performance tuning.
Environment: IBM InfoSphere Information server 11.5(Designer, QualityStage, Administrator, Director), DB2, Oracle, Toad, SQL Developer, Winscp, UNIX, Shell Script, Scrum, MS Visio, Autosys.
Confidential - New Albany-OH
Senior DataStage Developer
Responsibilities:
- Involved in understanding Business Process, Data Profiling and coordinated with Business Analysts to get specific user requirements to build the Data Marts.
- Involved in preparing the source to target data mapping for ETL design and development.
- Developed DataStage ETL jobs and Sequencers in the Enterprise Warehouse based on the client’s requirements.
- Used Oracle PL/SQL for writing stored procedures, packages for extracting the data.
- Discussed the Data model, data flow and data mapping with the application development team.
- Developed DataStage jobs using star schema and snowflake schema modeling and conversant with Data Warehouse concepts like fact tables and dimension tables.
- Produced initial iterations of an Enterprise Logical Data Model for the Consumer domain, enabling gap analysis for acquisition of Master Data Management software.
- Created Shared Containers to increase Object Code Reusability and to increase throughput of the system.
- Developed jobs in Parallel Extender PX using different stages like Transformer, Aggregator, Lookup, Join, Merge, Modify, Remove Duplicate, Oracle Stage, Sort, Peek, Row Generator, Column Generator, Sequential File, and Data Set.
- Performed the Unit testing for jobs, sequencers and services, developed to ensure that it meets the requirements.
- Involved in UAT, SIT, Integration Test, executed data modeling using Erwin data modeling tool.
- Highly involved in Performance enhancement of the process as the batch window to load the data was very small. Using various performance improvement techniques to achieve the target.
- Extensively worked with Job sequences using Job Activity, Email Notification, Wait for File activities to control and execute the DataStage Parallel jobs.
- Created Autosys JIL for scheduling and running DataStage jobs.
Environment: IBM InfoSphere DataStage 11.3 (Designer, Quality Stage, Director, Administrator), Netezza, Teradata, AS 400, DB2, SQL Server, Agile(Scrum), TOAD, Win SCP, UNIX- AIX, MS Office, MS VISIO, Autosys.
Confidential, -Charlotte-NC
Senior DataStage Developer
Responsibilities:
- Interacted with managers/business users to gather requirements/end user needs.
- Designed Source to Target mappings strictly per business requirements.
- Develop, test and documenting ETL Parallel Jobs, Parallel shared containers, Job sequences using DataStage parallel extender for parallel processing capabilities.
- Designed ETL jobs using DataStage 9.1 to Extract, Transform and load the data into the Oracle database.
- Designed conceptual data model based on the requirement, interacted with non-technical end users to understand the business logics.
- Shared the knowledge in DataStage EE development, planned and developed Parallel Jobs using Various Stages like Join, Lookup, Data Set, Change Capture, Filter, Funnel, Copy, Column Generator, Peek, Sequential File, Oracle Enterprise, Merge, Transformer, Aggregator, Remove Duplicates Stages.
- Designed conceptual data model based on the requirement, interacted with non-technical end users to understand the business logics.
- Created Master Job Sequencers to control sequence of Jobs using job controls.
- Enhanced the Job Performance by using proper Partitioning methods.
- Effectively used standardized stage in standardizing the source data by using the existed rule sets like name, address, area etc for multiple countries, generated the valid and invalid data reports and used Autosys to schedule the jobs.
- Integrated multiple logical data models into a single data model.
- Participated in the review of Technical, Requirements Document for the project.
- Generated Unix Scripts for Data warehouse applications and maintaining batch processing.
- Used Data Stage Director for monitoring performance statistics.
- Constructed the PL/SQL scripts to validate the data after loading process.
- Worked on Production tickets, performance tuning and enhancement.
Environment: IBM InfoSphere DataStage 9.1/8.7 (Designer, Director, Administrator, Quality Stage), Oracle 9i, Teradata, DB2, Flat Files, SQL Developer, Agile (Scrum), Star Team, PL/SQL, Autosys, Winscp, Erwin, UNIX Shell Scripting and AIX.
Confidential, Oakland, CA
Senior DataStage Developer
Responsibilities:
- Reviewed the Functional Design Specification (FDS) provided by the Business Team, Raised concerning issues present in the FDS by understanding the functionality illustrated to do the conversion.
- Extensively used IBM DataStage Designer to develop processes for extracting, cleansing, transforming, integrating and loading data into Data Marts and Involved in preparing of High Level, Low Level Design Documentation.
- Developed and loaded data warehouse tables such as dimension, fact and aggregate tables using IBM DataStage.
- Developed DataStage jobs to implement complex business rules and transformation logic using lookup, join, modify, filter, copy, aggregator, sequence stages like job activity, exception handler, execute command, wait for file activity and Used job sequences to automate the ETL process.
- Exclusively used transformer stage to do data validation, cleansing, converting as per business requirements and Created re-usable components like shared and local containers.
- Updated/Modifying the existed rule sets specifically the PAT file to handle the unhandled patterns per the ACL doc where ever it is needed.
- Implemented SCD's using CDC Stage, surrogate key generator.
- Developed simple UNIX scripts to do file transfers, file archiving and file maintenance and have good work experience with UNIX commands.
- Wrote PL/SQL statements and Stored Procedures in Oracle for extracting data as well as for writing data into tables.
- Used Quality Stage to generate the correction report like valid data report and invalid data report requested by the Business to correct the invalid data like incorrect postal codes, incorrect cities, incorrect PO Boxes, incorrect state and incorrect organization names coming from source.
- Performed the unit testing for various conditions as per data quality and business requirements.
- Responsible for performance tuning of DataStage code to increase the performance and throughput and Involved in Integration, System, User Accepting Test (UAT).
- Documented all the implementation process in detail describing the overall DataStage process and Part of the Production Support team to monitor and fix any production issues/failures and making sure that the SLA’s are being met on a timely manner.
- Created Autosys JIL for scheduling and running DataStage jobs.
Environment: IBM WebSphere DataStage 9.1/8.7 (Designer, Director, Administrator, Quality Stage), Oracle 10g, DB2, Teradata, Toad, DB2 Visualizer, Scrum(Agile), SQL Developer, Autosys, AIX UNIX,K-Shell scripts, MS Visio.
Confidential, Concord, CA
Senior DataStage Developer
Responsibilities:
- Communicated with business users and management to get business requirements and translate to ETL specifications.
- Developed Full Software Development Life Cycle (SDLC) including requirement analysis, design, development, testing, implementation and Project documentation.
- Designed Mappings between sources to operational staging targets, using Star Schema, Implemented logic for Slowly Changing Dimensions (SCD).
- Developed jobs for handling different data transformations as per specified requirements using stages like Join, Merge, Lookup, Transformer and Aggregator etc.
- Extensively used SCD, Sort, Merge, Aggregator, Peek, Dataset, Funnel and RMD Stages.
- Designed the Unit testing and integrated testing process and necessary documentation.
- Produced initial iterations of an Enterprise Logical Data Model for the Consumer domain, enabling gap analysis for acquisition of Master Data Management software.
- Experienced with QualityStage for Investigate, standardization, matching and survivorship
- Used UNIX scripts to execute jobs and used the DataStage Director for scheduling, executing and monitoring jobs.
- Used shared containers to reuse the specific business logic in various jobs to eliminate redevelopment.
- Experienced in troubleshooting of DataStage jobs and addressing production issues like performance tuning and enhancement.
- Experience with Scheduling tool Autosys for automating the ETL process.
Environment: IBM InfoSphere DataStage 8.7 (Designer, Director, Manager and Administrator), Quality Stage, Autosys, SQL Server, Agile, K-Shell Scripts, Win SCP, Oracle 10g, Toad, SQL Developer.
Confidential -Charlotte-NC
DataStage Developer
Responsibilities:
- Involved in understanding business processes and coordinated with business analysts to get specific user requirements to build Data Marts.
- Data Profiling, Designed and developed the DataStage server as well as parallel jobs for extracting, cleansing, transforming, integrating and loading data using DataStage designer.
- Developed DataStage jobs to load the data into tables using fast load utility.
- Worked with DataStage director to schedule, monitor and analyze performance of individual stages and run DataStage jobs.
- From one standard database for loans created diversified destinations for auto loans, student loans, home loans using SSIS packages.
- Set various environment variables and parameters in DataStage administrator and also used them in managing the projects.
- Used several stages like sequential file, hash file, aggregator, funnel, change capture, change apply, transformer, merge, join and lookup during the development process.
- Involved in the preparation of ETL documentation by following the business rule, procedures and naming conventions.
- Responsible for unit, system and integration testing. Development test scripts, test plan and test data. Participated in UAT (User Acceptance Testing).
- Analyzed data with discrepancies through error files, log files for further data processing and cleansing.
- Responded to user request on queries on the enterprise data warehouse and creation of reports using cognos.
- Created Autosys JIL for scheduling and running DataStage jobs.
Environment: IBM InfoSphere DataStage 8.5 (Designer, Director and Administrator), Oracle 8i/10g, IBM AIX, Agile, Flat files, Oracle 10g, TOAD, Windows XP, MS Word, Excel, Autosys.
Confidential, Sunnyvale, CA
DataStage Developer
Responsibilities:
- Involved in Data Profiling, Analysis, Requirements gathering, function/technical specification, development and testing.
- Utilized Data Stage Parallel Extender PX to optimize performance in this very large database environment.
- Involved in critical multiple instance DataStage jobs which will send the outbound files for different Lobs (line of business) at the same time and monitored the jobs accordingly.
- Scheduled the parallel jobs using DataStage Director, which is controlled by DataStage engine and for monitoring and performance statistics of each stage.
- Created Error Tables containing data with discrepancies to analyze, re-process the data.
- Extensively used Parallel Stages like Join, Transformer, Merge, Lookup, Filter, Aggregator, Modify, Copy, Sort, Funnel, Remove Duplicates, Surrogate key Generator, Row Generator, Column Generator, and Peek for development and de-bugging purposes
- Used DataStage Manager to Import and Export DataStage components and Import table definitions from source databases.
- Developed Fact Tables, Dimension Tables, Star Schema and Snowflake Schema modeling.
- Used Teradata Bulk Load stages to Load the data into Teradata Database.
- Involved in the design, development and testing of the PL/SQL stored procedures, packages and triggers for the ETL processes.
- Written SQL scripts to create and drop the indexes those are passes as parameters in the pre & post sessions and developed shell scripts for job scheduling and logging
- Involved in Integration, System, and User Accepting Testing process.
- Involved in performance tuning of SQL queries by providing hints, obtaining Explain Plan, analyzing tables and adding indexes.
- Involved in full integration test and code reviews of all jobs within each sequence before migrating the jobs and sequencers from the Development environment (Dev) to the QA environment, and from the QA environment to the Production environment.
Environment: IBM WebSphere DataStage 8.1/8.0.1, Ascential DataStage 7.5, Oracle 10g/11g, SQL, PL/SQL, SQL Loader, Teradata, UNIX Shell Script, TOAD, ERWIN, Win XP, AIX UNIX, Autosys.
Confidential, Hartford-CT
DataStage Developer
Responsibilities:
- Involved in team meetings and interacted with the business people to understand the business requirement specifications.
- Involved in creating high level and low-level design documents from business requirement specifications.
- Developed jobs for extracting, cleaning, transforming, integrating and loading data into data warehouse database using DataStage Designer.
- Imported data from transactional data source residing on Oracle database files and loaded into SQL Server 2008.
- Designed Parallel jobs using various processing stages like join, merge, remove duplicates, filter, dataset, lookup, aggregator, change capture, slowly changing dimension and funnel stages.
- Worked within a team to populate Type I and Type II slowly changing dimension (SCD) tables from several operational source files.
- Created Parameters and Environment variables to run the jobs.
- Involved in Integration, System and User Accepting Testing of jobs.
- Converted complex job designs to different job segments and executed through sequencer job for better performance and easy
- Enhanced maintenance and reusability of the jobs by making and deploying shared containers and multiple instances of the jobs.
- Involved in creating technical documentation for source to target mapping procedures to facilitate better understanding of the process and incorporate changes as and when necessary.
- Involved in the migration of DataStage jobs from development to production environment.
Environment: Ascential DataStage 7.5 Parallel Extender (Designer, Manager, Director, Administrator), IBM DataStage 8.0.1, Windows NT 4.0, PL/SQL, Oracle, MS SQL Server, UNIX, Sequential files, MS Access.
Confidential
DataStage Developer
Responsibilities:
- Managed the DataStage repository using DataStage Manager.
- Suggested various changes in the physical model to support the business requirements.
- Developed server jobs to load the data from flat files, text files and MS SQL.
- Set various parameters in DataStage Administrator for performance tuning.
- Used DataStage Designer to develop various jobs.
- Created and scheduled the job sequences by checking job dependencies.
- Wrote complex SQL queries using joins, sub queries and correlated sub queries.
- Wrote PL/SQL stored procedures, functions and packages and triggers to implement business rules into the application.
- Performed Unit Testing to check the validity of the data at each stage.
- Used DataStage Director to debug the jobs, to view the error log to check for errors.
- Implemented best practices in the development environment (code standards, code migration).
Environment: Ascential DataStage 7.5, Windows NT 4.0, Oracle 8i, UNIX, Sequential files, MS Access 2000.