Data Stage Developer Resume
Nyc, NY
PROFESSIONAL SUMMARY:
- Over 7 years of experience in IT industry.
- Over 6 years experience in using Datastage (8.1/8.0/7.5)Designing and developing jobs using Datastage Designer, Datastage Manager and Datastage Director.
- Have very good experience in ETL life cycle.
- Helping the business in the production failure scenarios.
- Extensively made use of all stages viz., Transformer, Aggregator, Sort, Join, Link Collector, and Link Partitioner in Data stage Designer. Remove duplicates, Lookup, Merge, Join, Change capture, Peek stages in Parallel Extender job and Expertise in working with Server jobs and Parallel jobs.
- Extensively made use of job sequencer in running Datastage jobs.
- Experienced in writing UNIX shell scripting and hands on experience with scheduling of shell scripts using AUTOSYS/CONTROL-M.
- Having very good experience in using the BEST PRACTICES in ETL jobs/stages.
- Experienced in Data analysis, Data modeling concepts, Database design and Data migration.
- Experienced in using the databases like DB2, Teradata, Oracle 10g/9i/8i and SQL Server.
- Strong understanding of the principles of Data Warehouse using Fact Tables, Dimension Tables, Star Schema and Snowflake Schema modeling.
- Experience in Database Design, Entity-Relationship Modeling, Dimension Modeling, Star Schema & Snowflake with Kimball methodologies
- Experienced in using XML source files.
- Extensive experience with business intelligence applications with participation in entire life cycle of projects
- Extensive experience in developing custom routines to simplify the determination of data output paths for complex business logic
- Experienced in developing custom routines to simplify the determination of data output paths for complex business logic.
- Worked on writing numerous routines and transforms for various specifications such as Date conversions, files based operations such as splitting and moving the file and string conversions.
- Experienced in using Erwin for data modeling.
- Knowledge in using Datastage version control.
- Involved in the production support.
- Application Data warehousing experience including Telecom, Retail, Insurance and banking sectors. Proven ability to implement technology based solutions for business problems.
- Excellent skills in documenting the ETL process to facilitate an understanding of the entire ETL process and incorporate changes as and when needed.
- Effective in cross-functional and global environments to manage multiple tasks and assignments concurrently with effective communication skills.
Technical skills:
ETL TOOLS
IBM Info sphere Ascential Datastage 8.1/8.0/7.5.x/7.5/6.x and Quality stage, DMEXPRESS (Syncsort).
Operating System
UNIX (AIX), Windows NT/2000/2003, XP.
Programming Languages
C, SQL, UNIX shell programming, HTML, PL/SQL.
Database
Oracle 10g/9i/8i, IBM DB2, Teradata V2R6 and SQL server 2000/2005.
Database Modeling Tools
ERWIN.
Others
Toad, AUTOSYS, Tivoli, SAP R/3, Tidal, Rapid SQL.
Educational Qualification:
Bachelors in Computers Science Engineering
PROJECT SUMMARY:
Confidential,NYC, NY September2011 – Present
Data Stage Developer
Responsibilities:
- Participated in the Team meetings to gather the business requirements and developing the Specifications.
- Involved with business analysts to understand the business requirement specifications and implemented the ETL jobs using Datastage. Deployed the solutions that maximize the consistency and usability of data.
- Self Documenting ETL Jobs to comply with Metadata Standards.
- Extensively used Datastage Designer to develop various Parallel jobs to extract, cleanse, transform, integrate and load data into Enterprise Data Warehouse tables.
- Used Datastage Director for monitoring performance statistics.
- Involved in Performance Tuning of ETL programs and Stored procedures Code.
- Worked extensively on different types of stages like Sequential file, Data set, Lookup stage, ORACLE Stage, Aggregator, Transformer, Copy, Merge, Join, Filter, Funnel, Change Capture, Change Apply Stage, and several other stages for developing Parallel Jobs.
- Designed ETL jobs to identify and remove duplicate rows using remove duplicate stage.
- Extensively worked with surrogate key generator to generate warehouse keys before loading data into warehouse.
- Used Environment Variables, Stage Variables and Routines for developing Parameter Driven Jobs.
- Involved in Design, Source to Target Mappings between sources to operational staging targets.
- Implemented logic for Slowly Changing Dimensions.
- Improved the performance of the jobs.
- Developed and Configured Jobs/ Job Control Sequence to run the ETL programs.
- Created Stored Procedures using Unix Scripts.
- Migrated Jobs from Development to QA and to Production Environments.
- Involved in production support 24 X 7.
- Unit Test Datastage Jobs in development including creating the appropriate test data.
- Developing and supporting the corporate data migration and reporting needs.
- Extensively used CONTROL-M Scheduler Tool to schedule Datastage jobs.
- Involved in the creation of mapping documents from data warehouse to data marts.
- Participated in the review of Technical, Business Transformation Requirements Document.
- Participated in discussions with Group Members and Technical Manager regarding any technical and Business Requirement issues.
Environment: IBM Info sphere Datastage 8.1, Quality Stage, DMEXPRESS, Oracle 10g, SQL, PL/SQL, SQL server, Shell Script, TOAD, Rapid SQL, ERWIN, Win NT, and UNIX (AIX), TIDAL.
Confidential,TN Dec\'10 – Oct 2011
Sr. DataStage Developer
Responsibilities:-
- Interacted with end users in finalizing the requirements and documented the Program Specifications for the ETL jobs.
- Involved in creating logical and physical Dimensional data models using Erwin.
- Involved in preparing Data Mapping Documents (DMD) and Design the ETL jobs based on the DMD with required Tables in the Dev Environment.
- Provide the staging solutions for Data Validation and Cleansing with Quality Stage and DataStage ETL jobs.
- Lead a team of developers, resolved issues, developed appropriate documentation. Interacted with offshore team every day to develop the jobs
- Designed Quality Stage Jobs in order to perform data Cleansing using Investigate Stage, Standardize Stage, Match Frequency, Survive Stage, Reference match Stage.
- Used information analyzer for column analysis, primary key analysis, foreign key analysis .
- Developed Data Stage Parallel Jobs where in using required stages, data from different sources formatted, Cleansing, summarized, aggregated and transform into data warehouse.
- Documented current SRS and future Process Flows, and Data Flows.
- Designed several parallel jobs using Sequential File, Dataset, Join, Merge, Lookup, Change Apply, Change Capture, Remove duplicates, Funnel, Filter, Copy, Column Generator, Peek, Modify, Compare, Oracle Enterprise, Surrogate Key, Aggregator, Transformer, Decode, Row Generator stages.
- Extensively worked with Join, Look up (Normal and Sparse) and Merge stages.
- Extensively used the CDC (Change Data Capture) stage to implement the slowly changing Dimensional.
- Created multiple configuration files and defined logical nodes, scratch disk, Resource scratch disk and pools.
- Extensively worked with sequential file, dataset, file set and look up file set stages.
- Used the DataStage Director for testing and debugging its components and monitoring the results.
- Parameterized DataStage jobs and also created multi-instance jobs.
- Analyzed the performance of the jobs , project and enhance the performance using standard techniques.
- Created Master Job Sequences.
- Extensively worked with Job sequence stage such as Job Activity, Email Notification and Sequencer Wait for File activities to control and execute the Data stage Parallel jobs.
- Created PL/SQL Procedures, Functions on Database tables before loading to check some validations.
- Migrated jobs from development to QA to Production environments.
- Defined UNIX -shell scripts for file watcher and file archiving process.
- Developed complex queries using different data providers in the same report.
- Extensively developed Data stage server routines using Data stage Basic Language as part of the development process.
- Performed Unit testing and Integration testing in testing environment.
Environment: IBM Info Sphere DataStage 8.1, Quality stage, Oracle 10g, DB2 UDB, Teradata, SAP R3, Main frame, UNIX(AIX) , MSOffice Suite, Erwin, MS Visio, Toad, Control Center.
Confidential,GA April2009-November2010
DatastageDeveloper
Responsibilities:
- Involved in customer financial management project, which deals mainly with the costumer’s information, invoices and payment.
- Experienced with all the new features in Datastage and migrated all the jobs from 7.5.2 to 8.0.
- Prepared the Naming standard documents, Best practice documents, Release documents and code migration steps to perform.
- Prepared Functional Specification documents, Process flow diagrams, Technical specification documents, mapping documents for Source to Target Mapping.
- Collaborated with Business analysts and the DBA for requirements gathering, business analysis designing of technical requirements document.
- Performed OLTP/OLAP system study, understanding database Schemas like Star Schema and Snowflake Schema used in relational, dimensional and multidimensional modeling.
- Involved in Design and developed Data model using Erwin.
- Designed and implemented slowly changing dimensions (SCD’s) methodologies.
- Used technical transformation document to design and build the extraction, transformation, and loading (ETL) modules.
- Imported various Application Sources (tables from Database like flat files and Teradata) into Manager.
- Extracted the data from Web applications using Web Services pack in Datastage.
- Created Data stage jobs using different stages like Transformer, Aggregator, Sort, Join, Merge, Lookup, Data Set, Funnel, Remove Duplicates, Copy, Modify, Filter, Change Data Capture, Change Apply, Sample, Surrogate Key, Column Generator, Row Generator etc.
- Used version Control for data stage to track the changes made to the data stage project components and for protecting jobs by making read only.
- Wrote SQL/PLSQL Procedures to improve the Performance while loading.
- Developed Server jobs using stages ODBC, Link Partitioner, Aggregator, Transformer, Link Collector, and Hash File etc.
- Migrated Jobs from Development to QA and to Production Environments.
- Written Executed Various UNIX kern Shell scripts before scheduling Jobs.
- Know various types of reports like Master Detail, Cross Tab, Drill Down and Linked reports to enable easy analysis.
- Involved in Creating Complex reports using User defined functions like @prompt, variable, condition.
- Extensively used Autosys Scheduler Tool to schedule Data Stage jobs.
- Documented self developed reports and universes as well as supported universe and Reports and Scheduled the Jobs.
- Analyzed the performance of the jobs and project and enhance the performance using standard techniques.
- Used Data Stage Parallel Extender parallel jobs for improving the performance of jobs.
Environment:IBM Info sphere Datastage 8.0/7.5.2(Designer, Manager, Administrator, Director, and Quality stage), SQL, Oracle 10g/9i, PL/SQL, Erwin, Windows NT, Toad, UNIX Shell scripting, Autosys,
Confidential,Boston, MA Mar 2008-March 2009
Datastage Developer
Responsibilities
- Involved in understanding business processes and coordinated with business analysts to get specific user requirements to build Data Marts.
- Modeled the Star Schema Data Marts by identifying the facts and dimensions using Erwin Data modeling tool. Involved in creating policy, premium and claim data marts.
- Developed Datastage Jobs to load the data into Teradata tables using Fast load utility.
- Worked with Datastage Director to schedule, monitor and analyze performance of individual stages and run Datastage jobs.
- Imported and exported repositories across projects using DataStage Manager.
- Set various environment variables and parameters in Datastage Administrator and also used them in managing the projects.
- Used several stages like Sequential file, Hash file, Aggregator, Funnel, Change Capture, Change Apply, Transformer, Merge, Join and Lookup during the development process of the Datastage jobs.
- Involved in the preparation of ETL documentation by following the business rule, procedures and naming conventions.
- Used shared containers for reusability and reducing job complexity.
- Responsible for unit, system and integration testing. Developed test scripts, test plan and test data. Participated in UAT (User Acceptance Testing).
- Analyzed data with discrepancies through Error files and Log files for further data processing and cleansing.
- Developed shell scripts to automate file manipulation and kicking off the Data Stage jobs.
- Migrated Jobs from Development to QA and to Production Environments.
- Extensively used Tivoli Scheduler Tool to schedule Datastage jobs.
- Participated in discussions with Group Members and Technical Manager regarding any technical and Business Requirement issues.
Environments:
Ascential Datastage 7.5 (Designer, Director, Manager), Quality Stage, Oracle 9i, MS SQL Server 2005, IBM DB2, Teradata, Oracle SQL Developer, Teradata SQL Assistant, IBM AIX, Erwin, Tivoli.