Sr. Datastage Developer Resume Profile
Jersey City, NJ
Objective
Over 7 years' Experience of Data warehousing in the areas of ETL design and development. Proficient in all stages of Software Development life-cycle including System Analysis, Design, Development, Implementation, Production Support and Maintenance. Ability to work effectively and efficiently in a team and individually with excellent interpersonal technical and communicational skills.
Summary
- Experience inDatastage 9.x/8.x/7.x using components like Datastage Designer, Datastage Manager, Datastage Director, Datastage Administrator, Quality stage, Information server and Parallel Extender. Designed Mapping documents ETL architecture documents and specifications.
- Strong understanding of the principles of DW using Fact Tables, Dimension Tables, star and snowflake schema modeling.
- Involved in complete Software Development life-cycle SDLC of various projects, including requirements gathering, system designing, data modeling, ETL.
- Extensive experience in analysis and design of database including ER Diagrams and Normalization techniques.
- Experience in using different types of stages like Transformer, Aggregator, Merge, Join, Lookup, sort, Remove duplicate, Funnel, Filter, Pivot, Shared containers for developing jobs.
- Strong knowledge on SQL, Stored Procedures, Functions, Triggers.
- Created local and shared containers to facilitate ease and reuse of jobs.
- Diverse knowledge of data dependencies using Metadata of DataStage and preparing job sequences for existing jobs to facilitate scheduling of multiple jobs.
- Good Knowledge about the principles of DW likeData marts, OLTP, OLAP, Dimensional Modeling, fact tables, dimension tables and star/snowflake schema modeling.
- Expert in Data Warehousing techniques for Data Cleansing, Slowly Changing Dimension phenomenon SCD , surrogate key assignment and CDC ChangeDataCapture .
- Expert in unit testing, system integration testing, implementation, maintenance and performance.
- Helped functional team in writing up of source to target mapping documents.
- Strong knowledge of Relational Database Management Systems RDBMS concept in several databases like SQL Server 2008/2005 and Oracle 10g/9i/8i.
- Worked with UNIX shellscripting for enhancing the job performance.
- Worked on scheduling tools like Auto Sys, Control M and Tivoli to schedule Datastage jobs.
- Upgraded datastage jobs from server to parallel.
- Excellent communication, interpersonal, analytical skills and strong ability to perform as part of a team.
Technical Skills:
ETL Tools | ETL IBM InfosphereDatastage9.1/8.5/8.1/8.0.1/7.5.2/7.1 Administrator, Manager, Designer, Director, Parallel Extender/Orchestrate, Information Server, Quality Stage ,Information Analyzer, Information Stage, IBM Web Sphere Application Server |
Database | Oracle 11g/10g/9i/8i, IBM DB2/UDB, Teradata, SQL Server 2008, MS Access ,TOAD, SQL Navigator, WinSQL, SQL Gate |
Data Warehousing | OLTP, OLAP, Star Snow-Flake schema Modeling, Fact and Dimensions, Physical and Logical Data Modeling, Erwin, SSAS. |
Operating systems | Windows 7x/NT/XP, UNIX, LINUX, MS-DOS |
Languages/Scripting | C, C , C , Visual Basic, PL/SQL, UNIX Shell scripts |
Others | MS office, FTP, SFTP, Putty, SQL server Business Intelligence Development studio |
Scheduling Tools | Control M, Auto Sys and Tivoli |
Professional Experience:
Confidential
Sr. Datastage developer
Responsibilities:
- Designed the ETL jobs using IBM InfosphereDataStage 9.1 to Extract, Transform and load the data into staging, ODS and EDW.
- Designed and developed the ETL jobs using Parallel Edition which distributed the incoming data concurrently across all the processors, to achieve the best performance.
- Designed parallel jobs using stages such as Join, Merge, Lookup, Remove Duplicates, Copy, Filter, Funnel, Dataset, Lookup, Pivot, and Sort, Surrogate key Generator, Change Data Capture CDC , Modify, Row Generator and Aggregator.
- Responsible for generation of DDL statements which are executed for database creation.
- Responsible for preparing Physical/logical data models .
- Responsible for data analysis, requirements gathering, report analysis, source-to-target mapping, frequency analysis, process flow diagrams, and documentation.
- Handled Performance Tuning of Jobs to ensure faster Data Loads.
- Designed sequence jobs using the activities such as Job Activity, Nested Condition, Notification Activity, Sequencer Activity, Terminator Activity and Execute Command.
- Performed the Integration and System testing on the ETL jobs.
- Responsible for preparing ad hoc jobs.
- Translated business rules and functionality requirements into ETL derivations.
- Scheduled jobs using Autosys scheduler utility based on the requirements and monitored the production processes closely for any possible errors.
- Migrated projects from development to QA to Production environments.
- Assisted operation support team for transactional data loads in developing SQL UNIX scripts.
- Imported the required Metadata from heterogeneous sources at the process level.
- Created Job Parameters and Environmentvariables to run the same job for different sources and targets.
- Used Multi-job-compiler during deployment of projects.
- Created Batches DS job controls and Sequences to control set of jobs.
- Used the Director to schedule running the job, testing and debugging its components, and monitoring.
- Created Shared Containers for Re-using the Business functionality.
- Collaborated with BO team to design Crystal reporting and reports for enterprise reporting applications.
- Worked with Developers to troubleshootand resolve issues in job logic as well as performance.
Environment: IBM WebSphere DataStage 9.1 Administrator, Designer, Director , IBM Information Analyzer 8.1.1a, Microsoft SQL 2005/2008, IBM DB2 9.1, AIX6.0, Oracle 11g, Toad 9.5, Java, MS-Access, shell scripts, PUTTY, WinSCP, ERwin R8.1, Mercury Quality Center, IBM Rational Rose, Autosys.
Confidential
Role: Sr. DataStage Developer
Responsibilities:
- Involved in various roles of Administrator and Developer throughout the project.
- Conducted the training sessions for other ETL developers on the best practices as well as performance improvement techniques.
- Managed analysis, design, coding and testing of ETL jobs for 7 Source Systems.
- Involved in implementing the Best practices and design standards. The Best practices include Restart-ability, Recovery, Parameter standardization and Capacity planning, etc.
- Participated in the review of Technical, Business Transformation Requirements Document.
- Prepared documentation to describe process development, logic, coding, testing, changes and corrections.
- Used Partition methods and collection methods for implementing parallel processing.
- Developed complex DataStage jobs according to the business requirements / mapping documents.
- Performed Unit Testing, System Integration Testing and User acceptance testing.
- Extensively Designed local containers and shared containers to simplify and modularize job design by replacing complex logics with single container stage and also to promote reusability of job designs.
- Involved in importing and exporting jobs category wise and maintaining the backupregularly.
- Used designer and director to schedules and monitor jobs and to collect the performance statistics.
- Worked within a team to populate Type I and Type II slowly changing dimension tables from several operational source files Created some routines Before-After, Transform function used across the project.
- Involved in creating UNIX shell scripts for database connectivity and executing queries in parallel job execution.
- Involved in the upgrade of 7.5 version to 8.0 version Server to parallel upgrade .
- Responsible to tune ETL processes to optimize load and query performance.
- Created standards document, best practices guide and performance tuning techniques documents.
Environment:Windows 2008, IBM Web sphere 8.1/8.0/7.5/7.0 Administrator Client, Manager, Designer Client, Director Client , Information Server, Teradata V2 R5, Oracle 8/8i/9i/10g, SQL Server 2008R2, shell scripting, AIX/UNIX.
Confidential
DataStage Developer
Responsibilities:
- Working with both onsite and offshore team of ETL Developers and ETL Analyst.
- Involving with business users for identifying, prioritizing resolving numerous data issues, create ETL Project Plans, design and development
- Handling both development and support teams.
- Creating new batches that utilized parallelism better Multi Instance .
- Simplifying the environment by removing redundant objects.
- Automated the processes to reduce the manual efforts.
- Involved in data profiling and data model.
- Developed a complex SQL code to use in the common code approach to avoid writing several SQL scripts for different source table/file formats.
- Preparation of technical specification for the development of Extraction, Transformation and Loading ETL jobs to load data into various tables in Data marts.
- Worked with DataStage server stages like OCI, ODBC, Transformer, Hash file, Sequential file, Aggregator, Sort, Merge, Link Partitioner, Link collector, IPC and other stages.
- Implemented and hardcoded high performance DataStage routines.
- Imported the required Metadata from heterogeneous sources at the project level.
- Used Parallel stages like Lookup, Join and Merge stages for joining various information and also used Parallel Transformer, Column Generator, Funnel, Filter, Switch, Modify, Pivot, Row Generator.
- Used Parallel Extender Development/Debugging stages like Row generator, Column Generator, Head, Tail and Peek for debugging of the jobs.
- Performed debugging on these jobs using Peek stage by outputting the data to Job Log or a stage.
- Developed Job Sequencer to execute jobs in proper sequence. Also automated email messaging was implemented using Sequencer to notify the operations team of any data load issues such as job failure, dropped rows, rejected rows etc.
Environment: Windows 2008, IBM Data stage 8.5 Server/parallel/8.1/8.0/7.5/7.0 Administrator Client, Manager, Designer Client, Director Client , Parallel Extender, Information Analyzer IA , Oracle 11g, SQL Server 2008R2, SQL,and AIX/UNIX.
Confidential
Role: DataStage Developer
Responsibilities:
- Involved in various roles of Administrator and Developer throughout the project.
- Conducted the training sessions for other ETL developers on the best practices as well as performance improvement techniques.
- Managed analysis, design, coding and testing of ETL jobs for 7 Source Systems.
- Involved in implementing the Best practices and design standards. The Best practices include Restart-ability, Recovery, Parameter standardization and Capacity planning, etc.
- Participated in the review of Technical, Business Transformation Requirements Document.
- Prepared documentation to describe process development, logic, coding, testing, changes and corrections.
- Used Partition methods and collection methods for implementing parallel processing.
- Developed complex DataStage jobs according to the business requirements / mapping documents.
- Performed Unit Testing, System Integration Testing and User acceptance testing.
- Extensively Designed local containers and shared containers to simplify and modularize job design by replacing complex logics with single container stage and also to promote reusability of job designs.
- Involved in importing and exporting jobs category wise and maintaining the backupregularly.
- Used designer and director to schedules and monitor jobs and to collect the performance statistics.
- Worked within a team to populate Type I and Type II slowly changing dimension tables from several operational source files Created some routines Before-After, Transform function used across the project.
- Involved in creating UNIX shell scripts for database connectivity and executing queries in parallel job execution.
- Responsible to tune ETL processes to optimize load and query performance.
- Created standards document, best practices guide and performance tuning techniques documents.
Environment: IBM WebSphere DataStageEnterprise Edition 7.5.2, DataStage Server Edition 7.5, ProfileStage 7.5, Oracle 10g, Teradata, Cognos , Toad 8.0, Microsoft SQL 2005, IBM 2094 - System z9, AIX, Java, Star Team, WinSCP FTP , SSIS 2005, Putty, Windows 2003 , Zeke, MS Visio, SAP MDM, Mercury Center.
Confidential
ETL Developer
Responsibilities:
- Involved in support for the existing ECLDW ECL Data ware house .
- Monitoring and resolving data warehouse ETL production issues.
- Worked in analyzing the issue, trace the problem area, suggest a solution, discuss with business.
- Implement the solution with extensive documentation.
- Understanding the functional requirement specifications and design documents.
- Analyzing the source and target flat forms.
- Designed Data Stage ETL jobs for extracting data from heterogeneous source systems, transform,
- Identify source systems, their connectivity, related tables and fields and ensure data suitably for jobs.
- Developed jobs according to ETL Specification to load test data.
- Involved in creating reusable components such as Shared Containers and Local Containers
- Used most of the transformations such as the Transformer, Sequential File, Joiner, Lookup and Aggregator etc.
- Involved in support activities after the implementation Ted extensive documentations for the production support process improvements.
Environment:Windows 2003,Ascential Data stage 7.5.1 Parallel , Oracle 9i.