Senior Datastage/etl Developer Resume
Sfo, CA
PROFESSIONAL SUMMARY:
- Over 7 years of experience in system analysis, design, development and implementation of Relational Database and Data Warehousing systems using IBM DataStage.
- Experienced in designing, developing, documenting, and testing of ETL jobs and mappings in Server and Parallel jobs using DataStage 8.0.1/7.x/6.x/5.x (Information Server, WebSphere, Ascential DataStage) to populate tables in Data Warehouse and Data marts.
- Excellent skills in SQL, PL/SQL, Stored Procedures, Triggers, Debugging, Troubleshooting and Performance tuning.
- Good understanding of OOPS concepts.
- Proficient in developing strategies for Extraction, Transformation and Loading (ETL) mechanism.
- Expert in designing parallel jobs using various stages like Transformer, Join, Merge, Lookup, Remove duplicates, Filter, Dataset, Complex flat file, Modify, Aggregator, XML.
- Expert in designing Server jobs using various types of stages like Sequential file, ODBC, Hashed file, Aggregator, Transformer, Sort, Link Partitioner and Link Collector.
- Experienced in integration of various data sources (DB2 - UDB, SQL Server, Sybase, Oracle, Teradata, XML and MS-Access) into data staging area.
- Proficient in data warehousing techniques for data cleansing, Slowly Changing Dimensions, Surrogate Key assignment and CDC (Change Data Capture).
- Expert in working with DataStage Designer and Director.
- Experienced with TOAD.
- Proficient in creating XML and XSD.
- Excellent knowledge of studying the data dependencies using metadata stored in the repository and prepared batches for the existing sessions to facilitate scheduling of multiple sessions.
- Proven track record in troubleshooting of DataStage jobs and addressing production issues like performance tuning and enhancement.
- Expert in working on various operating systems like UNIX AIX 5.2/5.1and Windows.
- Experienced in Database programming for Data Warehouses (Schemas), proficient in dimensional modeling (Star Schema modeling, and Snowflake modeling).
- Knowledge in Master Data Management.
- Expert in unit testing, system integration testing, implementation and maintenance of databases jobs.
- Effective in cross-functional and global environments to manage multiple tasks and assignments concurrently with effective communication skills.
- Strong understanding of business processes with excellent written and documentation skills for management and development.
TECHNICAL SKILLS
Skillset: IBM Information Server Datastage 8.5, Ascential Datastage 7.5/6.x/5.x/7.x (Datastage Designer, Datastage Manager, Datastage Director, Datastage Administrator), Ascential Datastage Parallel Extender, SQL * PLUS, PL/SQL, XML, Oracle 10g/9i/8.x/7.x, SQL server 2000, 2005, 2008, MS Access 7.0/2000, DB2, TeraData Query Man, Oracle ERP, Erwin, MS Visio, Toad, shell scripting, Perl Scripting, HTML/CSS, Java, C/C++, UNIX, Linux, Sun Solaris v8.0, HP-UX v11.0, Windows 9x/NT/2000/XP/2000 advanced server, Data modeling (Star Schema, Snow Flake Schema), SAP R/3, SAP BW, SAP ECC, LSMW.
PROFESSIONAL EXPERIENCE
Confidential, SFO, CA
Senior DataStage/ETL Developer
Hardware/Software:DataStage 8.0/8.5 Enterprise Edition/Parallel Extender (DataStage, Quality Stage), SQL *Loader, Control M 7.0, IBM DB2 7.2, SQL Server 2008, Oracle 10g, Sequential files, Flat files, SQL, PL/SQL, Shell scripts, UNIX, Windows XP.
Responsibilities:
- Actively participated in the Team meetings to gather the business requirements from users and translated them into technical specifications.
- Designed Source-to-Target mapping document to link between different source systems and data warehouse for loading data into Oracle tables.
- Extracted data from source systems Oracle, SQL Server and flat files.
- Extensively used Built-in stages, for building server and parallel jobs to extract, transform and load the data.
- Created DataStage parallel jobs and job sequences with parameters and environment variables.
- Extensively used Join, Merge, Lookup, Aggregator, Modify, Remove Duplicates, Filter and Funnel stages.
- Developed incremental and full load data jobs using change capture stage, which were scheduled to run on daily, weekly or monthly basis.
- Developed jobs to handle Slowly Changing Dimensions.
- Created XML, XSD files from Oracle tables.
- Wrote SQL queries, stored procedures, functions and packages.
- Worked on UNIX environment and created shell scripts to read parameters from files for invoking DataStage jobs.
- Performed regular backups of the developed jobs and migrated the jobs between various environments using DataStage Export/Import utility.
- Documented the changes made to the jobs and the test results.
- Analyzed data with discrepancies through DataStage Director using error log for further data processing and cleansing.
- Performed troubleshooting and performance tuning for enhancement of DataStage jobs.
Confidential, New York, NY
ETL Developer
Responsibilities:
- Actively participated in the Team meetings to understand the business requirements from business users.
- Developed complex SQL to create views to support microstrategy reports.
- Collaborated with Admin team to create required users and privileges.
- Used frameworks to ETL data from transaction databases to Staging and Source layers area. These were in turn used to build the data mart for reporting purposes.
- Wrote stored procedures to dynamically refresh materialized views with updated data.
- Worked on UNIX environment and created shell scripts to read parameters from files and to use the PLSQL procedures to refresh the materialized views.
- Designed technical documents for all the developed SQL Queries, PLSQL Procedure, Unix Scripts.
- Involved in impact analysis.
- Helped the Testing team in understanding and testing the code to deliver the jobs on time.
- Documented the changes made to the jobs and the test results.
- Performed troubleshooting and performance tuning for enhancement of DataStage jobs.
- Prepared feature lists, technical documentation for source to target mapping procedures to facilitate better understanding of the process and incorporate changes as and when necessary.
- Used Information Analyzer for Column Analysis, Primary Key Analysis and Foreign Key Analysis.
- Extensively worked on DataStage jobs for splitting bulk data into subsets and to dynamically distribute to all available processors to achieve best job performance.
- Converted complex job designs to different job segments and executed through job sequencer for better performance and easy maintenance.
- Enhanced the reusability of the jobs by making and deploying multiple instances of the jobs.
- Imported the data residing in the host systems into the data mart developed in Oracle 10g.
- Extensively used Control-m for automation of scheduling for UNIX shell script jobs on daily, weekly monthly basis with proper dependencies.
- Wrote complex SQL queries using joins, sub queries and correlated sub queries
- Used Oracle PL/SQL for writing stored procedures, packages for loading and validating the data.
- Performed Unit testing and System Integration testing by developing and documenting test cases in Quality Center.
- Involved in reporting services using business objects.
- Worked on troubleshooting, performance tuning and performances monitoring for enhancement of DataStage jobs and builds across Dev, QA and PROD environments.
Confidential NJ
Senior DataStage Developer
Hardware/Software:IBM Web sphere, Ascential Data Stage 7.5(Parallel Extender & Server), Erwin 7.0, SQL* Loader, Oracle 9i,MS Access 2003, Teradata V2R6/V2R5, SQL Server 2000, AIX UNIX, SQL, PL/SQL, XML, Windows XP.
Responsibilities:
- Interacted with the business users to gather the requirements
- Involved in design, source to target mappings between sources to operational staging targets
- Extracted the source data from Oracle tables, MS SQL Server, sequential files and excel sheets.
- Designed and developed jobs using DataStage Designer as per the mapping specifications using appropriate stages.
- Used the DataStage Designer to develop processes for extracting, cleansing, transforming, integrating and loading data into the Data Warehouse.
- Developed job sequencesto execute a set of jobswith restartability, checkpoints and implemented proper failure actions.
- Imported metadata from repository, created new job categories, routines and data elements.
- Involved in unit, regression, performance and integration testing of DataStage jobs.
- Used DataStage Director to run, debug and monitor the jobs for performance statistics.
- Used Erwin for data modeling (i.e. modifying the staging and SQL scripts on Oracle and MS Access Environments).
- Wrote SQL queries, stored procedures, functions and packages.
- Wrote UNIX shell scripts to read parameters from files for invoking DataStage jobs.
- Wrote shell scripts to create load ready files from external sources.
- Extensively used Autosys for automation of scheduling for UNIX shell script jobs on daily, weekly monthly basis with proper dependencies.
Confidential
DataStage Developer
Hardware/Software:Ascential DataStage 7.5, MetaStage (Enterprise Metadata directory), (Designer, Director, Manager, Parallel Extender, Data Integration) Debugger, DataStage, Windows 2000, Oracle 9i, SQL Server 2005, Oracle ERP, IBM DB2 7.2, UNIX Sequential Files, CSV Files, UNIX Shell Scripts, SQL, PL/SQL.
Responsibilities:
- Involved in various phases of the Data warehousing life cycle - logical and physical design, ETL process, performance tuning, and maintenance.
- Designed various Server jobs based on the business logics that run daily, weekly and monthly basis.
- Generated surrogate keys avoided the DS Sequence to increment for each record passing through the transformer.
- Developed various jobs using DataStage stages like ODBC, Hashed file, Aggregator, Sequential file.
- Created shared containers to use in multiple jobs.
- Extracted data from flat file, performed complex joins lookup and filter transformations to extract transform and load data.
- Tuned various server jobs and mappings for optimum performance.
- Extensively worked on the Database Triggers, Stored Procedures, Functions and Database Constraints using
- Used TOAD to meet the project requirement and write SQL queries.
- Documented the Purpose of mapping so as to facilitate the personnel to understand the process and incorporate the changes as and when necessary.