ETL Developer Resume Profile
NX
PROFESSIONAL PROFILE
DataStage Developer with 8 years of experience in information technology, covering the design, development, administration and implementation of database and data warehouse technologies with IBM DataStage v9.x/8.x/7.x, using components such as Administrator, Manager, Designer and Director. 5 years of experience writing complex SQL queries and PL/SQL, including stored procedures, functions and triggers that implement business rules and validations in Microsoft SQL Server and Oracle 11g/10g/9i.
PROFESSIONAL SKILLS
- Extensive ETL tool experience using IBM InfoSphere/WebSphere DataStage, Ascential DataStage and SSIS. Worked on DataStage client tools like DataStage Designer, DataStage Director and DataStage Administrator.
- Good knowledge of data warehousing principles such as data marts, OLTP, OLAP, dimensional modeling, fact tables, dimension tables and star/snowflake schema modeling.
- Skilled in using highly scalable parallel processing infrastructure through parallel jobs with multi-node configuration files.
- Experienced in scheduling sequence, parallel and server jobs using DataStage Director, UNIX scripts and scheduling tools.
- Designed and developed parallel jobs, server and sequence jobs using DataStage Designer.
- Experience using Stages such as Transformer, Aggregator, Merge, Join, Lookup, Sort, Remove Duplicates, Funnel, Filter, Pivot and shared containers to develop jobs.
- Extracted data from various sources such as Oracle, MS SQL Server, MS Access, Teradata, DB2, XML and flat files.
- Knowledge of PL/SQL for writing stored procedures, functions and triggers.
- Extensive experience in Unit Testing, Functional Testing, System Testing, Integration Testing, Regression Testing, User Acceptance Testing and Performance Testing.
- Created local and shared containers to simplify job design and promote reuse.
- Proven track record in addressing production issues like performance tuning, enhancement and memory issues.
- Imported the required Metadata from heterogeneous sources at the project level.
- Knowledge of Erwin as a data modeling tool for logical (LDM) and physical (PDM) data models.
- Knowledge of Business Intelligence reporting tools such as QlikView and Cognos.
- Used DataStage Director and its run-time engine to schedule runs of the solution, test and debug its components, and monitor the resulting executable versions on an ad hoc or scheduled basis.
- Extensive experience in production support and in resolving production issues.
- Good knowledge of Business Intelligence Development Studio.
- Extensively used BI Integration Services to design ETL processes and BI Analysis Services to create cubes.
- Quick learner and adaptive to new and challenging technological environments.
- Project Management experience with excellent problem-solving, organization and leadership skills.
TECHNICAL SKILLS
ETL Tools | IBM InfoSphere/WebSphere DataStage 8.5/8.1/7.5.2/7.1 (Administrator, Manager, Designer, Director), Orchestrate, Information Analyzer, QualityStage |
Operating Systems | IBM AIX 4.3, Windows NT 4.0/2000/2003/XP, Windows 7 |
Languages | PL/SQL, C/C++, UNIX (AIX, HP-UX) shell scripting, Java |
Database | UDB/DB2 on AIX, Oracle 11g/10g/9i/8i, SQL Server 2000/2005/2008, Teradata |
Other | Cyberfusion, SQL Assistant, Pattern Action Language (PAL), Perl scripting, Control-M, AutoSys, SAP MDM 5.5/7.1, Management Console, DataStage troubleshooting, data issue resolution, batch monitoring, QlikView |
WORK EXPERIENCE
Confidential
Role: Sr. Developer
- Expertise in designing and implementing DataStage Architecture in data warehousing and Business Intelligence projects.
- Documented the ETL phase of the project.
- Provided support for monthly and weekly batches in production runs.
- Automated the job monitoring process, which minimized manual intervention, and documented it thoroughly.
- Aware of the functional and business aspects of the components.
- Worked on change requests according to client needs and project technical specifications.
- Involved in quality assurance, including unit testing and integration testing of jobs and of the system process flow.
- Analysis and design.
- Working in a team with other associate product component developers.
- Created DataStage jobs (ETL process) to continuously populate the data warehouse from source systems such as ODS and flat files, and scheduled them using DataStage Sequencer for SI testing.
- Worked on the Architecture of ETL process.
- Database design and data modeling (logical and physical design), with hands-on experience in DDL and DML SQL operations.
- Involved in the Analysis of the functional side of the project by interacting with functional experts to design and write technical specifications.
- Worked with Functional team and Data Modelers/Architects to identify and understand the data from different source systems.
- Developed reusable components and best practices that were later used in other data warehouses.
Environment: DataStage 9.x, DB2, Oracle 11g, UNIX, Microsoft Visio, Control-M, SQL*Plus, WinCVS.
Confidential
Role: Sr. DataStage developer
- Worked with both onsite and offshore teams of ETL developers and ETL analysts.
- Worked with business users to identify, prioritize and resolve numerous data issues, create ETL project plans, and carry out design and development.
- Handled both development and support teams.
- Created new batches that made better use of parallelism (multi-instance).
- Simplified the environment by removing redundant objects.
- Automated processes to reduce manual effort.
- Involved in data profiling and data modeling.
- Developed complex SQL code for a common-code approach, avoiding the need to write separate SQL scripts for different source table and file formats.
- Prepared technical specifications for the development of extraction, transformation and loading (ETL) jobs that load data into various data mart tables.
- Worked with DataStage server Stages such as OCI, ODBC, Transformer, Hashed File, Sequential File, Aggregator, Sort, Merge, Link Partitioner, Link Collector and IPC.
- Implemented hand-coded, high-performance DataStage routines.
- Imported the required Metadata from heterogeneous sources at the project level.
- Used parallel Stages such as Lookup, Join and Merge to combine data, along with Parallel Transformer, Column Generator, Funnel, Filter, Switch, Modify, Pivot and Row Generator.
- Used Parallel Extender development/debugging Stages such as Row Generator, Column Generator, Head, Tail and Peek to debug jobs.
- Debugged these jobs using the Peek Stage, directing its output to the job log or to a Stage.
- Developed Job Sequencers to execute jobs in the proper sequence, and used the Sequencer to automate email notifications to the operations team about data load issues such as job failures, dropped rows and rejected rows.
Environment: Windows 2008, IBM DataStage 8.5/8.1/8.0/7.5/7.0 Server/Parallel (Administrator Client, Manager, Designer Client, Director Client), Parallel Extender, Information Analyzer (IA), Oracle 11g, SQL Server 2008 R2, SQL and AIX/UNIX.
Confidential
Role: ETL developer
Responsibilities:
- Member of the design team and the production support team for the migration project.
- Member of the design team for the star schema of the data warehouse project.
- Interacted with end users and customers to create mapping documents.
- Created mapping documents for the migration project.
- Performed extensive business analysis of the source system and consulted business groups to understand the reporting requirements.
- Designed the mapping documents between source databases and target databases.
- Designed and developed the Customer mart and Sales mart from data in the centralized data warehouse, using a top-down approach.
- Worked on critical OCCURS and REDEFINES clauses in complex flat file structures.
- Performed data analysis, quality analysis and data loading.
- Developed processes for extracting, cleansing, transforming, integrating and loading data into databases.
- Created extract processes, analyzed the data and wrote DB2 code to pull the required data.
- Developed many DataStage server jobs for data processing and loading of data.
- Developed load jobs for Oracle and DB2 databases.
- Used TOAD for analysis.
- Used AutoSys to schedule jobs.
Environment: IBM InfoSphere DataStage 8.5 (Manager, Designer, Director, Administrator), Oracle 10g, DB2 UDB, SQL, PL/SQL, Lotus Notes, Toad, SQL*Loader, UNIX, AutoSys, Windows XP/NT, Erwin 3.5
Confidential
Role: DataStage Developer
Responsibilities:
- Served in both Administrator and Developer roles throughout the project.
- Conducted training sessions for other ETL developers on best practices and performance improvement techniques.
- Managed analysis, design, coding and testing of ETL jobs for 7 Source Systems.
- Helped implement best practices and design standards, including restartability, recovery, parameter standardization and capacity planning.
- Participated in reviews of the technical and business transformation requirements documents.
- Prepared documentation to describe process development, logic, coding, testing, changes and corrections.
- Used partitioning and collection methods to implement parallel processing.
- Developed complex DataStage jobs according to the business requirements / mapping documents.
- Performed Unit Testing, System Integration Testing and User acceptance testing.
- Designed local and shared containers extensively to simplify and modularize job design, replacing complex logic with a single container Stage and promoting reuse of job designs.
- Imported and exported jobs by category and maintained regular backups.
- Used Designer and Director to schedule and monitor jobs and to collect performance statistics.
- Worked within a team to populate Type I and Type II slowly changing dimension tables from several operational source files. Created routines (before/after subroutines and transform functions) used across the project.
- Created UNIX shell scripts for database connectivity and for executing queries during parallel job execution.
- Tuned ETL processes to optimize load and query performance.
- Created a standards document, a best practices guide and performance tuning documentation.
Environment: IBM WebSphere DataStage Enterprise Edition 7.5.2, DataStage Server Edition 7.5, ProfileStage 7.5, Oracle 10g, Teradata, Cognos, Toad 8.0, Microsoft SQL Server 2005, IBM 2094 System z9, AIX, Java, StarTeam, WinSCP (FTP), SSIS 2005, PuTTY, Windows 2003, Zeke, MS Visio, SAP MDM, Mercury Center.
Confidential
Role: DataStage ETL Analyst
- Designed and wrote the technical specifications (source-to-target mappings) for the ETL mappings, along with the unit test scripts.
- Involved in migrating DataStage projects and jobs from earlier versions to IBM InfoSphere 8.0.1.
- Used AutoSys to schedule jobs and e-mailed the status of ETL jobs to operations team daily.
- Used IBM InfoSphere Federation Server to incorporate data from multiple data sources into reports and analytics with a single query.
- Used Director Client to validate, run, schedule and monitor the jobs that are run by IBM InfoSphere DataStage server.
- Used DataStage Designer to develop parallel jobs to extract, cleanse, transform, integrate and load data into Data Warehouse.
- Used DataStage Director to schedule, monitor and analyze DataStage jobs.
- Developed jobs in Ascential Parallel Extender (PX) using Stages such as Transformer, Aggregator, Lookup, Join, Merge, Modify, Remove Duplicates, Oracle, Sort, Peek, Row Generator, Column Generator, Sequential File and Data Set.
- Designed DataStage sequences to specify Job execution order.
- Loaded data into staging area and then into Data Marts.
- Worked as a DataStage administrator, performing routine administrative tasks.
- Imported and exported repositories across DataStage projects using DataStage Manager.
- Unit tested DataStage Jobs in development including creating the appropriate test data.
Environment: IBM InfoSphere 8.0.1 (latest version of DataStage), Ascential DataStage 7.5.2, Erwin, Oracle 11g, DB2, PL/SQL, Toad, Solaris and Windows XP, AutoSys
Confidential
Role: Data Warehouse Consultant
Responsibilities:
- Analyzed, conceptualized and designed the database that provides critical business metrics.
- Developed ETL procedures that ensure conformity, compliance with standards and lack of redundancy, translating business rules and functional requirements into ETL procedures using Informatica PowerMart.
- Worked with the ERwin tool for data modeling (both physical and logical design).
- Developed and documented data mappings and transformations, audit procedures and Informatica sessions.
- Assisted in the design and Maintenance of the Metadata environment.
- Developed and tested all the backend programs, Informatica mappings and update processes.
- Effectively managed the migration of the transformations/mappings from development to Production.
- Developed various bulk load and update procedures and processes using SQL*Loader and PL/SQL.
- Involved in error checking and testing of the ETL procedures and programs using the Informatica session log.