Senior ETL/DataStage Lead/Consultant Resume
Charlotte, NC
SUMMARY:
- Dynamic, results-oriented IBM Information Server/DataStage SME/Consultant/Developer with 9+ years of experience in implementing full life cycle data warehousing projects
- Strong knowledge of Star and Snowflake schemas; involved in designing DW concepts like conformed dimensions, conformed facts and Data Warehouse bus architecture for ETL development
- Strong working experience in the Data Analysis, Design, Development, Implementation and Testing of Data Warehousing projects
- Experience working closely with Data Modelers and Architects, exchanging feedback to improve the Data Model so that it supports the structures needed for ETL development
- 9+ years of experience in Analysis/Design/Development/Implementation of ETL processes for Data Warehousing/Data Mart using DataStage with multiple databases
- Experience in designing the DataStage Master Design Document for the project and Detail Design Document for a release to guide ETL (Extract, Transform and Load) development
- Expert in establishing DataStage patterns for the project/release in order to allow maximum reusability of the DataStage modules and processes
- Expert knowledge in DataStage best practices and in designing jobs for optimal performance, reusability and restartability
- Expert in configuring database connections for Oracle, DB2, Sybase and SQL Server from DataStage
- Expert in setting up parallel configuration files depending upon Client’s hardware architecture
- Strong experience in making DataStage processes as repeatable as possible, resulting in significantly fewer man-hours, a lower defect ratio, faster delivery of tasks and, as a result, greater cost effectiveness
- Strong and proven experience in making DataStage ETL processes function as a well-oiled machine, with less development time required as the project progresses
- Strong working experience in mentoring, training, guiding and leading teams of three to four DataStage developers, both experienced and beginners, onsite and offsite
- Data Warehousing implementation experience using DataStage PX (DataStage Designer, DataStage Director, DataStage Manager, DataStage Administrator), DataStage Information Server
- Expertise in implementing DataStage PX, Sequencer, Shared Container jobs
- Strong experience in coding using SQL, PL/SQL Procedures/Functions, Triggers and Packages
- Strong experience in Unix scripting and running DataStage jobs using Unix scripts
- Strong working experience in designing DataStage job scheduling outside the DataStage tool and also within the tool as required by Client/Customer company standards
- Expert in configuring and setting up DataStage PX on AIX, DB2 UDB, Teradata, Oracle and other environments
- Expert in maintaining Metadata at the database level using a combination of DS PX jobs
- Expert in designing the DataStage configuration files for optimal performance and best resource usage
- Worked extensively on different types of stages like Filter, Join, Merge, Lookup, Funnel, Aggregator, Transformer, Sort, CDC stages, Surrogate Key Generator, Modify, Remove Duplicates, Sequential File, Dataset, File Set, Copy, Head, Tail, Connector stages for Databases (Teradata, Oracle, DB2), ODBC Connector, Real Time, Shared Containers, Column Import, Column Export, Switch, MQ Connector for developing jobs.
- Worked on several Real Time Stages such as MQ, XML input and XML output etc.
- Worked extensively on sequence Stages - Job Activity, Execute Command, EndLoop Activity, Exception Handler, Nested Condition, Notification Activity, Routine Activity, Sequencer, StartLoop Activity, Terminator Activity, User Variables Activity
- Worked with IBM Information Analyzer for column analysis, primary key analysis, foreign key analysis and cross-domain analysis
- Installed and configured IBM Information Server.
- Performed administration tasks like role setup, connectivity, security, configuration of projects and user registries, creating or updating users and groups, mapping credentials, and configuring DSN and TNS entries and parallel, compiler, reporting, operator and user-defined environment variables.
- Configuration of XMETA repository, Node Agents and DS Engine.
- Worked extensively on Teradata BTEQ Scripts to implement CDC process and Table load.
- Working experience in Teradata RDBMS using FastLoad, MultiLoad, TPump, FastExport, Teradata SQL Assistant and BTEQ Teradata utilities.
- Proficient in SQL, Procedures, Functions, Triggers, Cursors and SQL*Loader; developed various applications using different databases. Good understanding of Data Dictionary, RDBMS and Normalization Techniques.
- Experience in design and implementation of Star, Snowflake schemas and multi-dimensional modeling. Good Understanding of Data Modeling concepts, Erwin - Forward & Reverse Engineering, Dimensional Data Modeling.
- Hands-on experience in UNIX Shell Scripting. Wrote many scripts for loading Staging and History databases and for file monitoring and reporting using BTEQ.
- Used the DataStage PX Director, Control-M and AutoSys Scheduler to run, schedule, monitor, debug and test the application in development and to obtain performance statistics.
- Good understanding of COBOL copybook files from mainframe-to-DataStage conversion projects.
- Well versed in documentation of Design Document, Unit Test and SIT scripts, Implementation Plan, Production Run books, Production Schedule. Thorough experience in unit testing, system integration testing, UAT, implementation, maintenance and performance tuning.
- Investigating and fixing code that fails to deliver the expected results by working on Change Requests (CR). Discussion with ICT team and Business team to get approval to implement CR. Preparation/Review of Design document / root cause analysis document, Solution design document, Test document, Change Management documentation, post implementation tests.
- Unique ability to understand long-term project development issues from a budgeting/management perspective and work through constraining situations. Strives for the best results through meticulous analysis and review. Have good presentation skills.
- Expertise in integration of various data sources like DB2 UDB, Oracle 10g, 9.x/8.x/7.x, SQL Server, MS Access and Teradata
- Experience with and knowledge of ANSI SQL usage and its standards.
- Experience in creating required documentation for production support hand off, and training production support on commonly encountered problems, possible causes for problems and ways to debug them
- Experience in writing server routines in Basic and Parallel routines in C++ for custom logic and leveraging the same in DataStage jobs.
- Experience in Extraction, Transformation, Loading (ETL) of data from various sources into Data Warehouses and Data Marts using Informatica Power Center (Designer, Workflow Manager, Workflow Monitor, Metadata Manager).
- Received Advanced DataStage training from IBM and also taught several DataStage classes
- Knowledgeable and trained in Master Data Manager (IBM MDM), IBM Initiate MDS, IBM Optim, Test Data Management (TDM), Data Masking (DM), Metadata Workbench, InfoSphere Metadata Asset Manager (IMAM)
TECHNICAL SKILLS:
- IBM InfoSphere Server DataStage 11.3/9.x/8.x, WebSphere Information Server DataStage 8.0.1, QualityStage 8.0.1, Information Analyzer 8.0.1, DataStage EE 7.5.x (DataStage Designer, DataStage Director, DataStage Manager, DataStage Administrator), IBM Initiate MDS, IBM Optim, Test Data Management (TDM), Data Masking (DM), Metadata Workbench, InfoSphere Metadata Asset Manager (IMAM), Informatica, Cognos Framework and Report Studio, SQL, PL/SQL, UNIX (AIX, HP-UX, Solaris, Linux), MS-DOS, Windows Server, Dimensional Modeling, ERwin, UNIX Shell Scripting, Teradata, DB2, Oracle, ANSI SQL, SQL Server, Toad, SQL Visual studio, SQL Developer, DataStage Director, Control-M, AutoSys, JIRA, HP Quality Center and IBM RTC
PROFESSIONAL EXPERIENCE:
Confidential, Charlotte, NC
Senior ETL/DataStage Lead/Consultant
Environment: IBM DataStage 11.3 (Designer, Director, Parallel Extender), Linux, Teradata, Teradata SQL Assistant, Teradata Utilities like FastLoad, MultiLoad, TPump, FastExport, and BTEQ, Flat files, Control-M, IBM RTC, WinSCP, PuTTY
Responsibilities:
- Worked as ETL team technical lead and mentored the team for smooth project operations.
- Involved as primary ETL Developer during the analysis, planning, design, development, and implementation stages of projects using the IBM InfoSphere DataStage ETL tool.
- Prepared Data Mapping Documents (DMD) and designed the ETL jobs based on the DMD with the required tables in the Dev environment.
- Actively participated in decision-making and QA meetings and regularly interacted with the Business Analysts and development team to gain a better understanding of the Business Process, Requirements and Design.
- Involved in ETL Estimation and proactively worked for setting up the environments
- Participated in requirement gathering and business meetings, understood the requirements and translated them into a Technical Design Document.
- Contributed to project success by providing technical leadership on assigned projects and writing technical architecture and solutions.
- Designed and developed a scalable, complex solution using an optimum number of stages and an optimal data partitioning methodology.
- Created stories in JIRA and tracked their status; the project operated in Agile and DevOps mode.
- Designed and developed DataStage jobs to extract data from heterogeneous sources, apply transformation logic to the extracted data and load it into Data Warehouse databases.
- Experience with FastLoad, MultiLoad, TPump, FastExport, Teradata SQL Assistant, Teradata Parallel Transporter and BTEQ Teradata utilities
- Experience with UNIX Scripts and Teradata BTEQ Scripts.
- Developed different reusable utilities like SCD/CDC process using Teradata BTEQ script, File to Table load, File Validation, Table Purging and DB collect stats
- Production deployment, monitoring the schedule, pre-implementation planning and resolving post implementation issues and emergency production deployments.
- Worked closely with Team to set up the Control-M schedule both in UAT and PROD
- Raised CRs and incident tickets for code deployment from UAT to PROD.
- Created reusable utilities/components and best practices.
- Split complex job designs into separate job segments and executed them through job sequencers for better performance and easier maintenance.
- Created job sequences using email Notification Activities to send load status updates to business users.
- Created shell scripts to run DataStage jobs from UNIX and scheduled these scripts through the Control-M scheduling tool.
- Analyzed performance and monitored workloads in support of capacity planning.
- Performed performance tuning of the jobs by interpreting performance statistics of the jobs developed.
- Documented ETL test plans, test cases, test scripts, and validations based on design specifications for Unit Testing, Unit Integration Testing and analysis.
Confidential
Senior ETL Consultant /DataStage Developer
Environment: IBM DataStage and QualityStage 11.3, UNIX, SQL Server PDW, Flat files
Responsibilities:
- Understanding systems with respect to different Lines of Business and preparation of various understanding documents.
- Analysis of Business Requirements and understanding the business model and customer requirements.
- Coordinate with Customer to gather and understand the Business requirements.
- Developing Jobs using InfoSphere DataStage and QualityStage
- Experience with QualityStage for data profiling, standardization and matching.
- Installed and configured IBM Information Server.
- Performed administration tasks like role setup, connectivity, security, configuration of projects and user registries, creating or updating users and groups, mapping credentials, and configuring DSN and TNS entries and parallel, compiler, reporting, operator and user-defined environment variables.
- Working across different modules to meet the project deliverables.
- Involved in detailed design and development of jobs using DataStage Designer.
- Worked on several stages such as Copy, Filter, Join, Aggregator, Lookup, Merge and Transformer
- Migration of Jobs, Sequencers, Routines and other objects from Development to Testing and from Testing to Production
- Developed reusable Jobs, Sequencers, and Routines.
- Used parameter sets and variables to make jobs and sequencers more flexible.
- Applying parallelism for better performance and creating Sequencers for jobs.
- Developing shell scripts to automate the run of DataStage sequencers and jobs.
- Unit testing the Jobs.
- Coordinating with team members to effectively carry out all project activities such as analysis and design, coding, integration and testing.
Confidential
Senior ETL/DataStage developer
Environment: IBM DataStage 9.x (Designer, Director, Manager, Parallel Extender), UNIX, Oracle, ANSI SQL standards, PuTTY, WinSCP, Cognos Framework and Report Studio
Responsibilities:
- Understanding systems with respect to different Lines of Business and preparation of various understanding documents.
- Analysis of Business Requirements and understanding the business model and customer requirements.
- Coordination with Customer to understand the Business requirements.
- Involved in Development, Production Support and enhancements to OTR, KPI and other applications.
- Operational activities like Batch Monitoring, Investigating Incident tickets and finding root cause.
- Good command of the batch control process and its implementation.
- Providing solutions to incident tickets and implementation of Change Requests (CR)
- Investigating the incidents/tickets reported by business users, finding the root cause and fixing the issue
- Investigating and fixing the code which fails to deliver the expected results by working on CR.
- Discussion with ICT team and Business team to get approval to implement change request.
- Preparation/Review of Design document / root cause analysis document, Solution design document, test document, Change Management documentation, post implementation tests.
- Involved in Cognos Report studio and Metric Studio development and enhancements.
- Ensure seamless movement of code between Unit test, System Integration test, User Acceptance test, Production and Production replica environments.
- Responsible for communication related to any day-to-day issues and problem resolutions in EDW / BI applications.
- Attend weekly and monthly review meetings to provide client with status of various enhancement/development activities.
- Providing Support and Enhancements for Production and other environments like UAT, Prod Fix.
- Coordinating with team members to effectively carry out all project activities such as analysis and design, coding, integration and testing.
Confidential
ETL DataStage Developer
Environment: IBM DataStage 9.x (Designer, Director, Manager, Parallel Extender), UNIX, Teradata, PuTTY, WinSCP
Responsibilities:
- Performed data analysis and gathered column metadata of source systems for requirement feasibility analysis.
- Created a Logical Data Flow Model in MS Visio from the source system study according to business requirements.
- Transformed Logical Data Model to Physical Data Model ensuring the Primary Key and Foreign key relationships in PDM, Consistency of definitions of Data Attributes and Primary Index considerations.
- Created UML Diagrams including Use Cases Diagrams, Activity Diagrams/State Chart Diagrams, Sequence Diagrams and Deployment Diagrams, Data Flow Diagrams (DFDs), ER Diagrams using MS Visio.
- Worked on Teradata stored procedures and functions to confirm the data and load it into the table.
- Developed procedures to populate the customer data warehouse with transaction data, cycle and monthly summary data, and historical data.
- Worked on optimizing and tuning Teradata views and SQL to improve batch performance and data response time for users.
- Worked closely with analysts to come up with detailed solution approach design documents.
- Provided initial capacity and growth forecast in terms of Space, CPU for the applications by gathering the details of volumes expected from Business.
- Prepared low-level technical design document and participated in build/review of the BTEQ, FastExport, MultiLoad and FastLoad scripts; reviewed Unit Test Plans and System Test cases.
- Creating and maintaining source-target mapping documents for ETL development team.
- Providing requirement specifications and guiding the ETL team in developing the ETL jobs with the DataStage ETL tool.
Confidential
DataStage Developer
Environment: IBM DataStage 9.x (Designer, Director, Manager, Parallel Extender), UNIX, DB2, PuTTY, WinSCP
Confidential
DataStage Developer
Responsibilities:
- Developing Jobs using InfoSphere DataStage v8.x PX
- Analyzed requirements; involved in preparing High Level and Mapping Sheet documents.
- Created LLD and Technical Understanding design documents.
- Involved in project estimations and in the detailed design and development of jobs using DataStage
- Worked on several stages such as Copy, Filter, Join, Aggregator, Lookup, Merge, Sequential stage, Complex Flat File (CFF) and Transformer etc.
- Creating COBOL copybooks for table definitions and creating sample data.
- Migration of the Jobs, Sequencers and other objects from Development to Testing.
- Strong experience in Developing/Optimizing/Tuning jobs in DataStage.
- Used parameter sets and variables to make jobs and sequencers more flexible.
- Unit testing the Jobs.
- Prepared CLI entries for the framework.
- Working across different modules to meet the project deliverables.
- Providing the technical help to other teams.
- The data format and program documentation had many inconsistencies.
- Parsing COBOL records, packed data types in DataStage
- Attaching headers to the output complex flat files
- Testing DataStage jobs against mainframe programs, as the two are in different domains with different databases.
Environment: IBM DataStage 8.x (Designer, Director, Manager, Parallel Extender), UNIX, Oracle, TOAD, Mainframe, PuTTY, WinSCP
Confidential
DataStage Developer
Responsibilities:
- Developing Jobs using InfoSphere DataStage v8.0 PX.
- Working across different modules to meet the project deliverables.
- Preparing required documents such as LLD and TSD.
- Involved in detailed design and development of jobs using DataStage Designer.
- Worked on several stages such as Copy, Filter, Join, Aggregator, Lookup, Merge and Transformer
- Worked on several real time stages such as MQ, XML input and XML output etc.
- Migration of Jobs, Sequencers, Routines and other objects from Development to Testing and from Testing to Production
- Experience in Developing/Optimizing/Tuning jobs in DataStage.
- Developed reusable Jobs, Sequencers, and Server Routines.
- Used parameter sets and variables to make jobs and sequencers more flexible.
- Applying parallelism for better performance and creating Sequencers for jobs.
- Developing shell scripts to automate the run of DataStage sequencers and jobs.
- Unit testing the Jobs.
- Working actively in the integration testing.
- Providing the technical help to other teams.
Confidential
DataStage Developer
Responsibilities:
- Developing Jobs using InfoSphere DataStage v7.5 PX.
- Working across different modules to meet the project deliverables.
- Working with Flat files as source and loading into Oracle Database.
- Using stages such as Copy, Sequential File, Dataset, Modify, Transformer and Lookup
- Applying parallelism for better performance and creating Sequencers for jobs
- Developing shell scripts to automate the run of DataStage sequencers and jobs
- Unit testing the Jobs
- Working actively in the integration testing
- Providing the technical help to other teams