Datastage / Teradata Developer Resume
IL
PROFESSIONAL SUMMARY:
- Over 6 years of work experience in Data Warehousing and Data integration, Data Quality and conversion processes, including Software installation, Design, Development and Administration.
- Experience with DataStage Adminstartion activities and Qualitystage.
- Extensive experience in developing strategies for Extraction, Transformation, Loading (ETL) data from various sources into Data Warehouse and Data Marts.
- Extensive experience in Data warehousing tools IBM Information server 8.0.1/8.1(Designer, Director and Administrator) and Ascential DataStage 7.5.2/7.5.1 (Manager, Designer, Director and Administrator) and Data Modeling Tools.
- Experience in the Analysis, Design, Modeling, Development, Implementation and Testing of Data Warehousing applications.
- Data modeling knowledge using Dimensional Data Modeling, Star Schema Modeling, Snow Flake Modeling, Fact and Dimension Tables.
- Good knowledge in MDM and also worked as Team lead for a project.
- Established projects, roles, users, privileges in different ETL environments like DEV and QA.
- Experienced working with various operational sources like Teradata, SQL Server, Oracle, DB2, Flat files, Greenplum.
- Experience in integration of various data source like Oracle, SQL Server, and DB2, Netezza.
- Ability to work with client to identify source - target data and their availability, data extraction, processing and testing skills, proficient with multiple extraction, transformation and load tools.
- Experience with Conceptual, Logical and Physical data modeling.
- Excellent work experience in AGILE project methodology and development.
- Involved in complete Software Development life-cycle (SDLC) of various projects, including requirements gathering, system designing, data modeling, ETL development, production enhancement, support and maintenance.
- Knowledge on Infosphere IGC ( Information Governance Catalog), IA ( Investment Analyzer), IMAM ( Metadata Asset Manager)
- DesignedDataStageJobs to extract data from XML files using XML input stage, Used XML transformer stage to cleanse and transform the data to load it into the Data Mart.
- Experienced in scheduling sequence, parallel and server jobs using DataStage Director, UNIX, and AIX scripts and scheduling tools.
- Experience in using Erwin as leading Data modeling tool for logical (LDM) and physical data model (PDM).
- Implemented the complex business rules by creating reusable transformations and Mappings / Mapplets (STM).
- Quick learner and adaptive to new and challenging technological environments.
- Working experience in interacting with Business Analysts and developers to analyze the user requirements, functional specifications and system specifications.
- Extensive experience in analysis and design of database including ER Diagrams and Normalization techniques.
TECHNICAL SKILLS:
ETL Tools: DataStage 11.3/9.1/8.7 /8.5/7.5.2/7.5.1/7.0/6.0 (Manager, Director, Designer), IBM Information Server 8.1.
Operating systems: Windows 98/2000/XP/7/8/10/NT, UNIX (HP, Sun Solaris 7/8,Humming bird), LINUX
Databases: DB2 V10, 8.2, DB2UDB, Oracle 11g/10i/10g/9.x/8i/8.0/7i, SQL Server 2008 & 2012, Netezza, TOAD 7.0, Teradata, Informix
Languages: PL/SQL, SQL server 2012 and 2016, UNIX shell programming, TSQL
Others: SQL Assistant, Oracle DDL, Control M, Autosys, TWS, Netezza, UNIX AIX, ERwin 9.6/6.0/4.5.
PROFESSIONAL EXPERIENCE
Confidential, IL
Datastage / Teradata Developer
Responsibilities:
- Involved in all phases of Project Life cycle: Requirements gathering, Development, Testing, and Production Support.
- Coordinated with business users at the time of requirements gathering and helped in developing the Application Design Document.
- Created source to target mapping documents according to client standards.
- Developed DataStage code to process the customer membership feed from which comes in flat file format and change it in to XML and send it to downstream.
- Developed Technical Design Documents that help developers to start coding.
- Extensively worked with Shared Containers for Re-using the Business functionality.
- Created job parameters and Environment variables to run the same job for different schemas.
- Involved in Unit testing of the code and coordinated with the testing team, Integration team and the User acceptance team.
- Involved in debugging of existing jobs that had different sources like Complex Flat Files, XML files and Databases, and participated in UAT testing.
- Export and Import jobs between different environments via packages and versioning the code using InfoSphere Information server Manager.
- Created and executed SQL queries to fetch data from Teradata to validate and compare expected results with those actually obtained.
- Worked on changed requests as per clients and projects technical specification needs.
- Developed jobs with multiple Look-ups and maintained the integrity of the data mart in accordance with the requirement.
- Used DataStage Manager for importing metadata from repository, new job categories and creating new data elements.
- Used Information Analyzer for column analysis, primary key analysis and foreign key analysis and also used IBM data Mirror.
- Created and used Table Partitions to further improve the performance while using tables containing large number of columns and Rows.
- Created Shared Container stages to use in other jobs and scheduled to use the containers in Parallel Extender.
Environment: IBM InfoSphere Information Server 8.7, Unix AIX Scripting, Flat Files, Teradata 14.0.1, Zena.
Confidential, VA
Datastage Developer
Responsibilities:
- Data is extracted from Teradata or loaded into Teradata tables using Connector
- As a lead member of ETL Team, responsible for analyzing, designing and developing ETL strategies and processes, writing ETL specifications.
- The data extracted is validated and analyzed for any data quality issues.
- Teradata is queried through Teradata SQL Assistant to determine the consistency of the data.
- Used Datastage stages namely Sequential file, Transformer, Aggregate, Sort, Datasets, Join, Lookup, Change Capture, Funnel, Peek, Row Generator, CDC stages in accomplishing the ETL Coding.
- The data extracted needs to be aggregated or looked up or joined depending on the requirements.
- Experience in developing jobs to maintain the history of the data.
- Used Teradata tables as reference table to get the data.
- Involved in writing SQL queries.
- Used SVN Checkout for migrating the jobs to different stages.
- Controlled jobs execution using sequencer, used notification activity to send email alerts.
- Created shared containers to simplify job design.
- Used Infosphere IGC (Information Governance Catalog), IA (Investment Analyzer).
- Involved in writing the Unix AIX scripts for deleting the datasets or for moving the files from one location to another.
- Created architecture documents for different stages of the project life cycle which were used by developers to build the solution.
- Responsible for maintaining, updating and creating MDM datamodels.
- Experience in developing purge and the maintenance jobs.
- Designed jobs by calling the match templates prepared and generated the results.
- Experience in designing the Diagnostic jobs for verifying the connections of the databases.
- FTP ing the logs to the desired location for the access to the team.
- Used Data Stage Director to run and monitor the jobs for performance statistics.
- Involved in performance tuning of the job.
Environment: IBM Infosphere Datastage 8.5, Autosys, MQ, Teradata 13.11.03, Teradata SQL Assistant, Unix AIX Scripting, SQL, Windows XP, TOAD 7.0.
Confidential, Raleigh, NC
Sr. DataStage Developer
Responsibilities:
- Created high level design documents for extract, transform, validate and load ETL process and flow diagrams.
- Participated in business meetings with vendors and clients in finalizing the enhancements.
- Designed logical and physical models using Erwin data modeling tool.
- Involved in designing and development of both server jobs and parallel jobs to extract data.
- Wrote SQL and PL/ SQL queries for aggregation and outer joins for better performance.
- Wrote shell scripts for job scheduling.
- Created job sequencers.
- Involved in system and performance testing.
- Took part in code review and performance analysis of new jobs.
- Involved in production support along with designing and development of jobs.
- Involved in tuning of server jobs for performance using Hash File stage, Orabulk stage and others.
- Worked with DataStage Manager for importing metadata from repository, new job categories and creating new data elements.
- Preformed code migration, and used normalization techniques to improve performance.
- Used the DataStage Director and its run-time engine to schedule running the solution, testing and debugging its components, and monitoring the resulting executable versions (on an ad-hoc or scheduled basis).
- Built mappings by using SQL Override, Connected/Unconnected Lookup, Lookup Override, Normalizer, Expression, Joiner, Router, Update Strategy, Stored Procedure transformation, Sequence Generator transformation, etc.
- Optimized and modified the triggers, complex store functions, procedures, user base data type etc.
- Involved in DataStage security - assigned IIS suite and suite component roles in IIS Web Console.
- DataStage disk usage - projects, system files, datasets & scratch.
- Worked with Administrator Client to manage projects and set their properties.
- Worked with Designer Client to debug server, parallel and sequence jobs using stages.
Environment: Infosphere DataStage 8.1/ 7.5.2 (Designer, Director, Manger), Information Analyzer, Oracle 11g, Oracle DDL, Teradata V2R6, DB2, UNIX, PL/ SQL, sequential files, and complex flat files, Erwin 6.0, SQL Server 2008.
Confidential, Dallas, TX
DataStage Developer
Responsibilities:
- Involved in the design and development of Data Warehouse.
- Worked closely with data modeler and database administrator to understand the business process and participated in gathering of business requirements to build the data warehouse model.
- Design the Data Mart model with Erwin using Star Schema methodology and worked as a team lead.
- Worked extensively with Parallel Extender using Parallel Processing (Pipeline and partition parallelism) techniques to improve job performance while working with bulk data sources.
- Extensively used Parallel Job Stages like Stored Procedure, Dataset/File set, Lookup File set, Aggregator, Join, Transformer, Sort, Merge, Filter, FTP, Lookup, Head, Tail and Pivot.
- Used Job Sequencer stages to link multiple jobs in Series/Parallel based on the requirement.
- Used partition key in Parallel Extender job for bulk loading of data and to get better performance.
- Used the DataStage Designer to develop processes for extracting, cleansing, transforms, integrating, CDC and loading data into data warehouse database.
- Used DataStage Manager for importing metadata from repository, new job categories and creating new data elements.
- Used the DataStage Director and its run-time engine to schedule running the solution, testing and debugging its components, and monitoring the resulting executable versions (on an ad hoc or scheduled basis).
- Implemented shared containers for multiple jobs, which have the same business logic.
- Extensively used Shell Scripts to trigger the DS jobs through a Master Sequencer.
- Designed code management tools such as Clear Case.
- Added, deleted and moved projects and jobs. Purged job log files and traced server activity.
- Migrated DataStage and QualityStage jobs from earlier versions to DataStage 8.0 version.
- Involved in analysis based on requirements and developed Cluster & Grid Architecture for data integration and created Data lineage Documents.
- Managed DataStage services like DSRPC, Telnet and Engine Resource Services.
- Used IIS Web Console for administration areas: security, licensing, logging, and scheduling.
Environment: IBM InfoSphere Data Stage 8.0.1, Oracle DDL, Oracle 11g/9i/10g, PL/SQL, IBM DB2/UDB 8.x, UNIX Shell Scripts, SQL Developer, Teradata SQL Assistant
Confidential
DataWarehouse Developer
Responsibilities:
- Business Analysis and Requirements Gathering.
- Involved in creating logical and physical database design-using Erwin.
- Implemented various strategies for Slowly Changing Dimensions.
- Involved in developing process for Extracting, Cleansing, Transforming, Integrating and loading data into data warehouse database using the database designer.
- Wrote Triggers and Stored Procedures using PL/SQL.
- Created Batches to run several Sessions sequentially and concurrently.
- Designed lookup strategies using Hash file stage.
- Used Datastage as an ETL to extract Data from sources like Oracle, Mainframes and DB2 and loaded to target database.
- Developed parameter driven ETL process to map source systems to target data warehouse with Datastage complete source system profiling.
- Extensively used debugger to test the Data Flow and Fix jobs.
- Generated UNIX shell scripts for creating, dropping tables and scheduling the jobs.
Environment: DataStage 6.0, DB2, PL/SQL, UNIX Shell Scripting and Windows NT 4.0,Orcale 8i, Erwin 4.0.