Datastage Developer Resume
New York City
SUMMARY
- Around 11 years of experience in system analysis, design and development in data warehousing, data integration and migration, and business application development and support using IBM InfoSphere DataStage (11.5.0.1/8.5/8.1/7.5.2).
- Experience in gathering, documenting and analyzing the business and functional requirements and use cases.
- In-depth knowledge of Data Warehousing and Business Intelligence concepts with emphasis on ETL and full life cycle development, including requirement analysis, design, development, testing and implementation.
- Worked with various relational databases (RDBMS) such as Oracle, DB2 UDB, Teradata, mainframe databases (DB2 10.5) and SQL Server, and with ODBC stages, as both source and target systems.
- Extensive experience in Extraction, Transformation and Loading (ETL) processes using DataStage ETL tool.
- Extensively used DataStage Client tools like DataStage Designer, Director and Administrator in Data Warehouse development.
- Developed ETL jobs as per business rules using ETL design, technical and mapping documents.
- Designed Job Sequencers, Batch Jobs and Parallel jobs. Handled multiple pieces of a Project.
- Extensive knowledge of and experience with stages such as Data Set, Sequential File, Aggregator, Funnel, Filter, Lookup, Surrogate Key, Remove Duplicates, Transformer, Column/Row Generator and Peek.
- Extensively used Export/Import of DataStage job components and imported plug-in table definitions from DB2 UDB, Oracle and Teradata databases.
- Developed jobs using various types of stages such as Sequential File, ODBC, Hashed File, Aggregator, Transformer and Sort.
- Experience in integration of various data sources like Oracle, Teradata, DB2, SQL Server, MS Access and Flat files into the Staging Area.
- Extensively used SQL coding for overriding the generated SQL in DataStage.
- Extensively used SQL in DataStage jobs for data processing.
- Strong experience in multi-dimensional modeling using Star and Snowflake schemas to build Fact and Dimension tables.
- Involved in Performance Tuning and troubleshooting of ETL jobs via configuration files and shared containers.
- Experienced in performing unit, integration and functional testing of ETL jobs and data.
- Documented ETL test cases, test scripts, and validations based on design specifications for testing purposes and analyzing errors.
- Worked with DataStage Administrator for creating projects, users and their access.
- Worked with Unix shell scripts and JIL scripts for scheduling jobs.
- Worked on different scheduling tools such as Autosys and Control-M.
- Good communication and organizational skills, hardworking, ability to work independently or cooperatively in a team.
TECHNICAL SKILLS
ETL Tools: DataStage 11.5/8.5/8.1/7.5, DataStage Enterprise Edition & Server Edition, IBM Information Server Suite 8.1
Database: DB2 10.5, Oracle 8i/9i/10g/12g, Teradata V2R6, SQL Server 14/2005
Environments: Win XP/2000/2003, Linux, HP AIX
Languages: C
Office: MS Word, MS Excel, MS Access
Others: Autosys
PROFESSIONAL EXPERIENCE
Confidential
DataStage Developer
Responsibilities:
- Gather business requirements, information and data necessary for strategic decision making, developing and driving strategic initiatives.
- Conduct requirement assessments and research for projects to provide appropriate input, and collate documents in a professional format.
- Involved in all phases of software project development life cycle (through phases of requirement, design, development, unit testing and deployment).
- Communicate regularly with users and management regarding the status of technology issues, work requests and projects.
- Coordinate with ETL team to implement all ETL procedures for the involved projects and maintain effective awareness of all production activities according to required standards and provide support to existing applications.
- Created the Mapping sheet, Design document, Business Requirement document, Technical Design Document (TDD), SCM Documents.
- Developed DataStage Parallel jobs using various stages like Join, Lookup, Remove duplicates, Filter, Funnel, Dataset, Sequential file, Sort, CDC, Transformer, ODBC, Unstructured Data stages.
- Developed DataStage Sequence jobs using stages like Sequencer, Nested Condition, Terminator Activity, Exception Handler, Notification Activity, Job Activity and Execute Command.
- Used DataStage Director to debug, validate, schedule, run and monitor DataStage jobs.
- Performed assigned activities involving decision support systems, query and reporting, and online analytical processing. Provided data to develop insightful business intelligence reports and dashboards. Cleaned and transformed data as required to deliver actionable information.
- Performed analysis to identify potentially problematic data and eliminate root cause for a range of complex data problems.
- Perform data validation and quality control checks to ensure better extraction, transformation and loading protocols.
- Worked closely with the data modeler and DBA to accommodate the requirements.
- Created test plans, test cases and test data, performed testing in the QA environment, and documented unit test cases, scripts and validations based on design specifications. Used HPQC for creating test plans and performing test labs and test runs.
Environment: InfoSphere DataStage 11.5 (DataStage Manager, Designer, Director, Administrator), DB2, IBM Data Studio, Brio, WinSCP, PuTTY, SFTP, SAP, ESP, HPQC
Confidential, New York City
DataStage Developer/Analyst
Responsibilities:
- Gather user requirements and incorporate them into the overall RES data model and work with the various agencies or vendors to source the data required.
- Design, develop and schedule IBM InfoSphere DataStage jobs to load the RES databases and Reports Data Mart. This includes the associated documentation, ETL development and QA activities. Data is arranged, transformed and consolidated to meet DOF business needs.
- Create data maps for the elements extracted from the originating source systems (such as E-file, Gentax, Fairtax and PASS), or external data sources (NYS, IRS, FISA, and Dun & Bradstreet), to a target data repository based on the overall RES data model.
- Worked with the business to translate business requirements into technical specification documentation, perform quality assurance on all data loads and coordinate user acceptance testing (UAT) of loads. Worked on the data loads for the department's 2+ year backlog of unprocessed data from data sources into the DW.
- Worked in a 2-person team that designed, developed and reviewed ETL processes (DataStage jobs, KSH and SQL scripts) and job flow for the Confidential E-File tax data project.
- Create views, columns, tables and indexes to support the PASS and Business Objects applications and ongoing ETL development.
- Reviewed and worked on quality assurance testing of ETL processes for all assigned data projects, and also provided DOF user support on ad hoc data requests.
- Worked on mapping documents for Data Items from Source Systems to the Target System for future reference.
Environment: IBM InfoSphere DataStage 11.5.0.1, DB2 10.5, DataStage 7.5, AIX 5.2, SQL Server 14, studio, Oracle 12g
Confidential, Dearborn
Sr ETL Developer
Responsibilities:
- Gather Business requirements and prepare Technical Specifications.
- Analyzed the source system to understand the source data and business rules, check data integration and develop the data model.
- Worked with DBA to create the physical model and tables and apply the data model changes.
- Documented ETL Coding standards, test cases, test scripts, and validations based on design specifications for unit testing, system testing, functional testing and prepared test data for testing, error handling and analysis.
- Designed, built and unit tested parallel jobs to extract data from ODBC stages, Teradata Connector stages and flat files.
- Designed and developed ETL Jobs using designer to load data from Oracle, Teradata and Flat Files (Fixed Width) to staging database and from staging to the target Data Warehouse database.
- Worked on creating fixed width flat files and csv files.
- Developed job sequencers with proper job dependencies, job control stages and triggers, and scheduled jobs to run in sequence.
- Utilized Job Sequence stages such as User Variables Activity, Notification Activity, Routine Activity and Terminator Activity, along with local containers, shared containers, job parameter sets and stage variables.
- Used the DataStage stages like Join, Change Capture, Lookup, Transformer, Aggregator, Sequential file, Sort stage, Filter stage, Data Set, Remove Duplicate, surrogate key.
- Used DataStage Administrator for defining environment variables and project-level settings.
- Used UNIX scripts to execute jobs and also used the DataStage Director for scheduling, executing and monitoring jobs.
- Scheduled jobs through Autosys.
- Performed performance tuning of DataStage jobs and SQL queries.
- Created Shell Scripts to read parameter files and to automatically notify Business Exceptions and Rejects during the Loads and file processing stages.
- Responsible for code reviews, planning and estimating project requirements, and reporting status to business managers.
- Conducted walkthroughs of SIT documents with the client for UAT sign-off.
Environment: IBM WebSphere DataStage 8.0.1, IBM InfoSphere DataStage 8.7, Teradata, DB2, Windows XP, UNIX, SQL Server, Autosys.
Confidential
DataStage Developer
Responsibilities:
- Reviewed the project scope, requirements, architecture diagram, Conversion design documents (CDD) and development guidelines.
- Worked closely with Mach 1 process team members during the application development process (requirement gathering, design, build and test, and implementation).
- Designed and developed ETL jobs using IBM InfoSphere DataStage 8.5 to extract, transform and load the data into SAP.
- Analyzed the data and was responsible for data cleansing and data conversion for Mach 1 deployment sites.
- Developed parallel jobs using different processing stages like Transformer, Aggregator, Lookup, Join, Merge, Sort, Remove Duplicate, Copy, Funnel and Filter and other stages.
- Worked on QualityStage to develop jobs for data cleansing and data quality improvement.
- Performed various conversions/transformations for the data fields/values as per the business logic specified in mapping document.
- Used DataStage Director to run, monitor and validate the jobs.
- Performed various tasks with SQL programming language for Data retrieval and query on the data as part of application development.
- Worked with various database (Oracle, DB2) stages and connectors as source and target systems.
- Loaded the Data into SAP R/3 using various programs like LSMW, Z-Tran and DataStage.
- Created and managed defects/tickets and scripts with HP Quality center for quality assurance, requirement management, test management and business process testing for IT and application environments.
- Used DataStage Director to clear the job logs, job resources and status files.
- Demonstrated strong customer engagement skills and an understanding of the importance of creating and deploying quality software deliverables.
- Worked closely with various teams in various geographical locations.
Environment: IBM Information Server (DataStage 8.5, Information Analyzer), Oracle, DB2 UDB, TUFops, SQL, SAP R/3 Packs, HP Quality Center.
Confidential, Charlotte, NC
DataStage Developer/Analyst
Responsibilities:
- Involved in requirement gathering, analysis and study of existing systems.
- Involved in preparing technical design/specifications for data Extraction, Transformation and loading.
- Extensively used DataStage Designer to develop various jobs to extract, transform, integrate and create extract files as needed.
- Wrote JIL scripts to create Autosys jobs that trigger ETL jobs and shell scripts.
- Created technical specification documents for the DataStage jobs, developed several test plans, and maintained error logs and audit trails.
- Developed new and effective controls (data, report, ETL) for data validation.
- Worked on the EDW Conversion project, mainly designed for migrating EDW 2.0 code to the IIS Grid environment.
- Worked on a migration project in which ETL jobs, scripts and Autosys JIL jobs were forklifted to EDW 3.0 installed on the IIS Grid environment.
- Implemented performance-tuning techniques along various stages of the ETL process.
- Coordinated with the Admin team in migration of the DataStage code on different environments (Development, test and production).
- Developed a generic shell script to initiate file transfer between two servers.
- Coordinated with client managers, business architects and data architects for various sign-offs on data models, ETL design docs, testing docs, migrations and end user review specs.
- Worked extensively on executing the daily IT and ST cycles, debugged problems in daily runs, and provided technical and functional help to team members.
Environment: IBM InfoSphere DataStage 8.1, OBIEE, Teradata 13.0, Autosys 4.5.1, StarTeam, WinXP, UNIX and Teradata SQL Assistant
Confidential, Detroit, MI
DataStage Developer
Responsibilities:
- Analyzed and gathered the business requirements. Interacted with business users and technical architects to analyze the data and gather requirements from various sources.
- Used WebSphere DataStage to design, execute, manage, deploy and administer DataStage jobs.
- Used Designer client for basic and advanced search, impact analysis, and import and export of components for moving jobs between different environments.
- Used DataStage to transform data through multiple stages and prepared documentation. Used DataStage Manager to define table definitions, custom routines and custom transformations.
- Worked extensively with different stages in DataStage like Join, Sort, Merge, Aggregator, Remove Duplicates, Funnel, Transformer, Surrogate Key Generator, SCD, and Range and Case Lookup.
- Extensively worked on DataStage Job Sequencer to schedule jobs to run in sequence.
- Created data connections and parameter sets or environment variables to use them for the project across dev, test and production servers.
- Used Director Client to validate, run, schedule and monitor the jobs that are run by WebSphere DataStage server.
- Used Administrator Client for general and project-related settings and mappings.
- Involved in migration of jobs from DataStage 7.5x2 to DataStage 8.
- Used IBM IS Web Console for administration areas: security, licensing, logging and scheduling.
- Installed and Configured Director, Manager, Administrator and Designer components of DataStage.
- Understood the business needs and implemented them in a functional database design.
- Maintained warehouse metadata, naming standards and warehouse standards for future application and development.
- Used Autosys to schedule jobs and e-mailed the status of jobs to operations team daily.
Environment: IBM InfoSphere DataStage 8.1 (Designer, Director, Administrator), Unix, Oracle 10g, DB2, SQL, Autosys, Windows XP/2003
