We provide IT Staff Augmentation Services!

Sr. Datastage Developer Resume

4.00/5 (Submit Your Rating)

Baltimore, MD

SUMMARY:

  • Sr. IBM InfosphereDatastage Developer with 6 years of experience in Data Integration and migration for Data Warehouses including IBM Info Sphere/Web Sphere Datastage, Enterprise Edition (Manager, Designer, Director, Administrator, Parallel Extender), Profile Stage/ Information Analyzer, Quality Stage.
  • Extensive experience in Extract Transformation Loading applications using IBM Infosphere Information Server Versions 11.5, 11.3, 9.1, 8.7, 8.5, 7.5, Ascential DataStage 7.5.
  • Experience using IBM InfosphereDataStage 11.5, 11.3, 9.1, 8.5, IBM InfosphereDataStage 8.1 and Ascential DataStage 7.5.
  • Strong understanding of the principles of Data Warehousing using fact tables, dimension tables and star/snowflake schema modeling.
  • Experience in Cutting Edge Technologies like IBM Infosphere Information Analyzer, Infosphere Governance Catalog.
  • Extensive experience in IBM InfoSphere Information Server and patch installations.
  • Hands on experience in Installing IBM Information Server suite on AIX, UNIX, RHEL Linux and Windows platforms.
  • Hands on experience with the Hadoop (HDFS, Hive, Impala).
  • Worked extensively with Dimensional modeling, Data migration, Data cleansing, ETL Processes for data warehouses.
  • Experience in IBM Info Sphere Information Server (IIS), ETL software administration, performing environment, capacity and performance monitoring, MDM patches maintenance/installs or other directly related experience.
  • Handled errors using Exception Handling extensively for the ease of debugging and displaying the error messages in the application.
  • Good knowledge and understanding of Hadoop architecture and components including HDFS, Hive, Sqoop, Flume, Hbase & Pig.
  • Implemented, designed and analyzed the Relational Database (OLTP) and Data Warehousing Systems (OLAP).
  • Installing, Configuring, Managing, Monitoring and Troubleshooting SQL Server 2016/2014/2012/2008.
  • Experience in backing up and Restoring Information Server and Datastage projects.
  • Hands on experience in migration and upgrade ETL environments from Infosphere lower versions to higher versions like from 8.5 to 11.3 and 11.3 to 11.5.1.
  • Excellent knowledge on IBM Information Server suite components like Qualitystage, Information Analyzer and Metadata workbench, etc.
  • Experience in both production support projects and Development projects.
  • Good experience in data extraction/transformation/loading from different sources to a target data warehouse.

TECHNICAL SKILLS:

ETL: IBM InfoSphere& WebSphere DataStage 11.5/11.3/9.1/8.7/8.5/8.1.1/8.0/7.5, Quality Stage, Parallel Extender, Profile Stage,MDM.

Databases: Oracle 12c/11g/10g, SQL Server 2005/2008 R2/2012, DB2, Teradata, MS - Access, Sybase, Netezza, Greenplum, etc.

Database Tools: SQL* Plus, SQL Loader, Toad, Autosys

OS: Windows, AIX, Sun Solaris, HP-UX, RHEL, SuSe Linux

Languages: SQL, PL/SQL, UNIX Shell scripting, Java, PostgreSQL, C, C++, JavaScript, HTML5, DHTMLMethodologies: Agile, Waterfall

Data Warehousing: Star & Snow-Flake schema Modeling, Fact and Dimensions, Physical and Logical Data Modeling, Erwin,Cognos.

PROFESSIONAL EXPERIENCE:

Confidential, Baltimore, MD

Sr. DataStage Developer

Responsibilities:

  • Worked on Confidential 11.5,11.3 to develop processes for extracting, cleansing, transforming, integrating, and loading data into data warehouse database
  • Implementing Industry ETL standards and best practices, performance tuning during designing the Datastage Jobs.
  • Extracted data from Oracle, DB2 and Flat File and Load to target tables using IBM InfoSphereDatastage platform.
  • Extensively used Informatica Data Validation tool to build and test unit test cases.
  • Implemented data extraction, transformation and load processes in a parallel framework.
  • Used stages like Transformer, sequential, Aggregator, Data Set, File Set, CFF, Remove Duplicates, Sort, Join, Merge, Lookup, Funnel, Copy, Modify, Filter, Change Data Capture, Change Apply, Head, Tail, Sample, Surrogate Key, External Source, External Target, Compare, Teradata Connector
  • Upgraded the Infosphere Information server 9.1 from the existing version Infosphere Information server 8.5.
  • Customizing the Data stage Parallel jobs as per current business enhancements.
  • Worked on UNIX scripts for running and validating the job.
  • Building DataStage ETL interfaces to aggregate, cleanse and migrate data across enterprise-wide MDM ODS and Data Warehousing systems using staged data processing techniques, patterns and best practices.
  • Worked on DataStage V9.1 to develop ETL jobs that loads the data from staging to target tables in Teradata server as database.
  • Worked on analyzing Hadoop cluster and different big data analytic tools including Pig HBase database and Sqoop.
  • Building Datastage jobs to migrate data from Oracle to Netezza databases.
  • Built DataStage jobs to read files from multiple tables using Oracle Connector Stage Created Sequencers with Exception Handling and Restarting Logics.
  • Managed, designed, and created the Star Schema and Snowflake Schema for a financial data mart using Erwin and DB2 using Ralph Kimball dimensional modeling techniques.
  • Performed data manipulations using various Informatica Transformations like Aggregate, Filter, Update Strategy, and Sequence Generator etc.
  • Involve in Data validating, Data integrity, performances related to DB, Field size validation, check Constraints and Data Manipulation and updates by using SQL.
  • Generated the Pattern report and Token report as requested by the Business in identifying the Data patterns and inconsistency in the source data using InfoSphere Quality Stage.
  • Involved in Performance and Tuning the Parallel Extender jobs to the maximum extent and achieved the best performance by reducing the Loading Time
  • Used different Parallel Extender Partitioning techniques in the stages to facilitate the best parallelism in the Parallel Extender jobs.
  • Developed ETL mappings and populated EDW tables from ODS, as required by business.
  • Used shared containers for multiple jobs, which have the same business logic
  • Autosys and Datastage Director for Job Scheduling, Emailing production support for Troubleshooting from LOG files.
  • Responsible for using the data mapping to direct into correct systems, data reconciliation and validations.
  • Designed Star and Snowflake Data models for Enterprise Data Warehouse using ERWIN.
  • Migrated the DS server jobs to Parallel jobs by using the IBM InfoSphere Connector Migration tool.
  • Extensively worked with Job sequences using Job Activity, Email Notification, Sequencer, Wait for File activities to control and execute the Data stage Parallel jobs

Environment:: Confidential 11.5/11.3/ 9.1 (Designer, Administrator, Director), Teradata Server, Teradata SQL Assistant, Windows 7,IBM Rational Quest, Java, Netezza, MDM, Autosys, UNIX Shell Scripting.

Confidential, St Louis, MO

Datastage Developer

Responsibilities:

  • Extensively used DataStage for extracting, transforming and loading databases from sources including Oracle, DB2 and Flat files.
  • Developed, tested and implemented Datastage Jobs, JIL Jobs,Ksh scripts for several projects in an Operational Data Store.
  • Extensively used Informatica Power Center Data Validation tool to unit test the ETL mappings.
  • Developed various ETL jobs including Data Extractions, Transformations rules based on business requirements using IBM InfosphereDatastage 8.5.
  • Involved in peer code reviews and testing of ETL flows. Performed Unit testing and Data validation testing using Informatica Data Validation tool.
  • Worked in integration of various data sources (DB2-UDB, SQL Server, Oracle, Teradata, Netezza, XML and MS-Access, SAS, HDFS and JSON) into data staging area.
  • Responsible for loading unstructured data into Hadoop File System (HDFS).
  • Designed, developed and tested the DataStage jobs using Designer and Director based on business requirements and business rules to load data from source to target tables.
  • Developed PL/SQL stored procedures for source pre load and target pre load to verify the existence of tables.
  • Used IBM InfosphereDatastage 8.5 to develop various ETL jobs including Data Extractions, Transformations rules based on business requirements.
  • Involved in the design of Match Templates suggestions made by the Business and validated the results to identify the duplicates coming from the source depending on different Match types using Match Stage - InfoSphere Quality Stage.
  • Designed the Data Marts in dimensional data modeling using star and snowflake schemas.
  • Deploying the code into all other test environments and making sure QA to pass all their test cases.
  • Established best practices for DataStage jobs to ensure optimal performance, reusability, and restart ability.
  • Used Autosys to schedule, run and monitor Datastage jobs.
  • Extracted the data from the DB2 database and loading into downstream Mainframe files for generating the reports.

Environment:: IBM InfoSphere Information Server DataStage 8.5, SQL, PL/SQL,UNIX, AIX, DB2, Java, Mainframe files, Job control, SVN.

Confidential, Jersey City NJ

Datastage Developer

Responsibilities:

  • Data stage 8.5 was used to transform a variety of financial transaction files from different product platforms into standardized data.
  • Designing ETL jobs incorporating complex transform methodologies using Data Stage tool resulting in development of efficient interfaces between source and target systems.
  • Developed ETL jobs to load data from VSAM, GDG, IMS, DB2 databases, Flat files, CSV files to Target and experience with high volume databases on Mainframes.
  • Worked with stages like Complex Flat File, Transformer, Aggregator, Sort, Join, Lookup, and Data masking pack.
  • Co-coordinating with client managers, business architects and data architects for various sign offs on data models, ETL design docs, testing docs, migrations and end user review specs.
  • Primarily involved in Job Design, Technical Reviews and Troubleshooting of jobs.
  • Extensively involved in different Team review meetings and conferences with remote team.
  • Participated in requirements gathering and created Source to Target mappings for development.
  • Extensively designed, developed and implemented Parallel Extender jobs using Parallel Processing (Pipeline and partition) techniques to improve job performance while working with bulk data sources.
  • Created and used Data Stage Shared Containers, Local Containers for DS jobs.
  • Extensively Worked on Job Sequences to Control the Execution of the job flow using various Triggers (Conditional and Unconditional) and Activities like Job Activity, Email Notification, Sequencer, Routine activity and Exec Command Activities.
  • Tuning the jobs for optimum performance.
  • Used Data Stage Director to validate, run and monitor the Data Stage jobs.
  • Experience in generating and interpreting mapping documentation and translating into detailed design specifications using ETL code.
  • Resolved the QA and UAT issues for DataStage jobs
  • Performed the Unit testing for jobs developed to ensure that it meets the requirements.
  • Extensively involved with business team for analyzing the source systems data and building the design documents
  • Extensively worked with architects and proposed solutions in building common design approach for building the Job control and error recording tables.
  • Completely prepared Naming Standards Document and Low Level Design Documents which was used across the Projects.
  • Prepared ETL job run dependency list by discussing with scheduling team and java extracts team and by considering the load and availability of various systems.
  • Prepared mapping documents, technical design document and process flow documents using Visio.
  • Prepared integration test case plans and test scenarios along with testing.
  • Extensively used the advanced DataStage Data warehousing capabilities of almost all Processing stages like Change capture stage, Lookup/Join/Filter/Funnel/Surrogate Key Stages

Environment:: IBM Data stage 8.7 (Director, Designer, Administrator), IBM DB2, UNIX, Oracle, Teradata, Control M, Autosys, DB2, SQL server, Mainframes.

Confidential

ETL Developer

Responsibilities:

  • Extensively used Informatica to load data from various Data Sources like Flat files, Oracle, SQL Server, into the Enterprise Data Warehouse.
  • Used Joiner, Aggregator, Expression, Router, Sequence Generator, Update Strategy and Lookup Transformations to manipulate data related to customers.
  • Designed and developed Informatica Mappings, Mapplets and Sessions for data loads and data cleansing.
  • Extensively worked on confirmed Dimensions for the purpose of incremental loading of the target database.
  • Improved performance by identifying the bottlenecks in Source, Target, Mapping and Session levels.
  • Developed OLAP models for analysis of facts, measures, dimensions and hierarchies.
  • Tuning of the mappings for a better response.
  • Established connection between Informatica and hadoop using hadoop connector.
  • Configured the sessions using Server manager to have multiple partitions on Source data to improve performance.

Environment:: Informatica Power Center 8.6, ORACLE 11g, SQL Server 2008, Flat Files, Windows XP, UNIX, Notepad++.

We'd love your feedback!