Sr. Etl Developer/architect Resume
New Britain, CT
SUMMARY
- Over 11 years of dynamic career reflecting pioneering experience in the field of Information Technology as a DataStage/ETL Developer in Data Warehousing and Client/Server technologies, with strong understanding in all phases of SDLC involving Requirement gathering, Business analysis, Application Design, Data Modeling, Development, Implementations and Testing of Database business systems for Pharmaceutical, and Technology industries.
- Experience in latest version of IBM InfoSphere Information Server DataStage and QualityStage 8.5/8.1.
- Expertise in Data Requirement Analysis, Design, Development of ETL process using IBM InfoSphere DataStage 8.5/8.1/7.x/6.x, DataStage Designer, DataStage Director, DataStage Administrator, Parallel Extender (PX)/Orchestrate.
- Expertise in cleansing the data using IBM QualityStage 8.1, Ascential QualityStage 7.5.2/7.
- Strong experience in Name and Address standardization using the USNAME, USADDR and USAREA rule sets
- Experience in Information Analyzer Column Analysis, Primary Key Analysis, Cross domain Analysis and Base line Analysis
- Strong experience in setting up ODBC data connections to Information Analyzer/Profile Stage
- Created complex data rules for profiling source data files/ before loading the data to the target tables.
- Created Metrics and Rule Sets in Information Analyzer.
- Strong working experience on Data Warehousing applications, directly responsible for the Extraction, Transformation and Loading of data from multiple sources into Data Warehouse.
- Efficient in Analyzing, Designing, Developing, and maintaining highly complex ETL processes.
- Experience in integration of various data sources which include DB2 - UDB, SQL Server, Sybase, Oracle, RTI Webservices, Teradata, XML, SAPR/3Plugin, MS-Access and Sequential files.
- Excellent noledge of Server jobs, Parallel jobs, SQL, PL/SQL, Stored Procedures and Triggers, debugging, troubleshooting and performance tuning.
- Extensively contributed in the areas of Dimensional modeling (Star Schema and Snowflake Schema).
- Excellent noledge of studying the data dependencies using Metadata stored in the DataStage Repository.
- Developed complex UNIX Shell Scripts for automation of ETL processes and Data transfer needs.
- Proven track record in troubleshooting of DataStage jobs and addressing production issues like performance tuning and enhancement.
- Expert in unit testing, system integration testing, implementation and maintenance of DataStage jobs.
- Excellent experience in elevating successful code to production.
- Highly motivated and adaptive with the ability to grasp things quickly.
- Ability to work TEMPeffectively and efficiently in a team and individually with excellent interpersonal, technical and communication skills.
TECHNICAL SKILLS
- IBM InfoSphere Information Server (DataStage and QualityStage
- Information Analyzer
- Metadata Workbench) 8.5/8.1
- Ascential DataStage Enterprise and DataStage 7.5/7/6.x(Designer
- Manager
- Director and Administrator)
- IBM Information Server Suite
- Ascential QualityStage 7.5.2/7
- QlikView 6.0/5.04 (QlikView Analyzer
- QlikView Publisher
- QlikView Enterprise
- Macro Debugger
- Security override
- Profiler)
- Business Objects (Designer 5.0
- Supervisor
- Reporter & WEBI)
- Erwin 7.x
- Power Designer
- Oracle 10g/9i/8i
- DB2 UDB 7.1/8.1
- SQL Server 2000/05
- MS Access
- Sybase 12.x/11.x
- SQL
- PL/SQL
- Visual Basic 2005/6.x/5.x
- HTML
- DHTML
- Java
- UNIX Shell Scripting
- Solaris 2.x./8
- Linux 7.1/7.2/8
- IBM-AIX
- HP UX
- SCO-UNIX
- Windows 2000 Server/Advanced Server
- Windows 95/98/2000/XP
- OS/390
- WinNT4.0
- TOAD 9.x/8.x
PROFESSIONAL EXPERIENCE:
Confidential, New Britain CT
Sr. ETL Developer/Architect
Responsibilities
- Used Technical transformation document to design and build the extraction, transformation, and loading (ETL) modules.
- Installed and configured Websphere Datastage 8.1 on Linux.
- Used ISALite application to gather diagnostic information and troubleshooted problems.
- Applied Fix Packs on Datastage 8.1 on Linux.
- Extensively involved in migration of projects from Datastage 7.5 dat are on Windows platform to Datastage 8.1 on Linux.
- Created Metrics and Rule Sets in Information Analyzer.
- Configured Project Level Environment variables, security, and connectivity. Tuned the deployment for performance, and backed up the installation.
- Managed 2 ETL developers.
- Designed and Developed Data warehouse system dat served as a single source of limited duration transactional or operational information for downstream application
- Created Data stage jobs using different stages like Transformer, Aggregator, Sort, Join, Merge, Lookup, Data Set, Funnel, Remove Duplicates, Copy, Modify, Filter, Change Data Capture, Change Apply, Sample, Surrogate Key, Column Generator, Row Generator, and SAPR/3Plugin Etc.
- Created parameters to run the same job for different schemas
- Used DataStage Parallel Extender stages like Sequential file, Datasets, Copy, Oracle Enterprise, Join, Sequencer, Aggregator, Merge, Filter, Transformer, Lookup, Funnel, Compress stage.
- Create master controlling sequencer jobs using the DataStage Job Sequencer
- Extensively used RTI input and RTI output stages and created webservice jobs
- Used parallel extender for splitting the data into subsets and flowing of data concurrently across all available processors to achieve job performance, worked with Orchestrate environment for parallel processing at Job stage & lookup stage.
- Developed star schema data model using suitable dimensions and facts. Involved in analyzing the scope of application, identifying the relationship within and between the groups of data.
- Utilized SQL *Loader to load bulk data into target Oracle data warehouse.
- Developed several Test Plans, UNIX Scripts for Unit/Team Testing.
- Involved in Unit testing, Functional testing and Integration testing and provide process run time.
- Create and use DataStage Shared Containers, Local Containers for DS jobs and retrieving Error log information.
- Developed system test plans and test cases for unit testing. Performed data validation, unit testing, and performance analysis for the designed components.
Environment: InfoSphere Data Stage 8.5/8.1/7.5.2/Parallel Extender, Quality Stage, Oracle 10g/9i, FTP, SQL, PL/SQL, SQL Server 2008, Toad 9.0, Erwin 7.x, UNIX, PERL, UDB DB2, Autosys and AIX 5.
Sr. ETL Developer
Confidential, Wilmington, DE
Responsibilities:
- Managed off-shore developers
- Understand the business rules and implement the same into a physical data model schema using Erwin.
- Prepared Functional Requirements Document & designed ETL Technical design document.
- Modeling and populating the business rules using mappings into the Target Data base.
- Extensively worked with Longitudinal Prescription Data from various data sources like WKH, Verispan and IMS.
- Designed and developed database Procedures, Triggers, Functions and Packages using PL/SQL
- Extensively used Sql*Loader to import the data into database objects.
- Created complex queries and Optimized Query Performance for faster retrieval of the results.
- Developed DataStage after Job Routines to retrieve statistics of the load process and insert to a statistic table defined in the target.
- Designed the DataStage Job Control methodology, which controls the execution and monitoring of DataStage jobs and for handling complex job dependencies and enabled checkpoints as part of restart strategy.
- Extensively used the DataStage Client Components Designer, Director, Manager and Administrator.
- Manage the Metadata and Created Reports using DataStage Manager.
- Used the Plug-ins, BCPLoad to Populate the Data for Major Databases like Oracle 9i/10g
- Extensively Created Lookups for Code- Decode tables using ODBC lookup and Hashed file Lookup.
- Enhanced the reusable logic using the Shared Containers in DataStage Designer.
- Tuning DataStage Jobs for Optimizing the Performance.
- Import and Export Repository and Objects using DataStage Manager and also from Command line using DsExport.
- Extensively Created and used shell Scripts to run the job at specified times.
- Populate Data to Multiple Targets like Flat File (Fixed, Delimited), SQLServer, Oracle, Sybase.
- Filtered, Purged log files for Whole Project and each Job.
- Identify the Source of the Errors and fixing them and Maintain log of the Errors.
- Used DataStage Administrator to Add, Delete, Move Project files and Tune
- Defined the program specifications for the data migration programs, as well as the necessary test plans used to ensure the successful execution of the data loading processes.
- Redesigned the existing jobs based on the client specification.
- Analyze the user requirement for the reports.
- Created Daily Reports, Dynamic Reports and Ad-hoc Reports.
- Extensively involved in creating Card Program Tracking Reports.
- Created indexes on Reporting tables and tuned the SQL Queries for better lead times of Reports.
- Scheduled loading of reports after the ETL Batch Load and update the BI Users through E-mail.
- Extensively used QlikView reporting tool to create Sales reports, Calls reports
Environment: Oracle 10g/9i, PL/SQL, Toad 9.x, Erwin 4.5, Ardent DataStage 7.5.(DataStage Manager, DataStage Director, DataStage Designer), UNIX, Windows 2000 Server, SunOS 5.9, QlikView 6.0 (QlikView Analyzer, QlikView Publisher, QlikView Enterprise, Macro Debugger, Security override, Profiler), MS VISIO, Install Shield
Confidential, Morristown, NJ
Sr. ETL Developer
Responsibilities
- Analyzed the functional spec and designed the technical design specification documents for different interfaces. Involved in design and code reviews and extensive documentation of standards, best practices, and ETL procedures.
- Gather requirements and design of data warehouse and data mart entities.
- Played role in design of scalable, reusable, and low maintenance ETL templates.
- Extensively worked with Longitudinal Prescription Data from various data sources like WKH, Verispan and IMS.
- Build efficient Data Stage jobs for processing fact and dimension tables with complex transforms and type 1 and type 2 changes.
- Designed and developed various jobs for loading data from multiple sources oracle, UDB to ODS.
- Used DataStage Parallel Extender stages like Sequential file, Datasets, Copy, Oracle Enterprise, Join, Sequencer, Aggregator, Merge, Filter, Transformer, Lookup, Funnel, Compress stage.
- Developed Perl and UNIX shell scripts for FTP source files maintenance. Involved in modifying the existing scripts for new enhancements.
- Adding custom standard rules in Standardize Stage (QualityStage) for data cleansing according to the business requirement and company standards to enrich the data.
- Strong experience in setting up/installing Information Analyzer/Profile Stage
- Experience in Information Analyzer Column Analysis, Primary Key Analysis, Cross domain Analysis and Base line Analysis
- Strong experience in generating reports using Information Analyzer for business users based on column analysis and baseline analysis
- Used Column Analysis results to understand the hidden business rules, data types, data anomalies, cardinality and nullability.
- Created complex data rules for profiling source data files/ before loading the data to the target tables.
- Wrote Sql Code, using TOAD to handle complex business logic and to compare source data with the target data during the Unit and Functional Testing.
- Oversaw unit and system tests and assisted users with acceptance testing and experienced in creating the test cases for each stages.
- Train and manage developers, advising other groups in organization on data warehouse development, and ETL development best practices.
- Provide on-call support to production system to resolve any issues.
Environment: Ascential DataStage 8.x/7.5.2/Parallel Extender, QualityStage, Reportive, QlikView 6.x, Oracle 10g/9i, FTP, SQL, PL/SQL, Toad 9.0, Erwin 7.x, UNIX, PERL, UDB DB2, Autosys and AIX 5.3.
Confidential . Morristown, NJ
Sr. ETL Developer
Responsibilities
- Understand the business rules and implement the same into a functional star schema database design, using Kimball’s methodology schema using Erwin.
- Prepared Functional Requirements Document & designed ETL Technical design document.
- Modeling and populating the business rules using mappings into the Target Data base.
- Designed and developed database Procedures, Triggers, Functions and Packages using PL/SQL
- Designed and developed jobs using Parallel Extender for splitting bulk data into subsets and to dynamically distribute to all available nodes to achieve best Job performance.
- Designed the DataStage Job Control methodology, which controls the execution and monitoring of DataStage jobs and for handling complex job dependencies and enabled checkpoints as part of restart strategy.
- Staging the information from various sources and external system for the Data warehouse.
- Managed off-shore developers/team.
- Used the Plug-ins, BCPLoad to Populate the Data for Major Databases like Oracle, Informix, and Terradata.
- Enhanced the reusable logic using the Shared Containers in DataStage Designer.
- Tuning DataStage Jobs for Optimizing the Performance.
- Import and Export Repository and Objects using DataStage Manager and also from Command line using DSExport.
- Created shell Scripts to run the job at specified times.
- Populate Data to Multiple Targets like Flat File (Fixed, Delimited), SQLServer, Oracle, Informix and Terradata.
- Filtered, Purged log files for Whole Project and each Job.
- Identify the Source of the Errors and fixing them and Maintain log of the Errors.
- Defined the program specifications for the data migration programs, as well as the necessary test plans used to ensure the successful execution of the data loading processes.
- Migrated the existing jobs from DataStage 6.0 version to DataStage 7.5 version
Environment: Oracle 10g/9i, IBM/Ascential DataStage 7.5.(Enterprise Edition)/7.0/6.X (DataStage Manager, DataStage Director, DataStage Designer), UNIX, Windows 2000 Server, SunOS 5.9IT Group, Inc., (Assignment at Confidential ., NJ) Jan’00 to Jan’03
ETL Developer
Confidential
Responsibilities:
- Worked in Block Drug Pharmaceuticals - Handled Clarify Cases, Daily Data Loads, Developed UNIX Shell and SQL Scripts for Login Analysis, Errors Analysis, Actively involved in Error Validation Process, Adhoc Reports and made Operations Guide, Actively involved in generating reports in Data Center Move
- Worked in Cell Tech - Actively Involved in Pilot and Rollout Stages, Developed Unix Shell and SQL Scripts for Login analysis, Error Analysis and made Operations Guide, Handled Clarify Cases, Involved in long term Data Validation Process, Regular Data Loads and Analysis
- Worked in BMS - Involved during the Initial Data Loads, Actively Involved in Pilot Stage. Developed Unix Shell and SQL Scripts for Login Analysis, Errors Analysis, Actively Involved in Regular Data Loads using Data Stage and SQL Loader, Developed Operations Guide (SOP) for BMS
Environment:Oracle8.x, UNIX, NT, SQL, PL /SQL, and SQL Loader, Data Stage 4.1, Force Pharma 1.3., Web Force 1.5., Clarify