Etl Lead Resume
SUMMARY:
- 8+ years of combined IT experience and technical proficiency in Data Warehousing space with hands - on expertise in comprehensive Data Integration ETL processes, Dimensional Data Modeling, Data Analysis, Application Design and Implementation of business applications for financial and insurance verticals. Expertise in database replication using Informatica Power Center and Informatica Power Exchange.
- Strong experience in ETL using Informatica Power Center 9.x (Workflow Manager, Workflow Monitor, Source Analyzer, Transformation Developer, Mapplet Designer, Mapping Designer, Repository Manager & Informatica Admin Console).
- Experience in database replication using Informatica Power Exchange for Mainframe and Oracle using Oracle Log-miner to read change Data from oracle Redo Logs.
- Experience in extraction, transformation and loading of data directly from heterogeneous sources like Flat files, Oracle, SQL Server, DB2 and Teradata.
- Extensively involved in Optimization and Tuning of Mappings & Sessions in Informatica through identifying and eliminating Bottlenecks, employing Partitioning and using Log Statistics.
- Experience with Oracle 11g, SQL, PL/SQL, Stored Procedures, TOAD, Explain Plan, Constraints, Triggers, Views, Materialized Views.
- Experience in Teradata Database design, implementation and maintenance mainly in large scale Data Warehouse environments, experience in Teradata RDBMS using FastLoad, MultiLoad, TPump, FastExport, Teradata SQL Assistance, Teradata Parallel Transporter and BTEQ Teradata utilities.
- Strong understanding of Oracle Database Partitioning and Partition Exchange to manage tables holding huge volumes of data to achieve optimum performance.
- Experience with Netezza Database integration with Informatica and load process with Netezza Bulk load utilities like Netezza Bulk reader and Bulk writer.
- Experience working with distribution keys on Netezza to achieve optimum performance.
- Well-versed in Data Modeling concepts incorporating Dimensional Modeling (Star & Snowflake Schema), Slowly Changing Dimension of type I, II and type III (version, flag and timestamp).
- Experience using Automation tools such as Maestro, UC4 and Autosys for scheduling Informatica & Oracle jobs.
- Proficient in coding UNIX shell scripts for Informatica scheduling and file handling.
- Experience with SDLC models - waterfall and Agile software development to perform requirements gathering, prototyping, proof of concept, design, documentation, implementation and testing.
TECHNICAL SKILLS:
ETL Tool: Informatica 9.x
Data Replication Tool: Oracle EBR and Informatica power exchange
Data Modeling: Star and Snowflake Schema Modeling, Dimensional Data modeling, Fact and Dimensions tables
Configuration Tools: VSS
Database: Oracle, Teradata, Netezza, SQL Server
Languages: SQL, PL/SQL,UNIX and Python
Scheduling Tools: Autosys, Maestro and UC4
Script: UNIX Shell script
Service Management Tools: Service Manager and Service Now
Project Management Tools: MS Word, Excel, Visio, Power Point, Outlook
PROFESSIONAL EXPERIENCE;
Confidential
ETL Lead
Responsibilities:
- Interacting with the SME's, architects and data modelers to understand the changes that need to be implemented and prepare the ETL design approach.
- Carrying out impact analysis on the existing components based on the changes proposed as part of the enhancement.
- Developing new Informatica mappings/workflows based on the requirement outlined by business users for new capabilities.
- Modifying existing mappings/workflows as per the business requirement to integrate with new feature addition.
- Reviewing various SDLC documents like LLD's, Build, UTC's and UTE's prior to QA testing.
- Carrying out peer review of Informatica workflows/mappings to ensure adherence to the standards and best practices.
- Performing Unit testing to ensure ETL code satisfies all test cases and to avoid any performance bottlenecks at various stages of the data flow.
- Providing assistance in System testing and supporting the release team to test and deploy the ETL code successfully to the production environment.
- Creating Autosys command jobs and box jobs to schedule Informatica workflows for batch processing.
- Analysis of defects raised by business users to identify the root cause and implement code changes/fixes to resolve the issue.
Confidential
Sr. Data warehouse developer
Responsibilities:
- Identifying the Informatica bottlenecks by simulating the production load scenarios in lower environment and resolve them by implementing much optimal options available.
- Implementing Informatica Partial and Full pushdown optimization as required to reduce the overall processing time.
- Replacing current ODBC target connections with much faster connections for target writers by utilizing TPT utilities as needed.
- Developing automated scripts to scan through the job logs to identify the long running processes which are causing the overall delay in processing.
- Collecting runtime metrics for the long running process to identify the potential candidates for performance tuning.
- Introducing the incremental load instead of full table rebuilds to avoid redundant data load.
- Identifying and implementing the correct statistics collection process at the table level to optimize the SQL code using multiple joins.
- Introducing new Primary indexes and secondary indexes to avoid full table scans for the tables with huge data volume.
- Implementing table level partitioning to enhance the performance wherever a selective dataset is fetched, ensuring no full table scan.
Confidential
Sr. Data warehouse developer
Responsibilities:
- Used Informatica 9.6 to extract, transform and load data from multiple input sources like Oracle, SQL Server, Flat files into Teradata v14.
- Worked on Source Analyzer, Warehouse Designer, Mapping, Mapplet Designer and Transformations, Repository Manager, Workflow Manager and Workflow Monitor.
- Used Python framework to enable logging and last step record to allow the recovery during runtime interruption or fresh start on the completion of job.
- Implemented Python ETL jobs using Teradata utilities like Fast-load, BTEQ, Fast Export to load data into database to achieve high performance while loading huge amount of data into Teradata.
- Created Informatica Mappings to load data using transformations like Source Qualifier, Sorter, Aggregator, Expression, Joiner, Normalizer, Union, lookups, Filters, Sequence, Router and Update Strategy.
- Involved in the development of Informatica mappings and used Informatica partitioning (parallel processing capabilities, Session-Partitioning and Target Table partitioning utilities) for performance tuning thereby reducing the load time.
- Used Informatica Power Exchange for Oracle using Oracle log-miner to read change data from oracle redo logs.
- Worked upon Power Exchange Listener starter and Logger shut-down process in cold and warm start mode.
- Used Informatica partial/full push down optimization to load data with improved performance.
- Implemented Slowly Changing Dimensions (SCDs: Type 1, Type 2 & Type 3).
- Conducted peer-reviews for the code changes and was actively involved in the regression testing of the changes/fixes implemented.
- Analyzed the existing database schemas and designing star schema models to support the users reporting needs and requirements.
- Used Maestro and UC4 Scheduling Tool to schedule UNIX shell scripts, PL/SQL scripts, Informatica jobs and automated the entire process.
Confidential
Data warehouse developer
Responsibilities:
- Worked with the business users as well as technical teams in analyzing and understanding the existing structure of the framework and Data model.
- Led the design team in coming up with data provisioning framework.
- Worked alongside technology/application leads in segregating applications based upon their roles as source and consumers.
- Led the design, creation and implementation of integration framework using Informatica Power Center and ORACLE.
- Manage design review and approval of solution designs, technical specifications, and ETL processing designs.
- Designed ETL mappings using Informatica Designer 8.x and Oracle PL/SQL to extract the data from Oracle databases and Flat files.
- Created Informatica Power Center mapping / mapplets, reusable transformations using all the transformations like normalizer, lookup, filter, expression, stored procedure, aggregator, update strategy, sessions and workflows etc.
- Worked extensively in Oracle PL/SQL, Toad, SQL*PLUS, Query performance tuning, created DDL scripts, created database objects like Tables, Indexes, stored procedures etc.
- Created utility programs for file manipulations, pre-session, post-session using shell scripts and also used Autosys for scheduling workflows.
- Involved in ETL testing created Unit test plan and Integration test plan to test the mappings and created the test data.
- Analyzed the ORACLE stored procedures to convert them into efficient Informatica mappings.
- Defined the collection strategy and integration of data from multiple enterprise business solution sources.
- Managed design review and approval of solution designs, technical specifications, and ETL processing designs.
- Worked on Optimizing and Tuning of the Mappings & Sessions in Informatica by employing Partitioning and using Log Statistics, thereby improving the performance.
- Ensured that all application code adheres to industry/company best practice design principles.
- Supporting System / User Acceptance testing for defect fixing and clarifications.
- Support to DBAs/ Release Management teams in deployment of systems in production.
- Worked with COGNOS Reporting team to enhance the performance of existing reports on a regular basis.
Confidential
Informatica Developer
Responsibilities:
- Requirement gathering, effort estimation, coding and delivering completely packaged Informatica/Teradata ETL processes.
- SPOC for Development, Maintenance and Production Support of Informatica, UNIX and Teradata processes.
- Delivered hard deadlines on Projects, conducted validation of ETL code and data, provided post-Production support.
- Identified and fixed bottlenecks and tuned the complex Informatica mappings for optimized performance.
- Coordinating with source system owners, performed data migration and monitored day-to-day ETL progress, Data warehouse target schema Design and maintenance.
- Strategically used ELT (Extract, Load, Transform) approach to increase performance of poor performing processes by using the features of relational database systems.
- Involved in performance tuning and optimization of Informatica mappings and session using features like Session Partitioning.
- Development, Maintenance, Production Support for scheduled ETL jobs that run on WLM scheduler.
- Provided administrative and operational support of the Informatica environment - Production Releases, configuration, monitoring, maintenance, and Backups.
- Peer reviewed, QA tested and approved ETL code readiness for Production releases.
- Generation of high level business reports and involved in the analysis.
- Design and development of strategy for the extraction and cleansing of data.
- Ensuring any technical environmental issues at offshore are closed without any impact on the project plan.
- Organizing/Conducting sessions for Newcomers to familiarize them with the various concepts in Teradata and Informatica.
Confidential
Informatica Developer
Responsibilities:
- Participated in requirement gathering meetings with Business Analysts and ETL Architects to understand source and data warehouse data models, technical metadata and transformation rules.
- Used Informatica shared folder and shortcuts to reuse objects without creating multiple objects in the repository and inherit changes made to the source automatically.
- Guide the Quality Assurance team to validate the reports against front end applications and solve related issues.
- Developed complex mappings using multiple sources and targets in different databases, flat files.
- Worked on Informatica Power Center tool-Source Analyzer, Mapping and Mapplet Designer, Transformations, Informatica Repository Manager, Informatica Workflow Manager and Workflow Monitor
- Creation of Transformations like Lookup, Joiner, Rank and Source Qualifier Transformations in the Informatica Designer.
- Applied performance tuning on targets, sources, mappings and sessions to improve system performance.
- Performed Unit testing of Informatica Mappings, Workflows, Mapplets to ensure performance.
- Participated in out-of-hours support duties. Met SLAs by running data loading ETL jobs during peak time and weekends.
- Coordinating a production support team from Infosys offshore with Cognizant offshore and cognizant onshore production team.
- Responsible for monitoring ticket queues and resolving technical operation issues.
- Performed production implementations and solved production issues by working with development teams.
- Monitoring the databases used for the development of other projects running under same account.
Confidential
Informatica Developer
Responsibilities:
- Involved in discussions with Leads for requirement gathering, understanding the requirements and explanation of technical probabilities and possibilities with business users.
- Created Time Estimate Proposal document with estimation of hours required for completion of each ETL task.
- Worked with data modeler and business users for designing of tables.
- Extensively used SCD’s (Slowly Changing Dimension) to handle the Incremental Loads.
- Extensively Used Source Qualifier, Filter, Connected Lookup, Unconnected Lookup, Update strategy, Router, Aggregator, Sequence Generator.
- Developed mappings to handle exceptions and discarded data.
- Created UNIX scripts for the batch run of BTEQ scripts.
- In systems analysis logical data models are created as part of the development of new databases.
- Assisted data modelers in database design and successfully brought database to 2nd Normal form.
- Worked with DBA's to identify the candidate tables to create Partitions, adding Indexes to the columns.
- Unit testing the data and report generation for review of business users for special accounts.
- Developed UNIX shell scripts and used PMCMD to execute the workflows.
- Exported the workflows from Repository Manager, checked the workflows into VSS (Version Control Management tool).
- Prepared documents for QA and PRODUCTION migration.
- Worked with business users and QA team during testing phases.
- Created Test cases for Integration testing and UAT. Reviewed UAT with business users.
Confidential
Informatica Developer
Responsibilities:
- Analyze, design, construct and implement the ETL jobs using Informatica.
- Extracted Data from Different Sources by using Informatica.
- Extracted data from different sources of databases. Created staging area to cleanse the data and validated the data.
- Extensively used Informatica client tools Source Analyzer, Mapping designer, Mapplet Designer, Transformations.
- Cleanse the source data, Standardize the Vendors data, Extract and Transform data with business rules, and built Mapplets using Informatica Designer.
- Designed and developed complex Aggregate, expression, filter, join, Router, Lookup and Update transformation rules.
- Developed schedules to automate the update processes and Informatica sessions and batches.
- Defect tracking for the entire project in all the environments i.e. Development, SIT, UAT.
- Preparation of unit test plan, integration test plans and user test plans.
- Creation of Technical Design Documents by referring to the High level design documents.
