Senior Data Quality Engineer/informatica Developer Resume
San Francisco, CA
SUMMARY:
- Almost 9 years of IT experience in software development life cycle which includes analysis, design, development, Data Warehouse, ETL, testing
- Experience in Data Warehouse development working with Data Migration, Data Conversion, Data Cleansing and Extraction/Transformation/Loading using Informatica Power Center with Oracle, SQL Server and Heterogeneous Sources
- Extensive experience in the Analysis, Design, Development, Quality Assurance & Testing, Implementation and Maintenance of Data Warehouse applications and Databases.
- Experienced in Data Modeling, making use of Dimensional Data Modeling, Star Schema, and creating Fact and Dimension tables.
- Extensively worked on Power Center Mapping Designer, Mapplet Designer, Transformation developer Warehouse Designer, Workflow Manager, Repository Manager and Workflow Monitor
- Working knowledge on all the data quality transformation which will be used throughout the development.
- Used IDQ to profile the project source data, define or confirm the definition of the metadata, cleanse and accuracy check the project data, check for duplicate or redundant records, and loaded into exceptional tables.
- Experience in performance tuning of sources, transformations, mappings, targets, and sessions
- Extensively used SQL and PL/SQL to write stored procedures, functions, packages and triggers.
- Experience in UNIX shell scripting.
- Exceptional analytical and problem - solving skills.
- Ability to write complex Test Plans, Test Cases and Test Scripts.
- Strong skills in performing Functional, Unit, Integration, GUI, Regression and User Acceptance Testing.
- Strong organizational, time-management, analytical and problem-solving skills.
- Strategic thinker, mentor and team player with good communication and problem-solving skills.
TECHNICAL EXPERIENCE:
ETL Tools: Informatica Power Center 10.2/9.6/9.1/8.6/7.1 , IDQ 9.1/9.6/10.2 , Informatica Rule Point
Data Modeling Tools: Erwin 3/ 4, Visio
Scheduling Tools: Informatica Scheduler, Autosys, Microsoft Visual Studio 2015, Tortoise SVN Browser
Data Modeling: Relational, Dimensional (Star, Snowflake Schema)
Performance Tuning: SQL Tuning, Explain Plan, Table Partitioning, Materialized views
RDBMS: Oracle 11g/10g/ 9i/8i, SQL Server 2005/2008, MS Access, TeradataVersion12, TOAD, SQL Loader, SQL* PLUS, SQL Developer, Agility Netezza 4.9.1
Environment: s Windows NT/98/95/2000/XP, UNIX
Languages: SQL, PL/SQL, C, expose to Java/J2EE, UNIX Shell scripting
Agile: JIRA, Bit Bucket, Bamboo, Scrum
PROFESSIONAL EXPERIENCE:
Confidential, San Francisco, CA
Senior Data Quality Engineer/Informatica Developer
Responsibilities:
- Responsible for analysing, assessing, and documenting diverse data from its source through many complex transformations to all applications that consume the data.
- Responsible for identifying the issues from the profiling and follow up with the business and document the business rules and Data Quality rules that would be used for improving the quality of the data.
- Performed profiling using Informatica Analyst and DQ scorecards in IDQ Developer/Analyst
- Investigating invalid/erroneous data sources by identifying the root causes of data integrity issues, and creating corrective processes and systems to prevent reoccurrence
- Design, Development and implementation of Informatica Developer Mappings for data cleansing using Address validator, Labeler, Association, Parser, Expression, Filter, Router, Lookup transformations etc.
- Design, Development and implementation of Power center mappings for DQ Framework using SQL, Normalizer, Expression, Filter, Router, Lookup transformations etc. and tuning of mappings for better performance.
- Imported mapplets and mappings from Informatica developer (IDQ) to Power Center.
- Worked code migration process using Tortoise SVN Browser and Tidal for scheduling.
- Developed and implemented the UNIX shell script for archival and email notification.
- Worked on Customer communication project to parse Json schema using data processor transformation in Informatica developer tool and exported mapping in Informatica power center to load the data into relation tables
Environment: Informatica PowerCenter 10.2, Informatica Data Quality 10.2, Oracle 11g, PL/SQL, Agility Netezza 4.9.1, Tortoise SVN Browser, JIRA v6.4.13, Microsoft Visual Studio 2015, Shell Scripting
Confidential, San Francisco, CA
Apps System Engineer
Responsibilities:
- Analysed highly complex business requirements and developed technical design documents and process diagrams and presented among business users, testing and technical teams.
- Extracted the data from the flat files and RDBMS databases into staging area and populated into Data warehouse based on the business rules and transformation specifications.
- Validation, Standardization and cleansing of data will be done in the process of implementing the business rules.
- Implementation of data profiling, creating score cards, Creating tables and documenting Data Quality metrics/dimensions like Accuracy, completeness, duplication, validity, consistency.
- Created business rules in Informatica Developer and imported them to Informatica power center to load the standardized and good format of data to staging tables.
- Used IDQ’s standardized plans for addresses and names clean ups.
- Used Session parameters, Mapping variable/parameters and created Parameter files for imparting flexible runs of workflows based on changing variable values.
- Performance tuning is performed at the Mapping level as well as the Database level to increase the data throughput
- Used various inbuilt functions like Date, Character, Special functions to calculate new fields in the mapping using power center Designer tool.
- Developed ETL processes using ETL control tables, error logging, and auditing.
- Wrote UNIX shell Scripts & PMCMD commands for FTP of files from remote server
- Working on Informatica Rulepoint to develop and validate rules based on eligibility matrix of team member for complex data analysis.
- Used session logs, verbose data and debugger to debug the session to identify the bottlenecks in the sources, targets, mappings, sessions and resolved the problems.
- Worked on existing mapping for the performance tuning to reduce the total ETL process time.
- Followed standards for build procedure to deploy code in testing and production environments using visual studio and informatica labels.
- Created and executed unit test plans based on system and validation requirements and provided production support.
- Created and scheduled jobs with LSF flow manager.
Environment: Informatica PowerCenter 9.6, Informatica Data Quality 9.6, Oracle 11g, PL/SQL, Toad, IBM LSF Flow Manager, JIRA v6.4.13, Microsoft Visual Studio 2015, Informatica Rule Point, Shell Scripting
Confidential, LA, CA
Technical Lead
Responsibilities:
- Worked with Data architect and BSA to understand specific user expectation, source data from different data sources (XML DB2, flat files, SQL Server)
- Prepared scripts to create Tables, Views, Indexes, Triggers, and Store Procedures and DML scripts for metadata for base tables.
- Developed mappings that perform Extraction, Transformation and load of source data using various power center transformations to meet business logic in the mappings.
- Designed and developed ETL and Data Quality mappings to load and transform data from source to Application tables.
- Working knowledge of Informatica Data Quality transformations like Address validator, Match, Exception, Association, Standardizer and other significant transformations
- Extensively involved in Performance tuning of the Informatica ETL mappings by increasing the caching size, overriding the existing SQLs in Source Qualifier and Lookup
- Used Debugger, session logs and other error handling techniques to validate the Mappings and to figure out the Bottlenecks and tuned them for better performance.
- Involved in unit testing of the code to make sure that the data is flowing correctly through all the transformations used.
- Migrated codes from Dev to Test environments using source tree and supported QA team with code deployment related issues.
- Involved in creating, scheduling autosys jobs to be run in a batch data processing
- Supported the Quality Assurance team in testing and validating the Informatica workflows.
- Used Error handling strategy for trapping errors in a mapping and sending errors to an error table.
- Wrote UNIX shell Scripts & PMCMD commands for FTP of files from remote server and backup of repository and folder.
- Worked in agile environment and attended sprint planning and daily status meetings.
- Supported offshore team and take up evening calls.
Environment: Informatica PowerCenter 9.6, Informatica Data Quality 9.6, Oracle 11g, PL/SQL, Toad, RapidSQL, MS-Excel, XML, Flat files,, Shell Scripting, OBIEE 10.3.2, Autosys, JIRA v6.4.13, SourceTree for GIT 1.6.20
Confidential, San Diego, CA
Informatica Developer
Responsibilities:
- Worked with business Analysts and Architects to get the functional specs and design documents in order to get a better understanding of the user requirement.
- Involved in analysing the source data coming from different Data sources such as DB2, flat files etc.
- Extensively used connected and unconnected lookups, Update strategy, Router transformations in the mapping to bring in new fields into target table.
- Responsible to monitor data during the daily and weekly loads and fix the issues related to load failures.
- Used IDQ to complete initial data profiling and removing duplicate data.
- Used IDQ’s standardized plans for addresses and names clean ups
- Extensively used Sorter, Filter, Expression, Consolidation, Match, Exception, Association and address validator transformations in IDQ.
- Worked with Teradata 12 utilities like BTEQ, Fast Load, multi Load and Query man.
- Updated numerous BTEQ /SQL scripts, making appropriate DDL changes and completed unit and system test.
- Worked on Teradata SQL Assistant querying the source/target tables to validate the BTEQ scripts.
- Converted existing PL/SQL Packages to ETL Mappings using Informatica Power Center.
- Used Error handling strategy for trapping errors in a mapping and sending errors to an error table.
- Created various reports to fulfil the marketing needs and customized the views using Oracle BI Answers.
- Used Informatica features to implement Type-2 changes in slowly changing dimension (SCD) tables.
- Schedule the job through IBM LSF Flow Manager.
- Involved in Performance Tuning at various levels including Target, Source, Mapping, and Session for large data files.
- Supported the Quality Assurance team in testing and validating the Informatica workflows.
Environment: Informatica PowerCenter 9.1, Informatica Data Quality 9.1, PL/SQL, TeradataVersion12, SQL*Plus, Erwin 4.0/3.5.2, DB2, MS-Excel, XML, Flat files, Windows NT, Shell Scripting, OBIEE 10.3.2.
Confidential
Informatica Developer
Responsibilities:
- Integrating functional knowledge of the client’s Business with technical requirements.
- Design and Development of ETL routines, using Informatica Power Center Aggregator, Ranking, Mapplets, connected and unconnected Lookups, SQL overrides usage in Lookups and source filter and data flow management into multiple targets using Routers were extensively done.
- Extracted data from various sources using transformations and populated the data marts.
- Implemented error logging logic to capture the rejected/wrong data in the mapping for further analysis.
- Provide support during User Acceptance Testing (UAT) of project.
- Developed mapping to implement type 2 slowly changing dimensions
- Created Star and Snowflake Schema for the Data Model Designing.
Environment: Informatica Power Center 7.x, Oracle 9i, UNIX, Windows NT, Toad, Control M.
Confidential
Software Engineer
Responsibilities:
- Involved in gathering Business requirements and technical aspects for development.
- Scheduling, monitoring, coding and testing of custom PL*SQL modules to load data into the data warehouse from the legacy systems.
- Working with Erwin tool in Data Modeling (both Physical and Logical Design).
- Interpreted logical and physical data models for Business users to determine common data definitions and establish referential integrity of the system.
- Worked with mappings using expressions, aggregators, filters, lookup, update strategy and stored procedures transformations.
- Designed ETL process using Informatica Tool to load from Sources to Targets through data Transformations
- Created reusable Mapplets and transformations
Environment: Informatica Power Center6.1.1, Oracle7.0, SQL, PL/SQL, TOAD.
