Metadata And Data Quality Architect Resume
Des Moines, Iowa
OBJECTIVE:
To grow as a professional in the IT/Consulting industry in a role that lets me use my Data Management skills to the fullest and ensures that the employer gains the most from my knowledge and experience
PROFESSIONAL SUMMARY:
- Over 11 years of experience in Data Management and Data Quality improvement across organizational projects, covering end-to-end implementation from various sources to Data Warehouses and Data Marts using Informatica tools
- Seasoned Consultant with proven track record in providing out of the box solutions to Data challenges within an Enterprise
- Experienced in Data Stewardship, Master Data Management (MDM), MDM Member domain (Patient), MDM Provider domain (Prescriber), Member Enrollment, Claims, and the Data Governance aspects of Data Management in the Healthcare industry
- Expertise in providing end-to-end ETL (Extract, Transform, and Load) solutions using the Informatica PowerCenter (PC) tool.
- Expert in Performance Tuning of sources, targets, mappings, and sessions to overcome bottlenecks in ETL processing using Informatica PowerCenter (PC)
- Perform Data Quality Discovery and Profiling using Informatica IDQ Analyst and Developer to identify the current state of Raw Source Data and propose a remediation strategy for Data Quality issues (Content, Completeness, Cardinality, Uniqueness, Validity, Accuracy, Correct Format and Pattern, etc.) based on the Business rules set forth by the system of record owners and SMEs
- Expertise in providing metadata management and end-to-end data lineage solutions for organizations using Informatica Metadata Manager (IMM) and the Analyst tool (IA).
- Expertise in Master Data Management concepts and methodologies, with the ability to apply this knowledge in building master data management solutions using Informatica MDM and IDD tools.
- Expertise in Reference Data Management (RDM) concepts and methodologies, with the ability to apply this knowledge in building an enterprise reference data warehouse using the Informatica RDM/MDM accelerator and IDD tools.
- Installed the Informatica Domain and Application services such as PowerCenter, Integration, Data Integration, Model Repository, and PC Repository Manager.
- Proficient in the overall process of gathering requirements from the business, converting them into technical requirements for the development team, and overseeing end-to-end execution of these requirements
- Proficient in the Project Life Cycle and Software Development Life Cycle; have managed and successfully completed projects end to end
KEY TECHNICAL SKILLS:
- Proficient with the following programming languages, Databases, Technologies and Tools, including but not limited to:
- Informatica data quality V9.1, V9.6, V9.7, V10.2
- Informatica Metadata Manager V8.6, V9.1, V9.6, V10.2
- Informatica PowerCenter V8.5.1, V8.6, V9.1, V9.6, V9.7, V10.2
- Informatica MDM hub, Informatica data director (IDD) V10, V10.1
- Informatica PowerExchange V9.1, V9.6, V10.2
- Oracle 12c, 11g, 10g, 9i | Microsoft SQL Server | Teradata | Sybase IQ
- TWS scheduler, Control-M scheduler, ActiveBatch scheduler, ServiceNow, Manage now
- Working Knowledge of UNIX/LINUX Shell Scripting
- Leader with a proven track record of leading teams and Projects to deliver better Data Quality in Organizations
- Well versed in Client Interaction for all-hands meetings, requirement gathering and annual team gatherings
- Constant yearning for learning
- Well versed in the induction, selection, and training of new joiners
WORK EXPERIENCE:
Confidential, Des Moines, Iowa
Metadata and Data Quality Architect
Key Tools:
- Informatica data quality V9.1, V9.6, V9.7, V10.2
- Informatica Metadata Manager V8.6, V9.1, V9.6, V10.2
- Informatica PowerCenter V8.5.1, V8.6, V9.1, V9.6, V9.7, V10.2
- Informatica MDM hub, RDM accelerator, Informatica data director (IDD) V10, V10.1
Responsibilities:
- Conducted workshops with Business stakeholders to identify the current business challenges and categorize all challenges into two categories (Business Process, Technical Process)
- Perform Data Quality Discovery and Profiling using Informatica IDQ Analyst and Developer to identify the current state of Raw Source Data and propose a remediation strategy for Data Quality issues (Content, Completeness, Cardinality, Uniqueness, Validity, Accuracy, Correct Format and Pattern) based on the Business rules set forth by the system of record owners and SMEs
- Define/Design a metadata management framework that can be leveraged at the enterprise level to manage enterprise metadata for all the source systems.
- Extract and load metadata from disparate source systems into the MM warehouse using Informatica Metadata Manager packaged models as well as custom models, and create data lineage among these systems.
- Create and Govern the Enterprise Business Glossary (BG) using the Informatica Analyst tool by working closely with business owners.
- Extract Business Glossary metadata from Informatica Analyst and create links between Business metadata and Technical metadata extracted from all the source systems within the organization.
- Build an enterprise warehouse of reference data using the Informatica RDM/MDM accelerator. This process involves extracting reference data from source systems such as EDW, CDMA, STAR, and the Gen1 application and loading it into MDM landing tables using Informatica PowerCenter (PC). The Informatica MDM stage batch group and load batch group then load the data into MDM base tables.
- Configure the IDD application to manage and Govern Enterprise reference data (RDM) on an ongoing basis.
- Build ETL processes to publish reference data to all downstream applications using Informatica PowerCenter (PC).
Confidential, New York, NY
ETL Lead and Data Quality Consultant
Key Tools:
- Informatica PowerCenter V9.6.1 HF3
- Informatica Developer V9.6.1 HF3
- Informatica MDM V10.1
- Oracle 12c - SQL Developer, Microsoft SQL Server
- Control-M, Linux scripting, ServiceNow
Responsibilities:
- Conducted workshops with Business stakeholders to identify the current business challenges and categorize all challenges into two categories (Business Process, Technical Process)
- Analyzed different data sources as input to data quality processes such as data cleansing and data standardization.
- Identify critical/governable data elements and define MDM logical data model adhering to data modeling standards
- Work closely with the MDM implementation team to prioritize and incorporate data governance technical pain points. Also interact with the business team around data steward UI customization to perform data steward operations
- Based on data anomaly and match/merge analysis, work closely with the business to define new data policies or identify changes needed within existing standard operating procedures.
- Worked with different Source systems, Product owners, Business Analysts and the MDM Architect to capture and document all the requirements for the Member/Provider domain.
- Worked closely with the Enterprise and Solution Architects to finalize the ETL architecture. The ETL process extracts the source systems' data and transforms it into the MDM common record format (CRF)
- Designed the ETL CDC (Change Data Capture) process, which extracts data from the DQ database and transforms it into the Informatica MDM CRF format before loading it into the MDM landing layer.
- Designed the Reference Data Management (RDM) process to load enterprise reference data from various systems into a centralized RDM database using Informatica Data Quality (IDQ) tools.
- Built different Informatica ETL components such as Mappings, Mapplets, Transformations, Session tasks and Workflows to implement the ETL CDC mechanism.
- Created complex ETL mappings to load data using transformations like Source Qualifier, Sorter, Aggregator, Expression, Joiner, Dynamic Lookup, and connected and unconnected lookups, Filters, Sequence Generator, Router and Update strategy
- Implemented an audit, balance, and control mechanism to manage and maintain Informatica job loads, monitor jobs effectively for auditing purposes, and reprocess failed records
- Performed various exercises for Performance Tuning of sources, targets, mappings, and sessions to overcome bottlenecks in ETL processing
- Built Data Quality components such as transformations, mapplets, and mappings and exported them to Informatica PowerCenter as mappings to create the sessions and workflows. These workflows are used to load the reference data into the RDM database
- Extensively used Informatica Data Quality transformations - Labeler, Parser, Standardizer, Match, Association, Consolidation, Merge, Address Validator, Case Converter, and Classifier.
- Configured MDM Stage jobs, Load jobs, Match and Merge jobs using Informatica MDM hub console.
- Created Exact and Fuzzy match rules, performed multiple iterations on these rules, and discussed the result sets with the Product owner, repeating the iterations until the match and merge rules were finalized.
- Created different subject areas in IDD using base objects and the relationships among those base tables, and deployed them as applications.
- Created and saved custom searches and queries in IDD so that the product owner can reuse them.
- Scheduled ETL CDC jobs, Data Quality and RDM jobs, and MDM batch jobs using the Control-M Scheduler.
Confidential, Pleasanton, CA
Sr. Architect and Developer
Key Tools:
- Informatica PowerCenter V9.7
- Informatica Developer V9.7
- Informatica MDM V10
- Oracle - SQL Developer, Unix Scripting, Control-M Scheduler
Responsibilities:
- Initiated Engagement process with business users to understand the project requirements.
- Architected and designed the MDM hierarchy solution using Informatica MDM Hierarchy Manager.
- Architected and designed the ETL process to load data from spreadsheets directly into MDM staging tables using the Informatica Data Quality (IDQ) tool.
- Created the Entity and Relationship data model to build the hierarchies.
- Configured Landing, staging, and base tables from entity relationship model using MDM hub console.
- Built the MDM hierarchy packages (Put and Display) and profiles to determine how the data should be displayed in MDM Hierarchy Manager and IDD UI.
- Developed the ETL code using Informatica Developer to load the Entity and Relationship staging tables from the Excel source.
- Created various data quality mappings in the Informatica Data Quality tool and imported them into Informatica PowerCenter as mappings and mapplets to create the Informatica workflows.
- Scheduled Informatica workflows and MDM batch groups using Control-M scheduler.
- Built and deployed IDD application and configured the subject areas to govern the hierarchical data.
- Built basic and advanced queries to view and modify the hierarchical data through IDD.
- Built IDD user exits to extend the MDM functionality.
Confidential, Pleasanton, CA
Sr. Architect and Developer
Key Tools:
- Informatica PowerCenter V9.6 HF3
- Informatica Metadata Manager V9.6 HF3
- Informatica Analyst V9.6 HF3
- Oracle - SQL Developer, Teradata, DB2 Client, Unix, Control-M Scheduler
Responsibilities:
- Initiated Engagement process with ASG Rochade team to understand the project requirements.
- Architected and designed the Informatica MM solution based on the business requirements.
- Prepared the Technical requirement document based on project/business requirements from the project team.
- Developed Custom Model for 7 different region files to bring in custom metadata into MM warehouse.
- Connected to Teradata and DB2 using Packaged Model to bring in Clarity metadata into the MM warehouse.
- Developed PowerCenter ETL process to load csv files required for custom X-CONNECTS
- Scheduled Custom and Packaged Resources to load the Clarity metadata into the MM warehouse
- Created linking rule files and enumerated link files to link metadata across source systems.
- Ran data lineage analysis to perform impact analysis and root cause analysis (RCA).
Confidential, Minneapolis, MN
Sr. ETL (Informatica) and Data Warehouse Tech Lead
Key Tools:
- Informatica PowerCenter V8.5.1, V9.1, Informatica PowerExchange
- Informatica Developer/Analyst V9.1, Informatica Metadata Manager V9.1
- Oracle - SQL Developer, Teradata Utilities, MS SQL Server, Mainframe DB2, Flat files, z/OS and VSAM files
- Control-M and TWS Scheduler
Responsibilities:
- Initiate the Engagement process, provide project estimation and Impact analysis, and translate business requirements into technical requirements.
- Initiate the Engineering and Design process; work with Architects to identify dependent systems, impacts to existing systems, and data flow and accuracy between the systems.
- Design and develop appropriate reusable components. Create the Informatica mapping sheet, which derives the relationships between source, lookup, and target systems, and validate it against business rules and data definitions.
- Create Job flow diagram, Architecture and Data Model diagram.
- Develop, test and implement Informatica workflows, sessions, command tasks, mappings, reusable mapplets and transformations. Implement performance techniques and best practices in each and every component
- Develop, test and implement Unix Shell scripts, FTP and SFTP file transfers, Teradata BTEQ, FastLoad, MLoad, TPump, and FastExport scripts, and Oracle/DB2 stored procedures and bulk loads
- Design and develop data warehouse solutions, integrate new files and data from other applications into the Enterprise data warehouse, and utilize Teradata for reporting, OLAP and history.
- Develop and migrate PowerExchange data maps to read VSAM/COBOL files, and perform source and target imports from various systems.
- Design and develop prechecks such as file validation, duplicate and empty file checks, and date and count checks, and post checks such as post balancing, checks and balances, error checks, and referential integrity.
- Design and develop operational and non-functional requirements such as job restartability at the Informatica task and workflow level and automated batch recovery, and create the necessary documents.
- Create the job schedule and properties document; work with the IBM TWS team to implement the jobs in production and non-production environments.
- Design and develop Control-M and TWS jobs that send incident/email notifications for late starts, long-running jobs, and late completions to production support and application teams.
- Perform unit, regression, system integration, load/stress, and UAT testing; work with application/business teams to get UAT sign-off.
- Conduct Design, Implementation and production support transition walkthroughs and deliver all operational/batch recovery documents. Provide warranty support and implement fixes for PCRs and production incidents.
- Apart from application development, maintain and support the Informatica V8.5.1 and V9.1 environment/platform; responsible for bringing up application services such as the Integration Service, Repository Service, MM Service, and Analyst Service when an outage happens. Work with the IBM, VERITAS and Informatica vendor teams if the services do not come up or hang with unknown issues.
- SME support for incident and problem management and production batch issues.
- Led a team of 10+ offshore and 2+ onsite. Worked hand in hand with the offshore team for round-the-clock support, with daily calls to delegate tasks and update them on the progress and next steps of the project
Confidential
Sr. Informatica/Teradata Developer
Key Tools:
- Informatica PowerCenter V8.6
- Teradata 12, BTEQ, MLOAD, FLOAD, TPump
- Unix Shell scripting, TWS
Responsibilities:
- Expertise in creating the detailed design document and source-to-target mapping (STM) based on the high-level design and business requirements.
- Created complex ETL mappings using transformations like Source Qualifier, Sorter, Aggregator, Expression, Joiner, Dynamic Lookup, connected and unconnected Lookups, Filter, Sequence Generator, Router and Update Strategy to load data from SQL Server to Teradata staging tables.
- Expertise in creating FLoad/MLoad/BTEQ/TPump scripts to load data from staging to the data warehouse
- Worked on Unix Shell scripting to build audit and balance frameworks.