Aws Big Data / Informatica Platform Architect Resume
SUMMARY:
- I have extensively worked on suite of Informatica products (Power Center, Power Exchange, Informatica Data Quality, Metadata Manager, Dash Board & reporting, TDM - Data Masking Data Sub setting, Data Archiving and Data Validation etc.), as an Informatica administrator and Knowledge of Informatica BDM (Edge Node & Blaze Monitoring).
- I was involved in establishing Business Intelligence Competency Center, Data Governance, Metadata Maturity Model For Business Glossary and Information linkage to data assets.
- Implemented Enterprise Data Warehouse for a Confidential which includes many subject areas like Core Bank, Debt Investments, Foreign Trade, Fixed Deposits, Credit swaps, Anti Money Laundry, Basel II etc.
- Proficient in provisioning AWS cloud technologies like VPC, IAM Security group, EC2 platform, EBS, EFS, S3, Glacier storage, RDS and Redshift databases.
TECHNICAL SKILLS:
DWH ETL Tools: Specialized in Informatica 9.6.1 HF2, Power Exchange 9.6.1 HF2Informatica IDQ 9.6.1HF2, ILM-TDM 9.6.1 HF1 and B2B etc.
Databases: Oracle All Version, SQL Server
Oracle HA: RAC on Linux/Solaris, shareplex, golden gate, VERITAS Cluster etc.
EMC Storage: Tier 2 Clariion CX Series.2
OS: Solaris/SunOS, Red hat Linux, IBM AIX, Windows
File Systems & VM: VERITAS Volume Manager, VXFS, VERITAS cluster file systemASM, NFS, ZFS, CIFS etc.
Databases: Oracle 7/8i/9ir2/10gr2/11gr2/12c, DB2 UDB 9.7, SQL Server 2k, 2008
Data Quality: Informatica IDQ (Athenor and Axio), Siperian MDM Administration.
BI Tools: OBIEE 10g, 11g
Storage: Tier 2 NetApp (ONTAP 8.1)
Apps servers: IBM Web Sphere 6.1,15, 7.0.0.17, Hadoop Cloudera Hadoop Cluster (HDFS)
RDBMS: Oracle 11g, 10g R2, 9.2.6(9iR2), 9.0.1(9iR1), 8.1.x(8i), 7.3.4,8.0.x, Oracle Exadata, SQL Server 2005, 2000, DB2 UDB 9.7,9.5,9.1,8.2
Replication/Integration tools: SharePlex 5.1, Oracle Streams, Power Exchange
ETL/BI Tools: OBIEE 10.x (DAC, Informatica, OBIEE Reporting tools, BI Apps), Informatica PC 5/6/7/8.1.1,9.0.1, PWX 5.2/8.1.1, Metadata Manager 8.1.1, Informatica IDQ Athenor/Axio etc
Operating Systems: Solaris 10, 9, 2.7, 2.6, 2.5.1, Red hat 4.x/5.x/6.2, IBM AIX 4.1, 5.1, 5.2,6.1
Query Languages: SQL ISQL PL/SQL, TSQL.
Hardware Worked On: Sun Enterprise Server 3500,3400,Sun fire 10k, 12k, 6800, 3800, 880, 480
Storage & SAN: DMX1000, DX4, Clariion CX4, CX700, CX600 (SAN/SAN). Netapp FAS 6040
AWS Cloud: VPC, EC2 Platform, VSphere 5.1, S3, Glacier, Redshift
WORK EXPERIENCE:
Confidential
AWS Big Data / Informatica Platform Architect
Responsibilities:
- Upgraded Informatica 9.6.1 to 10.2.0 and HF1, Installed and setup new IDQ & IMM environment, migrated Informatica repositories from DB2 to Oracle Exadata platform, integrated in Informatica domain with LDAP, enabled LDAP authorization and authentication, updated CA certificate and Java Keystores.
- Installed and upgrade DVO resolving DVO issue, configured Solace JMS and resolving JSM issue as needed, part of Bigdata POC project.
- Provisioned EC2 Instances with EBS volumes, created encryptions key pairs for connectivity provisioned S3 shared across multiple systems, configure data files archiving policies. established connectivity between on premise system, AWS systems, Informatica and leveraging lambda, kinesis stream SQS, Big Data Connector for EMR and DynamoDB to maximize integration, installed and configured Informatica 9.6.1 on oracle database and migrated Informatica IDQ, MM, PC, TDM and PWX.
- Created data warehouse schemas into Redshift and running in parallel as pilot.
- Installed and configured Informatica suite of products (PowerCenter, Informatica Data Quality, Metadata Manager, Test Data Manager, Data Masking, PowerExchange and configure archiving policies on glacier, Administrating Informatica services, configured PowerExchange client to extract data from legacy VSAM files, AS/400 and load it into Redshift, configured EMR, EMRFS with master node and multiple core nodesCurrent OLTP application are mostly deployed on Oracle, SQL Server, SAP NetWeaver, SAP HANA, DB2 DB/400, MQ, JSON file, SQL Server Publisher for PWX Confidential, xml files etc., Analyzing integration, automation and cost effective reusable services.
- Developed blue prints and TO-BE cloud architecture for business services and IT Infrastructure, planning full data warehouse data migration strategy.
- Cost analyzing to use AWS service like data pipeline, Kinesis firehose, lambda, SQS (Queues), scala spark S3Distcp etc.
- Architected and responsible for setting up the AWS environment with Informatica to extract data from legacy and In-house systems and load it into AWS redshift Data warehouse and Big data, implementing AWS Hybrid, AWS VPC, AWS EMR, AWS EC2, CLI & Python, Cloud API, Cloud Front, Auto Scaling, Cloud watch and management solutions.
- Responsible for Big Data deployment, data extraction, Inbound data volume and velocity planning, Inbound Data Ingestion, loading, capacity planning, HDFS cluster and Data node capacity planning and HDFS storage infrastructure planning with EMR machines on AWS cloud, HDFS performance tuning, Data modeling and Architecture, YARN, HDFS, Spark & KAFKA Real time analytics architecture, Spark RDD, Data frame, Data sets, Spark SQL design and implementation.
Confidential
Informatica Admin
Responsibilities:
- Installed Informatica in all environments, Upgraded Informatica Platform from 8.6.1 to 9.5.1/9.6.1 HF2, Maintaining Informatica Platform (Power Center, IDQ, MM, PWX, DVO, and ILM-TDM), Applied Hotfixes and EBFs. All changes are applied following SDLC process. Proposed migration strategy of Informatica Objects (PC, IDQ, MM etc.) Used versioned repository, dynamic deployment group for migration, used SVN version control for other objects like shell scripts & database scripts.
- Configure PowerExchange Listener and Logger service on Informatica HA Grid Control, setup monitoring of Confidential workflows, Configure Connection session setting to restart from the last token. Setup seamless migration process using DTLURDMO utility. Written shell scrips o monitor the Confidential process, starting &U stopping PowerExchange Confidential workflows etc.
- Developed and suggested Informatica best practices to the team, helped developer in setting up the right encoding, helped developer in resolving common issues. Configured PowerExchange for SQL Server & Oracle with TDE enabled.
- Suggested alternate architectures for HA & DR based on Enterprise Architecture team input on RTO, implemented systems in DR using Informatica spanning domain architecture.
- Developed more than 50 mappings for real time replication using Power Center and Power Exchange Confidential .
- All workflows are running under Informatica Grid environment.
- Data profiling, Join Profiling, Custom Profiling, domain discovery, score carding and build cleaning rules in IDQ Knowledge of Human Task workflows for data governance.
- Architected and Implemented Informatica Disaster Recovery Site using VERITAS Cluster file system, Veritas replication software, Oracle MAA, Oracle extended Data Guard and Informatica spanning domain. Tested all service active & active-passive in DR periodically.
- Moved Informatica and Data Modeling ER Studio infrastructure from old data center to new datacenter seamless without any issues included Informatica Power Center, Informatica PowerExchange, Informatica IDQ, Informatica MM, spanning domain (4) nodes and ER Studio Repository Server and License Server etc.
Confidential
Informatica Architect
Responsibilities:
- Installation, maintenance and upgrade of Informatica Power Center from 7.1/ 8.1.1 to 9.5, configured and maintained session on grid control 2 node cluster environment, Power Exchange 9.5, Informatica Data Quality 9.5, Oracle Golden Gate 11g and Metadata Manager 9.5.
- Used Data Cleansing transformations and matching based on agreed profiling result later these rules was incorporated into oracle Siebel UCM (MDM) within data governance umbrella and applied fixes on the source system.
- Setup Code Migration strategy according to SDLC process and Informatica velocity best practices, used Version control repositories, create dynamic deployment groups and roll back of deployment if necessary. Responsible for preview and code migration to SIT/UAT/PROD.
- Maintained PCI compliance for data stored onto the file system data masking, write shell script to automate jobs using third part scheduler, wrote shell scripts for archiving and cleanup, backup the Informatica repository etc.
- Integrated Power Center & Metadata Manager with Active Directory (LDAP) and deployed Metadata Manager on IBM WebSphere Application Server (network deployment).
- 30% of the time developing complex mappings using Informatica velocity best practices mapping standards, extracted data from AS/400 DB2, VSAM (VMS Cobol) file, XML, DB2, Oracle, SQL Server, MQ, Swift format file etc.
- Involved in the full implementation of DWH project, developed new maps (slowly changing Dimension Type 2), modified existing one for data warehousing and BASEL II project; suggested and implemented error handling. Assisted in data reconciliation process for ALGO, ACLM and Basel II.
- Implemented load control tables for recovery and reload of data into the target systems, automated the generating parameter file for daily and monthly load from the control tables.
- Excellent skill on performance tuning of database SQL, Informatica Power Center and inefficient mappings.
- Used Power Center data profiling to identify the Data Quality issues and scheduled reporting to source system owner for getting it fixed.
- Installed, configured and maintained Informatica Siperian MDM, established to other systems using MQ & Adaptors.
- Established Data Governance which includes Data steward, Business Analyst and source system business owners.
- Installed DB2 software, created databases and Upgraded DB2 databases from DB7.2 to 9.1 twostep upgradeConfigured backups exported and imported database objects, performance tuning, migrated full databases from DB2 9.1 to Oracle 11g and taken care of different Foreign character encoding as part of DB2 decommissioning projects...
- Installed and configured WAS 6.1 and 7.1 application servers (3 node Cluster), maintained, upgraded, applied fix packs and deploying code provided by development team into production following complete release management process.
- Applying configuration changes as required, writing shell scripts for archive log files and automation, write maintenance jython scripts.
Type of Sources/Target used: Oracle, DB2, Informix, Sybase, SQL Server, XML, Cobol Copy Book using Power Exchange, Informatica Data Quality, Data Masking etc.
Confidential, California
Informatica Professional Services (IPS) Consultant
Responsibilities:
- Installed, configured and supported Siebel Analytics which is also known as OBIEE suite.
- Designed database for company standards where entire enterprise lookup for company standards.
- Involved in the full Data Quality life cycle (Assessing Data Quality, Defining Data Quality Targets, Designing Quality Enhancement plans, Implementing Quality Enhancement Plans, and Monitoring Data Quality Vs Targets).
- Analysis phase includes (Assess data, completeness, conformity, consistency, data duplication, integrity, accuracy )
- Designed and developed complex mappings from source systems to Enterprise Data Warehouse and from EDW to Data Marts. - Type 2, Type 3 dimensions, designed and developed complex mappings from xml and cobal sources to EDW, designed Crosswalk tables for translation between new and old systems for data integration, designed Job processing control tables to monitor jobs (number of rows loaded, job kickoff time, job end time, job status, Batch ID, etc.).
- Designed and developed complex workflows using command task, decision task, assignment, timer, Event waits, Event Raise, custom shell scripts and used Control M to schedule workflows etc.
- Designed and developed recoverability procedures if the job fails. Implemented error handing to capture errors into database tables for reconciliation purpose.
- Analyzed applicability of new features to enhance existing session performance, analyzed Informatica mappings and sessions for performance impact, helped customer in educating the impact of repository configurations on performance and load on the system, implemented session partitioning and explained Informatica partitioning benefits.
- Upgraded Informatica for more than 9 customer sites (Upgraded options metadata manager, SAP Power Connect, Teradata, etc) Upgraded from 4.7 Power Mart to Informatica 8.1 (two/three steps upgrade), moved repository from Informix to oracle database, and wrote shell scripts for clean shutdown/startup of all services including MM, backup shell script, file parsing script, etc.
- Developed blue print for data warehouse initiative, baseline architecture, proposed architecture (applications, high level data entities, prebuild analytics, Hardware & Software Infrastructure), guidelines on data governance, business Intelligence competency, etc.
- Developed and enhanced TRM, looking at the corporate strategic initiatives, added software and hardware products by looking at the current infrastructure, Confidential & Confidential rating wherever necessary, and consulted with ARB & APC for final approvals.
