Etl Architect/developer Resume
CA
SUMMARY:
- 8 Years of IT experience in System Analysis, Design, Development, Implementation, Maintenance, and Testing of Data Warehouse and Data Integration Applications using ETL tools like Informatica Power center, SSIS, DATA STAGE, OBI.
- Experience on Business Requirements Analysis, Application Design, Development, Testing, Implementation and maintenance of client/server Data Warehouse and Data Mart systems in the Healthcare, Finance and Pharmaceutical industries.
- Expertise in designing confirmed and traditional ETL Architecture involving Source data warehouse and database systems like Teradata(14.x), Netezza(6x), Oracle(10g - 11g), Flat Files (fixed width, delimited), DB2, SQL server(2008), XML and Target databases.
- Extensive Experience in designing and developing complex mappings applying various transformations such as Lookup, Source Qualifier, Update Strategy, Router, Sequence generator, Aggregator, Rank, Stored Procedure, Filter, Normalizer, Joiner, Expression, Union, SQL and Sorter transformations.
- Experience in using various Informatica Power center Client components Designer/ Workflow manager /Workflow Monitor/Repository Manager.
- Extensive experience in developing the Workflows, Worklets, Sessions, Mappings, and configuring the Informatica Server using Informatica Power Center.
- Proficient in ETL Unix shell script processes, data loading, manipulation using UNIX shell scripting, SQL
- Extensively worked with Teradata utilities like BTEQ, Fast Export, Fast Load, Multi Load to export and load data to/from different source systems including flat files.
- Hands on experience in maintaining the code versions using Visual Source Safe / Informatica Versioning.
- Performed the data profiling and analysis making use of Informatica Data Explorer (IDE) and Informatica Data Quality (IDQ).
- Dimensional Data modeling experience using Data modeling tool Sybase Power designer. Worked closely with the Business Objects reporting team to design the data mart tables for Business Objects reports.
- Strong knowledge of Dimensional Modeling, Star and Snowflake schema. Expertise in Data Flow Diagrams, Process Models, and ER diagrams with modeling tools like ERWIN & VISIO.
- Proficient in ETL Unix shell script processes, data loading, manipulation using UNIX shell scripting, SQL.
- Extensively worked on ODS/EDW database table design by providing guidance in logical and physical design, defining and/or updating database tables, and maintaining the data dictionary metadata. Worked on Plans, executes and tests ODS and EDW upgrades.
- Expertise in tuning and optimizing the performance of mappings by following Informatica best practices and also applied several methods to get best performance by decreasing the run time of workflows.
- Proficient in delivering the high data quality by designing, developing and automation of audit process and implementing the reconcile process accordingly.
- Experience in implementing the best practice standards throughout the development cycle following the Informatica Velocity methodology.
- Strong Knowledge in Relational Database Concepts, Entity Relation Diagrams, Normalization and De normalization Concepts.
- Extensively worked on backend testing and well versed with SQL, PL/SQL queries to run the data integrity tests.
- Automated the jobs using the Scheduling tools like Tidal, AutoSys and Control-M.
- Strong written and oral communication skills.
- Excellent team player and self-starter with good ability to work independently and possess good analytical, problem solving and logical skills.
TECHNICAL SKILLS:
ETL Tools: Informatica Power Center 9. 6.1 /9.5/9.1/8. x/7.x/6.x, IDE, ODI,IBI
BI tools: Microstrategy, Tableau, Business Objects, Power BI
Databases: Teradata (14.x, 15.x),Oracle (10g-11g), DB2, SQL Server 2012/2016, Netezza(6.x)
Programming Skills: PL/SQL,sql, Python, R programming
Database Tools: TOAD, SQL*Plus, SQL Developer, PL/SQL Developer, Teradata SQL assistant, WinSQL.
Data Modeling: Sybase Power Designer 15.2
Schedulers/Scripting: UNIX Shell Scripting, TIDAL, Autosys, Control-M
PROFESSIONAL EXPERIENCE:
Confidential, CA
ETL Architect/Developer
Responsibilities:
- Interacted with the Business users to identify the process metrics and various key dimensions and IBI measures Involved in the complete life cycle of the project.
- Developed FRD (Functional requirement Document) and data architecture document and communicated with the concerned stakeholders.
- Worked on dimensional modeling to design and develop STAR schemas by identifying the facts and dimensions. Designed logical models as per business requirements using Erwin.
- Creation of customized Mload scripts on UNIX platform for Teradata loads.
- Writing teradata sql queries to join or any modifications in the table
- Performed high volume maintenance on large Teradata tables using MultiLoad loader utility.
- Created TPT to transfer the data Oracle system to Teradata.
- Designed and Developed ETL mappings using transformation logic for extracting the data from various sources systems.
- Involved in performance tuning and optimization of Informatica mappings and sessions using features like partitions and data/index cache to manage very large volume of data.
- Used Informatica debugging techniques to debug the mappings and used session log files and bad files to trace errors occurred while loading.
- Developed processes on both Teradata and Oracle using shell scripting and RDBMS utilities such as Multi Load, Fast Load, Fast Export, BTEQ (Teradata) and SQL*Plus, SQL*Loader (Oracle).
- Created, Tested and debugged the Stored Procedures, Functions, Packages, Cursors and triggers using PL/SQL developer.
- Involved with Informatica team members in Designing, document and configure the Informatica MDM Hub to support loading, cleansing, matching, merging, and publication of MDM data.
- Used Metadata manager for validating, promoting, importing and exporting repositories from development environment to testing environment.
- Involved with Data Steward Team for designing, documenting and configuring Informatica Data Director for supporting management of MDM data
- Used the feature EXPLAIN PLAN to find out the bottlenecks in a given Query, thus improving the performance of the job.
- Involved in Unit testing, User Acceptance testing to check whether the data is loading into target, which was extracted from different source systems according to the user requirements.
- Documented ETL test plans, test cases, test scripts, test procedures, assumptions, and validations based on design specifications for unit testing, system testing, expected results, preparing test data and loading for testing, error handling and analysis.
- Automated UNIX shell scripts to verify the count of records added everyday due to incremental data load for few of the base tables in order to check for the consistency.
- Involved in production and deployment phase to make sure the job schedules and dependencies are developed in such a way that we are not missing the SLA on a day to day basis.
Environment: Teradata (14x -15x), ETL Informatica Power Center 9.6.1, Microstrategy, Oracle 11g/10g,SQL Server, XML Files, TOAD, SQL, PL/SQL, Windows XP, UNIX.
Confidential, TN
Lead Informatica Developer
Responsibilities:
- Involved in the analysis of source to target mapping provided by data analysts and prepared function and technical design documents.
- Involved in creation of Informatica mappings to build business rules to load data using transformations like Source Qualifier, Expression, Aggregator, Lookup, Filter, Router, Update Strategy, Normalizer, Java, Stored procedure, and Sequence generator transformations.
- Designed and developed Complex ETL Informatica mappings. Scheduled and Managed Informatica sessions.
- Extensively worked on Profiling and Parsing the Unstructured data coming out of Ryan Solutions data using various Informatica complex Functions.
- Developed scripts using Teradata utilities like MLOAD, FLOAD, fast export.
- Assisted another Teradata DBA to bring the system in production.
- Generated reports /scripts using Teradata BTEQ.
- Used Change Data Capture (CDC) to implement Incremental Data Extraction so that only the modified and new records will be extracted to the destination rather than full extraction.
- Designing and developing ETL maps based on the methodologies like Inmon and Kimball.
- Extensively worked on Performance tuning of queries. Complex SQL queries are used for data retrieval.
- Designed, Developed and Deployed UNIX shell scripts.
- Development & Implementation of a data warehousing project and also production support for enhancements and maintenance.
- Used Session parameters, Mapping variable/parameters and created Parameter files for imparting flexible runs of workflows based on changing variable values.
- Coordinated and worked closely with architects, operations, and business units to build and deploy.
- Implementing data quality management solutions that manage millions of customer transactions.
- Configuration and Administration of Informatica Servers.
- Oversee the technical design of the data warehouse, optimizing for scale and performance.
- Performed systems, data quality assurance, system testing, ensuring that software and systems perform to specification.
- Implemented Slowly Changing dimension type2 methodology for accessing the full history of accounts and transaction information.
- Established test environments and test data and advocate the use of common re-usable functions, services and data.
- Participated in quality assurance activities like reviews, walkthroughs etc.
- Performed extensive risk assessment prior to production implementations and carried out validations for post implementation to ensure successful execution monitoring performance.
Environment: Teradata (14.X-15.X), ETL Informatica Power Center 9.6.1, MS SQL Server Version 11, MS SQL Server Management Studio, MS Excel, UNIX, Control-M.
Confidential, Dallas/Fort worth Area, TX
Sr. ETL Informatica Developer
Responsibilities:
- Worked with Data Business Manager(s), Master Data Manager, and Data Analyst for requirements gathering, business analysis and designing of the Look Database.
- Worked on Informatica Power Exchange Remote Function Calls to connect to SAP systems to extract the data from SAP to Oracle.
- Extensively worked on ABAP method and File Stream Mode to generate the SAP ABAP code through Informatica Power center client Designer.
- Extracted data from a wide variety of Sources like Flat files, XML files, Relational Databases (Oracle, SQL Server, Postgress and Netezza) and from the legacy Mainframes and SAP source systems by using Informatica Power Exchange.
- Prepared design document for data flow process, LLD, HLD, Project end User Manual and Operations Hand off guide.
- Coordinated in daily team meetings, technical code review meetings and interacted with business people for better technical solutions and proposed ETL strategy based on Agile Methodologies.
- Tuned performance of Informatica sessions for large data files by increasing block size, data cache size, sequence buffer length and target based commit interval.
- Developed Teradata BTEQ scripts to populate target tables, Responsible for data profiling, data mapping, data loading, and data validation.
- Worked with Teradata utilities like BTEQ, Fast Export, Fast Load, Multi Load to export and load data to/from different source systems including flat files.
- Involved in creation of Informatica 9.5 mappings to build business rules to load data using transformations like Source Qualifier, Expression, Aggregator, Lookup, Filter, Router, Update Strategy, Normalizer, Java, Stored procedure, and Sequence generator transformations.
- Extracted data from a wide variety of Sources like Flat files, XML files, Relational Databases (Oracle, SQL Server, Teradata) and SAP source systems by using Informatica Power Exchange.
- Designed and developed Reference Integrity, Technical and Business Data Quality rules using Informatica Data Quality. Involved in cleaning the data using IDQ in Informatica Data Quality 9.1/9.5 environment. Implemented IDQ Address standardization Mapplets to standardize validate and format client’s addresses for marketing purposes.
- Created UNIX shell scripts to handle pre and post session tasks.
- Used Repository manager to create user groups and users, and managed users by setting up their privileges and profile.
- Created an infrastructure to ensure data quality and appropriate data linkage and deliver self-service reporting to business and speeding and improving the decision making.
- Created User defined functions (UDF) to reuse the logic in different mappings. Built-in mapping variable / parameters and created parameter files for imparting flexible runs of sessions / mappings based on changing variable values.
- Worked on configuring Informatica and file processing Job schedules for CONTROL-M scheduler.
- Working with DT studio for transformations.
- Documented Process flows as required for cross training/educational purposes.
Environment: Informatica Power Center v9.5, Informatica Power Exchange, Informatica Data Quality (IDQ), Oracle 10g/9i, Control-M Scheduler, Windows-XP (Client), Informatica servers on Unix (Solaris), Teradata 14x, PL/SQL Developer, Teradata SQL Assistant.
Confidential, Omaha, NE
Sr. Informatica Developer/Data Modeler
Responsibilities:
- Analyzed and understood business and customer requirements by interacting with Business Analysts, Client Analytics team and Subject Matter Experts (SME).
- Created logical and physical data models using TDWI (The Data Warehousing Institute) best practices to ensure high data quality and reduced redundancy. Developed best practices for standard naming conventions and coding practices to ensure consistency of data models.
- Worked on Data Profiling tools to drive and accelerate meaningful business benefits from enterprise data.
- Worked with offshore associates and ensure high quality code delivery according to the requirements specified by business users.
- Involved in creation of Informatica mappings to build business rules to load data using transformations like Source Qualifier, Expression, Aggregator, Lookup, Filter, Router, Update Strategy, Normalizer, Java, Stored procedure, and Sequence generator transformations.
- Handling day to day informatica administration tasks such as components migration tasks, Scheduling and Monitoring informatica repository backups.
- Interact directly with the Business Users to accommodate any changes, if there are any, so as to minimize/eliminate last minute changes.
- Execution/Implementation of the project in accordance with End User’s expectations and making sure that the user is fully satisfied with the end results.
- Extensively used mapping parameters, mapping variables to provide the flexibility and parameterized the workflows for different system loads.
- Implemented the Error Records Handling mechanism.
- Created UNIX shell scripts to handle pre and post session tasks.
- Involved in different phases of testing like Unit, Functional, Integration and System testing.
- Developed and tested stored procedures, functions and packages in PL/SQL.
- Involved in database testing, writing complex SQL queries to verify the transactions and business logic like identifying the duplicate rows by using SQL Developer and PL/SQL Developer.
- Prepared documents like Deployment Doc, Estimation Reports, Tractability Reports, Development tracking report and weekly status reports (WSR).
Environment: Informatica Power Center v9.0.1, Informatica Power Exchange, Informatica Data Quality (IDQ), Oracle 10g/9i, Tidal Scheduler, Windows-XP (Client), Informatica servers on Unix (Solaris), Netezza, PL/SQL Developer, Global IDs, Sybase Power Designer 16.0
Confidential
Sr. Informatica Developer
Responsibilities:
- Analyzed and understood business and customer requirements by interacting with Business Analysts, Date Modeler and Subject Matter Experts (SME).
- Validated, debugged old Mappings tested Workflows& Sessions and figured out the better technical solutions on old & new Mappings for Source/Target compatibility due to version changes. Identified the Bottlenecks in old/new Mappings and tuned them for better Performance. Established standard code delivery and code migration process.
- Extracted data from a wide variety of heterogeneous Sources like Flat files, XML files, Relational Databases (Oracle, SQL Server, Postgress and Netezza) and loaded into Data Warehouse.
- Used shortcuts to reuse objects without creating multiple objects in the repository and inherit changes made to the source automatically.
- Creation of sessions and workflows according to the data load in to different systems.
- Created and tested shell scripts to automate Job scheduling by using commands like pmcmd.
- Data investigation in the analysis of incoming data from the various source systems, documenting the data anomalies and generating Data Quality reports.
- Involved in different phases of testing like Unit, Functional, Integration and System testing.
- Creation of Review documents for specification document and test cases.
- Troubleshoot loading failure cases, including database problems.
Environment: Informatica Power Center 7.1.4/8.1.6, Informatica Power Exchange, Oracle 10g/9i,Tidal Scheduler, Windows-XP (Client), Informatica servers on Unix (Solaris), Netezza, PL/SQL Developer.
