BI Architect/Dev (Sr. Consultant) Resume
Minneapolis, MN
SUMMARY
- Over 8 years of working experience in all phases of the data warehouse life cycle, including architecture, analysis, design, development, and testing.
- Experienced in leading software development teams, Agile project planning and project management.
- Highly proficient in development, implementation, administration and support of ETL processes for large-scale data warehouses using Informatica PowerCenter 9.6/8.x, Informatica B2B, MDM, IDQ and SQL Server Integration Services (SSIS) 2005/2008.
- Architected and designed scalable ETL software systems for new DBMS platforms such as Hadoop, ParAccel PADB, Amazon Redshift, MongoDB, and Apache Cassandra.
- Extensively worked in designing mappings using different transformations like Filter, Joiner, Router, Source Qualifier, Expression, Union, Update Strategy, Unconnected / Connected Lookup, Aggregator and SCD Type-1/2/3/6.
- Expertise in using components like Designer, Workflow Manager, Workflow Monitor, Repository Manager and B2B Studio.
- Experience in loading data from various sources such as HDFS, Oracle, SQL Server, Teradata, DB2, MS Access and flat files into data marts.
- Extensive experience in performance tuning of BI environments, ETL load tuning, and database performance management.
- Strong knowledge of relational databases (RDBMS), data modeling, and building data warehouses and data marts using Star Schema and Snowflake Schema.
- Extensive experience in coding using SQL, PL/SQL Procedures/Functions, Triggers and Packages in Oracle.
- Extensive programming background with UNIX shell scripting. Developed many process automation scripts to schedule and automate Informatica jobs.
- Software development background using Java. Developed a batch process to generate reusable Informatica mappings from templates. Created an ETL scheduling tool using Java/Quartz.
- Good understanding of job scheduling tools like Oracle DAC, cron, AutoSys and Quartz Scheduler. Production implementation with 500+ jobs.
- Hands-on experience across all stages of Software Development Life Cycle (SDLC) including business requirement analysis, data mapping, build, unit testing, system integration and user acceptance testing.
- Extensive experience and domain knowledge in areas of Retail, Manufacturing, Legal, HealthCare, Financial, and Government verticals.
- Experienced in working with business partners to identify information needs and business requirements for reports. Quick learner with excellent communication and interpersonal skills.
TECHNICAL SKILLS
ETL Tools: Informatica PowerCenter 9.X/8.X, Informatica B2B, MDM, IDQ, SSIS 2005/2008
RDBMS: Oracle 11g/10g/9i, SQL Server 2005/2008, MySQL, IBM DB2, MS Access.
New DBMS: ParAccel PADB (Actian Matrix), Amazon Redshift, MongoDB, Apache Cassandra
Languages: Java, C, C++, Objective-C, J2EE, PL/SQL, Shell Scripting.
Operating Systems: Windows 7/XP/2000/NT Server, OS X, Linux and UNIX.
Office tools: Word, PowerPoint, Outlook, Project and Visio.
Data Modeling: Star Schema Modeling, Snowflake Modeling, Dimensional Modeling, Erwin, SQL Data Modeler.
Packages: SQL*Plus, Toad, SQL*Loader
Scheduler: Oracle DAC, AutoSys, Quartz, crontab
BI Reporting: Cognos, Crystal Reports, SSRS 2005/2008
Web Technologies: HTML5, CSS3, XML, PHP, PERL, JavaScript, AJAX, Dreamweaver.
PROFESSIONAL EXPERIENCE
Confidential, Minneapolis, MN
BI Architect/Dev (Sr. Consultant)
Responsibilities:
- Architecting, implementing, administering and supporting ETL processes for large-scale data warehouses (>50 TB) using Informatica PowerCenter 9.x/8.x.
- Participating in full lifecycle development of the enterprise data warehouse (EDW). Tasks include ETL architecture, analysis, design, development, testing and documentation.
- Providing leadership and direction to project team by understanding business processes, identifying potential usability issues, managing scope, and ensuring that an appropriate level of Data quality is maintained at all times.
- Developing ETL processes for extracting data from operational, legacy and heterogeneous systems and loading it into the PADB columnar database on the cloud. Implementing complex data transformation rules, data formats and business cleansing rules.
- Implementing business requirements by designing ETL mappings using transformations like Filter, Joiner, Router, Source Qualifier, Expression, Union, Update Strategy, Unconnected / Connected Lookup, Aggregator, Java and SCD Type-1/2/3/6.
- Performance tuning ETL mappings. Implementing various tuning methods at source, target, mapping, session and at system levels.
- Architecting and designing a cloud-based PADB columnar database (Redshift) to support the EDW and reporting. Performing table design, data compression analysis, optimization of data distribution and sorting, workload management (short-query bias) and design of data backup/restore strategies.
- Developing user-defined functions (UDFs), functions and procedures (T-SQL) to extend PADB columnar database functionality.
- Developing PSQL scripts (PADB shell scripts) to automate PADB database table management and executing them through Informatica after daily loads.
- Designing E-R models and data models to fit business goals in the new cloud environment. Developing dimensional models, both physical and logical, using Erwin.
- Developing object reuse and ETL migration policies. Developing automation scripts for ETL object migration to higher environments on the cloud.
- Developing a new Data warehouse scheduling system on cloud to trigger Informatica jobs based on business contracts.
- Expertise in implementing software automation using Java, UNIX shell and Informatica. Developing a custom automated ETL process for data extracts, ETL logic and target loads from flat files to the PADB columnar database on the cloud.
- Expertise in UNIX shell scripting and Perl coding. Developing an FTP automation script to eliminate manual user processes during ETL.
Environment: Agile Methodologies, Informatica cloud PowerCenter 9.X, Actian Matrix PADB, Amazon Redshift, MicroStrategy cloud, Oracle 11g/10g, Java, SQL Developer, UNIX and PSQL Scripting, PL/SQL, Windows 7
Confidential, Eagan, MN
Sr BI Consultant/Architect
Responsibilities:
- Worked in an Agile environment with a good understanding of practices such as iterations, sprints, daily standups, burndown charts, velocity and retrospectives.
- Highly knowledgeable in implementing ETL development architecture with components such as messaging-based middleware, the JUnit testing framework and continuous integration, using tools like WebSphere MQ, Microsoft TFS, Eclipse and AntHill.
- Developed Informatica B2B parsers for extracting data from XML data formats. Used various transformation components like EnclosedGroups, RepeatingGroups, Groups, ContentSearch, Marker Search, SetValue, CalculateValue, EnsureCondition, AppendValues Transformers, Map, WriteValue etc.
- Coded Informatica real-time (RT) mappings using Unstructured Data transformations, Source Qualifiers, Expressions, Routers, Lookups and reusable Mapplets to read/write data from B2B parsers.
- Worked with the Architecture group to develop an Informatica ETL object reuse policy. Developed reusable Informatica Mapplets and user-defined transformations.
- Good understanding of the Informatica migration process using deployment groups. Wrote automation scripts to move objects to higher environments.
- Wrote shell scripts using pmcmd to verify post-migration accuracy, including scripts to start and stop workflows, run smoke tests and continuously monitor errors.
- Developed various PL/SQL scripts to automate the Informatica metadata table management process.
- Implemented software automation for batch generation of B2B parsers, JUnit test cases and Informatica mappings using Java.
- Worked extensively on code debugging and performance tuning of Informatica ETL mappings.
Environment: Agile Methodologies, Middleware (WebSphere MQ, JMS), Web services (SOAP, REST, XML-RPC), SOA, Java, TFS, Informatica PowerCenter 9.X, Informatica B2B, Oracle 11g/10g, SQL Developer, SQL Data Modeler, PL/SQL, Windows 7, UNIX
Confidential, Vernon Hills, IL
ETL Team Lead/ Data Architect
Responsibilities:
- Led and managed a team of 5 ETL developers responsible for building Business Intelligence solutions. Mentored them in BI methodologies and in day-to-day development activities.
- Supervised the entire software development lifecycle, including detailed study and analysis of the Village Health business data and determination of the warehouse's requirements and capacities.
- Responsible for resource planning, product management liaison, overall timeline and product quality deliverables.
- Created dimensional models, identifying fact and dimension tables from the business data. Maintained the models as business requirements changed.
- Created ETL specifications and mapping documents based on the data model and source system analysis for daily ETL loads.
- Worked extensively on code debugging and performance tuning of Informatica ETL mappings.
- Involved in documentation of data mapping, code review, test plans and operations.
Environment: Informatica PowerCenter 9.X, Informatica B2B, SSIS, Statistics (Minitab, R programming), Oracle 11g/10g, SQL Server 2008, TOAD, SQL Data Modeler, PL/SQL, Visio, Windows XP, UNIX, Perl, Microsoft Project.
Confidential, Atlanta, GA
Sr. ETL Developer
Responsibilities:
- Developed various Informatica mappings using transformations like Expression, Aggregator, External Procedure, Stored Procedure, Lookup, Filter, Joiner, Rank, Router, Update Strategy, Normalizer and XML.
- Handled slowly changing dimensions of Type 1, 2 and 3 to populate current and historical data to Dimensions and Fact tables in the data warehouse.
- Developed and tested all the backend programs, Error Handling Strategies and update processes.
- Created the transformation routines to transform and load the data. Developed processes for automation of loading data using parameter driven sessions for batch schedule processes.
- Set up batches and sessions to schedule the loads at the required frequency using PowerCenter Workflow Manager.
- Created several materialized views for reporting purposes and to enhance performance.
- Responsible for tuning ETL procedures and schemas to optimize load and query performance.
- Involved with the Architecture group to develop ETL metadata strategies and Informatica object reuse policies. Developed reusable Informatica Mapplets and Transformations.
- Developed ETL technical specs using Visio for ETL process flow and ETL load plan, ETL execution plan, Test cases, Test scripts etc.
- Involved in production support activities with Installation and Configuration of Informatica Power Center 9.x. Used Informatica Workflow Monitor to monitor and control jobs.
- Developed PL/SQL and UNIX Shell Scripts for scheduling the sessions in Informatica.
- Wrote stored procedures, functions, and database triggers. Created database triggers on tables to generate surrogate keys.
- Coordinated with Business Users to understand business needs and implement the same into a functional Data warehouse design.
- Integrated logical and physical data models for business users to determine common data definitions.
Environment: Informatica PowerCenter 9.X/8.X, Oracle 11g/10g, TOAD, Erwin 3.5.2, PL/SQL, Windows XP, UNIX, Sun Solaris.
Confidential, TN
Sr. ETL Developer
Responsibilities:
- Involved in dimensional modeling to design and develop star schemas, identifying fact and dimension tables.
- Designed and developed various Informatica mappings using transformations like Expression, Aggregator, External Procedure, Stored Procedure, Lookup, Filter, Joiner, Rank, Router, Update Strategy and XML. Developed various Ad-hoc mappings for various business needs.
- Developed and tested all the backend programs, Error Handling Strategies and update processes.
- Used the Normalizer transformation to normalize XML source data.
- Worked extensively with Teradata utilities (MultiLoad, TPump and FastLoad) to load data.
- Extensively used XML transformation to generate target XML files.
- Developed scripts to automate data load processes into the target data warehouse and scheduled them using AutoSys.
- Created several materialized views for reporting purposes and better performance.
- Responsible for tuning ETL procedures and schemas to optimize load and query performance.
- Involved with the Architecture group to develop ETL metadata strategies and Informatica object reuse policies. Developed reusable Informatica Mapplets and Transformations.
- Developed ETL technical specs using Visio for ETL process flow and ETL load plan, ETL execution plan, Test cases, Test scripts etc.
- Involved in production support activities with Installation and Configuration of Informatica Power Center 8.x. Used Informatica Workflow Monitor to monitor and control jobs.
- Implemented data movement processes that load data from Oracle and Teradata into Oracle by developing shell scripts that use Teradata SQL utilities.
- Developed PL/SQL and UNIX Shell Scripts for scheduling the sessions in Informatica.
- Wrote stored procedures, functions, and database triggers. Created database triggers on tables to generate surrogate keys.
- Coordinated with Business Users to understand business needs and implement the same into a functional Data warehouse design. Used Cognos for reporting.
- Integrated logical and physical data models for business users to determine common data definitions.
Environment: Informatica PowerCenter 8.X, Teradata V2R6, Oracle 10g, DB2, TOAD, Erwin 3.5.2, PL/SQL, Cognos, Windows XP, UNIX, Sun Solaris.