Etl Developer Resume
Boston, MA
SUMMARY:
- Has over 6+years of experience in ETL and Data warehousing concepts using Informatica, Oracle and Netezza.
- Expertise in analyzing data quality, data organization, metadata, data profiling, and documenting quality of source data to be used within the warehouse.
- Primarily responsible for performing ETL operations with Informatica, Oracle in the company’s client environment.
- Good experience in Unix Shell Scripting and ETL Process Automation using Shell Programming and Informatica. Have good experience in Performance Tuning at both Database and Informatica Level.
- Involved in requirements gathering, to identify dimensions and measures needed for Data Modeling.
- Experience in designing and implementing different types of mappings which includes aggregator, expression, lookup, update strategy, router, sequence generater etc, using Informatica.
- Analyzing the BRD, FSD, and TSD of the source system.
- Data profiling of the source system.
- Extracting data from relational and Flat File systems to stage environments.
- Cleanse the extracted data.
- Process and loading into Data Warehouse.
- Involved with data validation and debugging and preparing unit test document.
- Used Maestro scheduling tool for executing ETL jobs.
- Used Harvest to migrate parameter files, SQL and Shell scripts and Schedules.
- Support QA team to resolve issues if any.
- Responsible for interacting with the business people to discuss their requirements.
- Responsible for interacting with the source system for getting the data related issues resolved.
- Responsible for working with the architecture team to get the objects created in their 3nf model
- Engagement with the offshore development team.
- Trained in Cloudera Hadoop – Development and Administration.
- Strong Logical and Analytical Reasoning Skills, Excellent Communication with good Listening, Presentation and Intrapersonal Skills.
TECHNICAL SKILLS
- ETL Tools: Informatica Power Center 7.1/8.1/8.6/9.1.
- Platforms: Windows (all flavors), UNIX
- Databases: Oracle 8i/9i/10g, , Netezza
- Languages: C, C++, SQL, Shell Scripting,VB6.0
- Data Modeling: Visio
- Tools: WinSQL, Maestro , Harvest
Professional Experience:
Confidential Boston, MA June 11 to present
Informatica/ETL Developer
Description:
The project was to implement the conversion of Retail Data Warehouse and KPIs from Oracle 11G to Netezza 4.6.8 and all ETL code related to that. The business aim was to have a faster reporting system. Many of the financial queries which ran for 16/17 hours were accomplished in minutes.
Responsibilities:
- Performed POC to ensure that the proposed Netezza based solution is robust enough to get the required efficient reporting ability. This involved converting Informatica mappings to tune to Netezza Database. The project involved migration of Netezza 4.0.8 to 4.6.8 which is entirely different in behavior.
- Performed modeling of Netezza tables using customized Unix based tools to go to existing oracle tables and analyzed the data, including distribution keys and constraints
- Remodeled Type 1, Type 2 dimensions and created cross reference tables for faster reporting.
- Worked along with Senior Architects in the High Level and Detailed Design
- Performed the role of ETL lead and took part in the design of Netezza scripts.
- Coordinated with the DBA’s to get the necessary database objects in place.
- Resolved existing bugs and redesigned very complex mappings to ensure recoverability and enforced constraints in ETL level
- Converted all Oracle Stored Procedures to ETL mappings/Netezza Scripts.
- Created ETL test plans, test cases, assumptions and validations based on design specifications for unit testing, system testing, expected results, preparing test data and loading for testing, error handling and analysis.
- Highlighted issues and deviations to the client and ensured proactive delivery.
- Took part in integration testing and data validation.
- Performed performance tuning activities to ensure that all the Netezza scripts complete execution in an optimized time frame.
- Reporting tool was changed from BO XI R2 to Microstrategy 9, had to support and ensure reports were in sync with Oracle as we had dual loading and reporting for Integration Testing
Environment: Informatica Power Center 9.0.1, Oracle 10g, Netezza 4.6.8, UNIX Shell Scripting, Flat Files, Microsoft Office
Confidential, Madison, WI Sept 10 to May 11
Informatica ETL Developer
Description:
The project was to load sales related data to a separate ‘Operational Data store’ created in Oracle with Flat files and Netezza tables as source. This data was already available in the Data warehouse but was still loaded to a separate ODS to assist reporting activities.
Responsibilities:
- Worked closely with client in understanding the Business requirements, data analysis and deliver the client expectation
- Used InformaticaPowerCenter 8.6.1forextraction, loading and transformation(ETL) of data to the ODS.
- Created complex mappings in PowerCenter Designer usingAggregator, Expression, Filter, Sequence Generator, Update Strategy, Union, Lookup, Joiner transformations.
- Extracted data from multiple sources flat files and Netezza and loaded into Oracle ODS.
- HandleSlowly Changing Dimensions (SCD)(Type I, Type II and Type III) based on the business requirements.
- UsedInformatica Power Center Workflow managerto create sessions, batches to run with the logic embedded in the mappings.
- Involved in creation ofFolders,Users, RepositoriesandDeployment Groupsusing Repository Manager.
- Developed UNIX Shell Scriptsfor scheduling the sessions in Informatica.
- Involved in scheduling the Informatica workflows usingMaestro.
- Involved inPerformance tuningfor sources, targets, mappings and sessions.
- Created deployment groups, migrated the code into different environments.
Environment:Informatica PowerCenter 8.6.1 (Power Center Repository Manager, Designer, Workflow Manager, and Workflow Monitor), Oracle 10g, Netezza, WinSQL, UNIX
Client: Confidential, ME June 09 to Aug 10
Informatica Developer
Description:
This project was to create ETL jobs for a partner source system into the Data warehouse. Data was received in flat files and was first loaded to a staging area after performing the Data Quality checks and then it was loaded to the Netezza Data Warehouse.
Responsibilities:
- Involved in gathering requirements from business users. Participated in the detailed requirement analysis for the design of data marts and star schemas.
- Extracting data from several source systems like Oracle, Teradata, etc. and loading data into Enterprise Data ware house. Designed many Multi Source Single Target mappings and vice versa.
- Performed Performance tuning for the jobs to adhere to the SLAs set in Production.
- Created scripts to perform data quality checks at the stage level jobs to stop bad records at the staging level itself.
- Coordinated with the source systems in getting the data in the correct format.
- Worked on Power Center Designer client tools like Source Analyzer, Target Analyzer, Mapping Designer and Mapplet Designer.
- Created Mapplets with the help of Mapplet Designer and used those Mapplets in the Mappings.
- Created reusable transformations for reducing monotonous activities.
- Moving the data from source systems to different schemas based on the dimensions and fact tables by using the slowly changing dimensions type two and type one.
- Informatica workflow manager was used to create schedule, execute Sessions, Worklets, Command, E-Mail Tasks and Workflows. Performed validation and loading of the Flat files received from business users.
- Wrote UNIX Shell scripts to schedule the workflows
- Used Parameter files to reuse the mapping with different criteria to decrease the maintenance.
- Used Maestro to schedule Informatica, SQL script and shell script jobs.
Environment: Informatica 8.6.1, Oracle 11g/10g, Netezza, WinSQL, Unix.
Confidential, Edgewood, NY Nov 07 to May 09
ETL Developer
RESPONSIBILITIES:
- Involved in designing and developing the data acquisition process for the data warehouse including the initial load and subsequent refreshes.
- Using Informatica Power Center Designer, Extract & Transform the data from various source systems by incorporating various business rules for different applications by using the different objects and functions which the tool supports.
- Worked on Informatica tool like Source Analyzer, Data Warehousing designer, Mapping Designer, Transformations, Informatica Repository Manager and Informatica Workflow Manager.
- Developed Mappings with transformations like Source qualifier, Aggregator, Expression, Lookup, Filter, Router, Sequence Generator, Update strategy, Rank etc.
- Designed and implemented appropriate ETL mappings to extract and transform data from various sources to meet requirements.
- Documented all transformations and business logics and ETL processes. Designed and developed Informatica ETL mappings to extract master and transactional data from heterogeneous data feeds and load into Target Database.
- Created and modified mappings according to the client’s requirementas per business rules.
- Implemented performance tuning concepts to the whole ETL process.
- Documented the mappings used in ETL processes.
- Implemented Slowly Changing Dimension methodology.
ENVIRONMENT: Informatica power center 7.1.4, Oracle 9i, Netezza, TOAD and UNIX
Client: Confidential, India May 06 to Oct 07Informatica Developer
Responsibilities:
- Extracted Data from Different Sources by using Informatica.
- Created Mappings using Mapping Designer to load the data from various sources, using different transformations like Unconnected and Connected lookups, Router, Aggregator, Joiner, Filter and Update Strategy for Daily Load.
- Extensively used SQL overrides at Source Qualifier and Lookup Transformations while extracting data from multiple tables.
- Creating Sessions, Batches for data loads using Workflow Manager for Daily load.
- Involved in resolving performance bottlenecks for performance tuning of load process at various stages such as targets, sources, mappings, sessions, and systems.
- Documented ETL test plans, test cases, assumptions and validations based on design specifications for unit testing, system testing, expected results, preparing test data and loading for testing, error handling and analysis.
- Extracted data from different source databases. Createdstaging areato cleanse the data and validated the data.
- Developed schedules to automate the update processes andInformatica sessions and batches.
- Created ETL specification document for every mapping.
- Documenting the project flow and prepared User and System Manuals for better usage.
- Modifying the shell/Perl scripts as per the business requirements.
- Supported other teams to resolve tickets raised by users on Data quality
Environment:Informatica PowerCenter, Oracle 8i, WinSQL, UNIX Shell Script.
EDUCATION:
Masters in information Technology