Etl Developer/data Analyst Resume
3.00/5 (Submit Your Rating)
Richardson, TX
PROFESSIONAL SUMMARY:
- Over 10 years of dynamic career reflecting pioneering experience and high performance in technical analysis, design, development and implementation of relational databases V10.x/9.x and data warehouse using Informatica PowerCenter, IBM Data Stage 8.0.1/7.x/6.x/5.x (Info Sphere Information Server, Web Sphere, Ascential Data Stage)
- Working knowledge of Credit card, Auto and Home Loans, Healthcare and Telecom projects
- Strong expertise in SQL, SSRS, SSIS, Excel, VBA, SAS, PL/SQL, HQL (Hive w/Kerberos), AS/400. Proficient with complex joins and sub - queries; experience working with huge datasets
- Worked with Teradata, Vertica, Oracle, SQL Server, Netezza, Hive and DB2 databases
- Proficient in developing strategies for Extract, Transform and Load (ETL) mechanism
- Expert in designing Parallel jobs using various stages like Join, Merge, Lookup, Remove duplicates, Filter, Dataset, Lookup file set, Complex flat file, Modify, Aggregator, XML.
- Software Development Life Cycle (SDLC) experience including analysis, design and review of business and software requirement specifications.
- Hands on experience in design, develop, document and Testing of ETL jobs and mappings in Server and creating parallel jobs using Data Stage to populate tables in Data Warehouse and Data marts
- Hands-on experience with enterprise job schedulers like Control-M and Autosys
- Experience in analyzing the data generated by the business process, defining the granularity, root cause analysis, source to target mapping of the data elements, creating Indexes and Aggregate tables for the data warehouse design and development.
- Expert in designing Server jobs using various types of stages like Sequential file, ODBC, Hashed file, Aggregator, Transformer, Sort, Link Partitioner Link Collector.
- Experienced in integration of various data sources (DB2-UDB, SQL Server, PL/SQL, Oracle, Netezza, Teradata, XML and MS-Access) into data staging area.
- Expert in working with Data Stage Manager, Designer, Administrator, and Director.
- Expertise in MS Visio, MS Project and building Database schema and Data Modeling (Star and Snow flake schemas) experience using Erwin.
- Strong work experience on waterfall and Agile methodologies using VersionOne, Rally and Jira. Experience with Kanban methodology
- Hands on with coordinating and collaborating with multiple teams, providing trouble shooting support and prioritizing and tracking enhancements. Managing team in US and offshore.
PROFESSIONAL EXPERIENCE:
ETL Developer/Data Analyst
Confidential, Richardson, TX
Responsibilities:
- Fulfilled all responsibilities as ETL developer and supported the business team as a Data Analyst
- Used Informatica PowerCenter as ETL tool to extract data from sources systems (Oracle, SQL Server), transformed and loaded the data to Teradata, Vertica (Big Data) and Netezza (Data Lake)
- Documented ETL test plans, cases, scripts and validations based on design specifications for unit testing, system testing, functional testing, prepared test data for testing, error handling and analysis
- Designed, tested and documented existing and new ETL work and all related components; made recommendations for related functions that result in a more cost effective product delivery.
- Performed unit testing of my ETL work as well as double checked co-developers ETL work, the components and documented results.
- Completed code reviews for ETLs and related components, and complete documentation of issues identified and action items.
- Corrected any testing defects identified and supported all testing, including but not limited to: Unit Testing, Development Integration Testing, System Testing, User Acceptance Testing, End-to-End Testing, and Performance Testing
- Created the HLD (High level Design), ARR (Application Reengineering Requirement) and EDD (Enterprise Design Document) for the Adobe migration and Data Lake support project
- Worked with Adobe development team on the Adobe data layer migration project to ensure the Premier Care pages are on the same production data layer as the other cross functional teams
- Generated daily, weekly and monthly reports against Vertica, Netezza, Teradata, Hive and/or Oracle database(s) depending on the requirement
- Worked on Splunk AI tool to get visual data on user performance analytics (for response times, JS errors etc), user behavior analysis (conversion rates, entry/exits etc) and other analytical metrics
- Wrote SQL and HQL code (via Kerberos authentication client) for reports and analysis
Informatica ETL Developer
Confidential, Richardson, TX
Responsibilities:
- Responsible for modeling, mapping and loading data fields from the source (Verizon) billing systems to the target’s (Frontier) billing system
- Worked on Informatica to extract, transform and load data based on the mapping documents
- Worked with technical leads, architects, subject matter experts and testing/QA teams to deliver value to the business.
- Reviewed & accepted business, system requirements & enterprise design documents (HLD,EDD)
- Created Repository Users/Groups and assigned permissions to the user groups to define security in the repository and involved in detailed design of the reports/dashboards and data model.
- Performed root cause analysis for defects and triaged them to the appropriate teams for resolution
- Recommend enhancements in the business user dashboard for faster response times and conversion updates from the ETL standpoint
- Captured metadata in Informatica MDM for all elements sourced from Verizon.
- Worked on HP ALM and Remedy to work on defects and resolved them based on Severity/Priority.
ETL developer
Confidential, Richardson, TX
Responsibilities:
- Primary on-site technical lead during the analysis, planning, design, development, and implementation stages of data quality projects using Integrity.
- Extracted data from source systems, transformed and load into Oracle and Netezza according to the required provisions
- Involved in system analysis, design, development, support and documentation.
- Created views for hiding actual tables and to eliminate the complexity of the large queries.
- Created various indexes on tables to improve the performance by eliminating the full table scans
- Created objects like tables, views, Materialized views procedures, packages using Oracle tools like PL/SQL, SQL*Plus, SQL*Loader and Handled Exceptions.
- Created tasks in Jira for all development related work
- Generated Surrogate ID’s for the dimensions in the fact table for indexed and faster access of data.
- Identified application bottlenecks and opportunities to optimize performance of
- Worked with the Informatica PowerCenter configuration and supported installation, upgrades, performance tuning, etc. during off peak hours
ETL Developer
Confidential, Lewisville, TX
Responsibilities:
- Used DataStage Designer to develop processes for extracting, cleansing, transforming, integrating and loading data into staging tables
- Used Parallel Extender for parallel processing to improve performance when extracting data from multiple sources
- Used the DataStage Designer to develop processes for extracting, cleansing, transforming, integrating and loading data into Data Marts
- Worked with Metadata Definitions, Import and Export of Datastage jobs using Datastage Manager
- Implemented PL/SQL scripts in accordance with the necessary Business rules and procedures.
- Created queries using join and case statements to validate data in different databases.
- Created queries to compare data between multiple databases to make sure data is matched.
- Used the DataStage Director and its run-time engine to schedule run the solution, testing and debug its components, and monitored the executable versions on an ad hoc or scheduled basis.
- Created DataStage jobs using different stages like Transformer, Aggregator, Sort, Join, Merge, Lookup, Data Set, Funnel, Remove Duplicates, Copy, Modify, Filter, Change Data Capture, Change Apply, Sample, Surrogate Key, Column Generator, Row Generator, Etc.
- Involved in analysis, planning, design, development, and implementation phase of projects for IBM Web Sphere software -Quality Stage, Web Service, Information Analyzer, Profile Stage, of IIS 8.0.1
- Monitored Datastage jobs regularly by running UNIX scripts and forced restarts for failed jobs
- Created and modified batch scripts to ftp files from different server to data stage server.
ETL Analyst
Confidential, Plano, TX
Responsibilities:
- Involved in the full life cycle development of the data warehousing project including design, development, testing and Production support.
- Involved in business requirements gathering and preparing architecture design documents.
- Involved in preparing Technical Design Documents for the ETL development.
- Worked on Informatica Power Center tools like Designer, Repository Manager, Workflow Manager and Workflow Monitor.
- Involved in building the ETL architecture and Source to Target mapping to load data into EDW.
- Extracted data from flat files and other databases into staging area and load into Data warehouse
- Maintained stored definitions, transformation rules & targets definitions using repository manager
- Used various transformations like Filter, Expression, Sequence Generator, Update Strategy, Joiner, Stored Procedure, and Union to develop robust mappings in the Informatica Designer.
- Created, documented and maintained logical and physical database models (snowflake schema) in compliance with enterprise standards and created corporate metadata definitions for enterprise data stored within the metadata repository.
- Parsed high-level design specification to simple ETL coding and mapping standards
- Custom designed data model for Data warehouse to support data from multiple sources in real time
- Extensively used SQL loader to load data from flat files to the database tables in Oracle.
Healthcare Data Analyst/SQL Reports Developer
Confidential, Richardson, TX
Responsibilities:
- Worked on projects using Agile and Kanban (using JIRA) methodologies.
- Provided analysis and efforts on a rating engine which is based on .NET and Java technologies in an SDLC environment for calculating new premiums based on actuarial algorithms.
- Created reports using TOAD against Teradata, DB2 or SQL server depending on the requirement
- Investigated, analyzed and resolved issues based on criticality by writing SQL against multiple dbs
- Created data flow diagrams in MS visio and RACI chart for a new Healthcare pricing project.
- Performed data profiling on the source data to visualize the table structures, joins and identify any data quality issues that can be eliminated when loading the data.
- Using Quality Center to manage defects and work with all teams to ensure resolution prior to UAT.
Systems Architect
Confidential, Richmond, VA and Plano, TX
Responsibilities:
- Developed mapping docs for ETL work with source to target data mapping with physical naming standards, datatypes, volumetrics and corporate meta-data definitions (MDM).
- Worked with technical leads, architects, subject matter experts and testing/QA teams to deliver value to the business.
- Created tasks in Jira for all development related work
- Completed code reviews for ETLs and related components, and complete documentation of issues identified and action items.
- Corrected any testing defects identified and supported all testing, including but not limited to: Unit Testing, Development Integration Testing, System Testing, User Acceptance Testing, End-to-End Testing, and Performance Testing and performed code version control activities.
Market Risk Reporting Analyst
Confidential, New York, NY
Responsibilities:
- Worked with cross functional teams including Risk Managers, Traders, Front Office Managers, IT, Financial Control and other groups internally to gather data including issuer risk (JTD).
- Produced daily and weekly market risk reports that monitor issuer risk ( Confidential ) against single issuer market risk limits and perform daily data checks to ensure data quality and data integrity.
- Automated the process to run the daily data integrity checks using MS Access and MS Excel and scheduled the report using system scheduler to compare previous day’s data to the current days’.
- Collaborated with Risk Management and technology teams to create a new repository that integrates market risk and credit risk into a single report to view the combined risk for issuers.
- Provide information to Risk Managers, senior management and the business for audit, stress testing and ad-hoc related queries.
- Fix indicative data (Security Ratings, Product type, Parent company names) using Bloomberg.
