ETL Developer Resume
PROFESSIONAL SUMMARY:
- 6+ years of total IT experience and technical proficiency in building Data Warehouses, Data Integration, Operational Data Stores and ETL processes for clients in the E-commerce (sales orders, inventory tracking, daily reports), Financial (Equities, Futures) and Healthcare (Providers, Customers, Organizations, Plans, Claims, and Extracts) domains.
- Hands-on experience in the design, development, migration and implementation of Data Warehouse and ETL applications.
- Hands-on experience working with large-scale Data Warehouse implementations and ETL tools using Informatica PowerCenter 8.x/7.x, DataStage 9.x/8.x, Oracle 10g and 11g, DB2, Netezza and SQL Server on UNIX and Windows platforms.
- Extensive experience in Extraction, Transformation, and Loading (ETL) data from various data sources into Data Warehouse and Data Marts using DataStage Designer & Talend Open Studio environment.
- Expertise in implementing complex business rules by creating several parallel jobs and sequencers for process execution flow with robust mappings in DataStage.
- Experience in performance tuning of jobs, reducing execution time and making efficient use of limited server resources.
- Stop/start services, monitor logs, create/delete projects, create/modify environmental parameters, etc.
- Evaluate system specifications, working parameters for hardware / software compatibility
- Comply with standards and guidelines related to the design, construction, testing and deployment activities within the Delivery Management environments
- Install IBM Information Server product suite, post configurations including configuring database native connectivity and ODBC connections and parallel database reads and writes
- Works with minimal guidance; seeks guidance on only the most complex tasks
- Create documentation for the installs, upgrades and regular maintenance
- Resolving ETL DataStage performance issues
- Experience in integrating various data sources such as Oracle, DB2, SQL Server, flat files and XML files into the Data Warehouse; also experienced in Data Cleansing and Data Analysis.
- Extensively used SQL and PL/SQL to write Stored Procedures, Functions, Packages, Cursors, Triggers, Views, and Indexes in distributed environment.
- Hands-on experience with different types of data load strategies and scenarios such as Historical Dimensions, Surrogate keys, Summary facts, etc.
- Strong experience in writing UNIX Shell scripts and SQL scripts for development, automation of ETL processes, error handling, and auditing purposes. Experience in using Autosys and Control-M scheduling tools to organize and schedule jobs.
- Experience in preparing documentation such as High level design, System requirement document, and Technical Specification document etc.
- More than 4 years of extensive ETL experience in data extraction, transformation and loading using Informatica PowerCenter 9.x/8.x, Informatica PowerExchange 9.x/8.x, Data Warehousing and Teradata.
- Worked extensively in all stages of SDLC, from gathering requirements to testing, implementation and support.
- Solid understanding of multiple programming languages, including C#, C, C++, Java, JavaScript, HTML, and XML.
- Expertise in developing and debugging indexes, stored procedures, functions, triggers, cursors using DB2.
- Experience in project management, estimations, and resource management activities.
- Excellent analytical and problem-solving skills with a strong technical background and interpersonal skills.
AREAS OF INTEREST:
- Data Warehousing
- Business Analytics
- Big Data
- Data Science
TECHNICAL SKILL SET:
Databases: Oracle, Netezza, DB2, NoSQL, SQL Server, Teradata
Operating Systems: Windows/Unix/Linux
Database Tools: TOAD, SQL Navigator
Programming Languages: C, C++, HTML, XML, PL/SQL, Java, J2EE, Python
ETL Tools: IBM DataStage Enterprise Edition 9.1/8.5/7.5, Informatica PowerCenter 9.6.1/8.5.1
Scripting Languages: Shell Scripting, Perl Scripting
Scheduling Tools: Autosys, Tivoli, Control-M
PROFESSIONAL EXPERIENCE:
ETL Developer
Confidential
Responsibilities:
- Involved in all phases of SDLC, from requirement gathering, design, development, testing and production to user training and support for the production environment.
- Created new mapping designs using various tools in Informatica Designer such as Source Analyzer, Warehouse Designer, Mapplet Designer and Mapping Designer.
- Experienced in developing mappings using transformations such as Source Qualifier, Aggregator, Lookup, Filter, Sequence Generator, Expression, Router, Update Strategy, Rank, XML, Normalizer, etc., to load data from different sources such as Oracle, flat files, Excel spreadsheets and XML files to the target Data Warehouse.
- Created Hive tables and loaded data from HDFS to Hive tables as per the requirement.
- Worked on MDM Hub configurations: Data Mappings, Data Validation, Hierarchy Manager, and customizing/configuring Informatica/Business Data Director.
- Created complex mappings that involved implementing business logic to load data into the staging area.
- Designed and developed Informatica MDM User Exits, Match rules, etc.
- Worked as an Informatica Administrator to migrate the mappings, sessions, workflows, repositories into the new environment.
- Designed and developed scripts for administrative tasks such as backups, tuning and periodically refreshing the test databases from the production databases.
- Developed mappings/sessions using Informatica Power Center 8.6 for data loading.
- Performed data manipulations using various Informatica transformations such as Filter, Expression, Lookup (Connected and Unconnected), Aggregator, Update Strategy, Normalizer, Joiner, Router, Sorter and Union.
- Developed workflows using Task Developer, Worklet Designer and Workflow Designer in Workflow Manager and monitored the results using Workflow Monitor.
- Built reports according to user requirements.
- Extracted data from Oracle and SQL Server, then used Teradata for data warehousing.
- Implemented slowly changing dimension methodology for accessing the full history of accounts.
- Wrote shell scripts to run workflows in the UNIX environment.
- Performed performance tuning at the source, target, mapping and session levels.
- Participated in weekly status meetings, conducted internal and external reviews as well as formal walkthroughs among various teams, and documented the proceedings.
Environment: Informatica PowerCenter 9.6, SQL Developer, AQT, Oracle 11g, UNIX Shell Scripting, Hive.
Confidential, Dallas TX
ETL Developer (Datastage)
Responsibilities:
- Designed and developed mappings between sources and operational staging targets, using Star and Snowflake schemas.
- Provided data models and data maps (extract, transform and load analysis) of the data marts for systems in the aggregation effort.
- Involved in extracting, cleansing, transforming, integrating and loading data into the data warehouse using DataStage Designer.
- Developed various transformations based on customer last name and zip code for internal business analytical purposes; loaded the warehouse based on customer credit card number with dynamic data re-partitioning.
- Developed user-defined Routines and Transformations using UniVerse BASIC.
- Used Datastage Manager for importing metadata from repository, new job categories and creating new data elements.
- Used the DataStage Director and the runtime engine to schedule runs of the solution, test and debug its components, and monitor the resulting executable versions (on an ad hoc or scheduled basis).
- Developed and maintained programs for scheduling data loading and transformations using DataStage and Oracle 12c.
- Developed Shell scripts to automate file manipulation and data loading procedures.
Environment: DataStage 9.1/8.7, Oracle 12c, SQL, TOAD, UNIX, Windows NT 4.0.
Confidential
Software Engineer
Responsibilities:
- Worked at a captive company on the team that runs payments to all customers of Confidential who claim their medical expenses from the company.
- Involved in documenting Functional Specifications, Design Specifications documents and created ETL Specifications documents and updated them as and when needed.
- Designed ETL specifications with transformation rules using ETL best practices for good performance, maintainability of the code and efficient restartability.
- Designed reusable objects such as mapplets and reusable transformations in Informatica.
- Experienced in developing mappings using transformations such as Source Qualifier, Aggregator, Lookup, Filter, Sequence Generator, Expression, Router, Update Strategy, Rank, XML, Normalizer, etc., to load data from different sources such as Oracle, flat files, Excel spreadsheets and XML files to the target Data Warehouse.
- Designed and developed the UNIX shell scripts for the automation of ETL jobs.
- Performed data validation in the target tables using complex SQLs to make sure all the modules are integrated properly.
- Involved in cleansing raw data in the staging area using stored procedures in pre- and post-session routines. Worked on Teradata upstream and downstream integration.
- Tested and tuned SQL queries for better performance. Conducted code reviews to make sure the business requirements were met and the coding standards were followed.
- Coordinated with upstream and downstream teams on a daily basis for daily batch runs for payment engine processing.
- The payment processing batch runs daily except on US holidays and processes 5 to 10 million claims a day. As part of the support and development team, monitored the daily progress of this batch from offshore during US daytime hours, providing critical 24x7 support year-round.
- Worked extensively with Teradata utilities such as BTEQ, FastExport, FastLoad and MultiLoad to export and load data to/from different source systems, including flat files.
- Shortly after joining the team, automated several of the reports sent to downstream teams, saving up to 50,000 USD per annum; also suggested and implemented many performance tuning techniques and developed new enhancements to existing systems.
- Supported the application in the production environment by monitoring the ETL process every day during the nightly loads.
Environment: Informatica 8.6, DataStage 9.1, SQL Developer, UNIX, Tivoli Scheduler, Teradata, Oracle 11g, DB2
Confidential
Senior Systems Engineer
Responsibilities:
- Designed, developed, and tested Informatica mappings, workflows, worklets, reusable objects, SQL queries and Shell scripts to implement complex business rules.
- Gathered the requirements from the Business Analyst on client side and developed necessary prototypes to demonstrate full length design and functionality.
- Developed new Informatica jobs from existing Talend code (Talend being open source), providing security and speed through Informatica without disturbing the existing functionality. This was an Agile project tracked on a day-to-day basis, staffed by two resources working from an offshore location in collaboration with HCL Technologies, a strategic partner for Confidential on this project.
- Achieved performance tuning and optimization through the management of indices, table partitioning and optimizing the SQL scripts.
- Created generic packages useful for other team members.
- Migrated the data from SQL Server and DB2 to Oracle.
- Created a mapping document that outlines the sources mapped to the targets.
- Created a document outlining the plan of action to be taken for the entire process.
- Created views to select data from the existing SQL Server databases.
- Designed mappings to load first the Staging tables and then the destination tables.
- Designed mappings using transformations such as Source Qualifier, Joiner, Expression, Lookup, Filter, Router, etc.
- Created different transformations using Informatica for loading the data into SQL Server database.
- Transferred the data from a combination of different input files like XML, Flat files to Oracle.
- Created, optimized, reviewed, and executed Complex SQL queries to validate transformation rules used in source to target mappings/source views, and to verify data in target tables.
- Created Functional Spec & Technical Spec documentation & also documented the issues found in the end to end testing.
- Generated SQL Loader scripts and Shell scripts for automated daily load processes.
- Developed triggers and stored procedures for data verification and processing.
- Extensively worked on database performance tuning techniques and modifying the complex join statements.
- Involved in the smooth transition from Informatica 7.1 to Informatica 8.0. Worked as an Informatica Administrator to migrate the mappings, sessions, workflows and repositories into the new environment.
- Designed and developed scripts for administrative tasks such as backups, tuning and periodically refreshing the test databases from the production databases.
- Documented detailed steps for migrating the code.
Environment: Informatica PowerCenter 9.1.5, SQL Developer, UNIX, Aginity for Netezza, Oracle 11g, Netezza, Autosys
Confidential
Systems Engineer
Responsibilities:
- Interacting with business owners to gather both functional and technical requirements.
- Documenting the business requirements and framing the business logic for the ETL process.
- Used Agile methodology for SDLC and utilized scrum meetings for creative and productive work.
- Transferring data from various sources such as XML, flat files and DB2 into the Oracle data warehouse.
- Migrating historical data from DB2 to the Oracle data warehouse; worked extensively on SCD Type 2 using the Lookup transformation.
- Identifying bottlenecks/issues and fine tuning them for optimal performance.
- Responsible for capturing, reporting, and correcting error data.
- Performed and automated many ETL-related tasks, including data cleansing, conversion, and transformations, to load the Oracle 10g-based Data Warehouse.
- Used Autosys for job scheduling, workload automation and for generating reports.
- Developed Shell/Perl scripts to transfer files using FTP and SFTP and to automate ETL jobs.
- Provided on-call support to the production system to resolve any issues.
- Conducting code walkthroughs and reviewing peer code and documentation.
- Playing a role in the design of scalable, reusable, and low-maintenance ETL templates.
- Developing technical specifications and other helpful ETL documents following standards.
- Involved in creating logical and physical data models using CA ERwin Data Modeler. Generated the DDL scripts for the physical data model.
- Designed and developed PL/SQL packages, stored procedures, tables, views, indexes, and functions; implemented best practices to maintain optimal performance.
- Designed, developed, and tested DataStage mappings, workflows, worklets, reusable objects, SQL queries and Shell scripts to implement complex business rules.
- Developed DataStage jobs to migrate data from one source system to another and created new enhancements and reports to accommodate Reason codes functionality in the existing systems.
- Gained experience through interaction with client and functional consultants in drafting technical specifications and test cases.
Environment: IBM DataStage 8.5 and 8.7, SQL Developer, UNIX, Oracle 10g/11g, IBM DB2, TOAD 9.0
