Lead Etl Consultant Resume Profile
Bentonville, AR
Professional Summary
- With 8 Years of work experience in Data Warehousing, Data integration and conversion processes, Design, Development BIG DATA Technologies
- Proficient in Data analysis, Data modeling, Database design and Data migration.
- Extensive experience with Data Warehousing and Business Intelligence applications using IBM InfoSphere DataStage 8.x/7.x Server Edition and Enterprise Edition Administrator, Manager, Designer, Director .
- Expertise in translating business requirements into Data Warehouse and Data Mart design and developing ETL logic based on the requirements using DataStage.
- Proficient in data warehousing techniques to perform data profiling, data analysis, implementing slowly changing dimension, surrogate key generation.
- Experience with IBM Websphere QualityStage in Standardizing and Matching data.
- Extensive expertise in working with Data Warehouses as well as Oracle 11g/10g/9i, DB2 TERADATA GREENPLUM 4.2 Databases
- Strong experience in SQL, Triggers, Debugging, Troubleshooting and Performance Tuning.
- Well Versed with BIG DATA Technologies ,HDFS File Commands, Hadoop Clusters, Sqoop, Hive , Internal External Tables
- Worked extensively on different types of stages like Data Set, Aggregator, Transformer, Merge, Join, Modify, Lookup, Sort, Slowly changing dimension, Change Capture/Apply, XML Transformations, and Column Import/ Export.
- Good working experience on various Operating Systems.
Technical Skills
Data Warehousing | IBM Websphere DataStage 8.x/7.x Manager, Designer, Director, Administrator , IBM Websphere QualityStage, Datamart, OLAP, OLTP, Star Snowflake Schema, Fact Dimension Tables, Physical Logical Data Modeling |
Databases | Oracle 11g/10g/9i, My SQL, SQL Server 2005/2000/7.0/6.5, DB2, |
Languages | SQL, Unix Shell Scripting |
Operating System | HP-UX, IBM AIX, Red Hat Linux, Windows 2000/XP/Vista, Windows NT 4.0, Mac OS X |
Software | SQL Server 2000, MS-Office, |
Tools | SQL Developer, DB 2 Connect, SQL Plus, SQL Loader, Autosys |
PROFESSIONAL EXPERIENCE
Position: Lead ETL Consultant
Confidential
Responsibilities
- Participated in the design review meetings to come up with the proper methodology that is to be followed uniformly across the development.
- Collaborated with project team members in providing design and development guidance, mentoring, best practices.
- Implemented data extraction and load processes in a parallel framework.
- Created the Schema file to dynamically pass the metadata in Sequential file stage.
- Created Shared Container to encapsulate the logic that is commonly used across the project.
- Created ETL/DI designs for various source and target Databases like Oracle- Exadata, Teradata, SQL Server, DB2 Green PLum
- Created Unix flow to unload data from Teradata and load to Green plum Database
- Experienced on working TERADATA PARALLAEL TRANSPORTER
- Created the multi-instance Generic Load Parallel Job that is called in the Job Sequence to load the data into the Teradata Database, Oracle-Exadata, Teradata DB2,SQL SERVER GREENPLUM
- Worked with widely used stages like Flat File, Lookup, Join, Pivot, Transformer, Sort, Aggregator, Merge, Row Generator, and Column Generator and also Troubleshooted the designed jobs and tested the jobs for all logical errors.
- Participated in the Design review of normalized and de-normalized data repositories.
- Provided the necessary details to the Testing Team for them to create their test cases for UAT.
- Implemented HDFS file system commands to load to Hadoop Clusters
- Well Versed with Hive External Internal Tables
- Worked on Sqoop to import and Export Tables to and fro from Hadoop and RDBMS TERADATA
- Worked on Yaml config file to load Green Plum Database from sources like DB2, MVS, and TERADATA.
- Created DML's for loading Staging and Base Tables for Green Plum Database
- Carried out unit testing, system testing and integration testing.
- Provided the necessary support in terms of information required by the DataStage Administrator in order to solve the performance related issues.
- Provided Standard Documentation, Best practice, Common ETL Project Templates. Fine Tune jobs/Process to higher performances debug critical/complex job.
Environment
IBM InfoSphere Information Sever 8.5,IBM Information Server Datastage 8.1 Designer, Director , Windows Server 2003, AIX5.2, Oracle11g/10g, SQL Loader, PL/SQL, UNIX, IBM DB2 UDB, Web Sphere, Teradata 13.10,Green Plum 4.2,Post gre SQL 8.2,Hadoop Clusters, Hive, HDFS,BIG DATA ,TPT ,IBM Blueprint Director, Oracle Exadata, SQL Server, Db Visualizer , Erwin Model, Power Designer
Position: ETL Consultant
Confidential
Responsibilities
- Involved in meetings with key stakeholders and client Managers and gathered requirements, and converted them into technical specifications.
- Interacted with End user community to understand the business requirements and in identifying data sources.
- Analyzed the existing informational sources and methods to identify problem areas and make recommendations for improvement. This required a detailed understanding of the data sources and researching possible solutions.
- Created the mapping document for source to target.
- Worked with Datastage Manager for importing metadata from repository, new job Categories and creating new data elements.
- Designed and developed ETL processes using DataStage designer to load data from Oracle, MS SQL, Flat Files and XML files to staging database and from staging to the target Data Warehouse database.
- Used DataStage stages namely Hash file, Sequential file, Transformer, Aggregate, Sort, Datasets, Join, Lookup, Change Capture, Funnel, Peek, Row Generator stages in accomplishing the ETL Coding.
- Developed job sequencer with proper job dependencies, job control stages, triggers.
- Used QualityStage to ensure consistency, removing data anomalies and spelling errors of the source information before being delivered for further processing.
- Excessively used DS Director for monitoring Job logs to resolve issues.
- Involved in performance tuning and optimization of DataStage mappings using features like Pipeline and Partition Parallelism and data/index cache to manage very large volume of data.
- Documented ETL test plans, test cases, test scripts, and validations based on design specifications for unit testing, system testing, functional testing, prepared test data for testing, error handling and analysis.
- Used Autosys scheduling jobs.
- Assisted QA/UAT cycle by resolving the defects quickly.
- Wrote Configuration files for Performance in production environment.
- Participated in weekly status meetings.
- Analyzed/Profiled the source data and involved in the gap analysis and implemented rules to cleanse the data.
- Created different shell scripts and used them as pre/post session commands.
- Produced a Unit test Document, which captures the test conditions and scripts, expected/actual results.
- Created PL/SQL procedures to handle some complex logic.
- Implemented archiving and error handling of data loads.
- Closely worked with Business Objects BO team to ensure proper data in the reports.
- Assisted QA/UAT cycle by resolving the defects quickly.
- Interaction with Offshore Team everyday based on the development activities/Tickets/issues and follow up with them.
- Trained end users and supported them in resolving their issues.
- Worked concurrently on different projects prioritizing the tasks.
Environment
IBM Information Server Datastage 8.1, Designer, Director , Windows Server 2003, AIX5.2, Oracle10g/9i, SQL Loader, PL/SQL, UNIX, IBM DB2 UDB, Teradata, Web Sphere, XML.
Position: DataStage Developer
Confidential
Responsibilities
- Generate snowflake and star schema dimension modeling.
- Created physical and logical models using Erwin.
- Involved in the creation of mapping documents from source to data marts.
- Worked closely with data warehouse architect and business intelligence analyst in developing solutions.
- Created DataStage jobs to extract data from sequential files, flat files, MS Access and Oracle.
- Used DataStage Manager for importing Metadata from repository and for creating new job categories and data elements.
- Used Parallel Extender for parallel processing of data extraction and transformation.
- Written functions, transforms, before job routine and after job routines using DataStage basic.
- Used Data Stage Manager for importing metadata from repository, new job categories and creating new data elements
- Used Data Stage debugger to troubleshoot the designed jobs.
- Tuned DataStage transformations and jobs to enhance their performance.
- Implementation of business rules.
Environment
Ascential DataStage 7.5.2 Designer, Director, Manager, Administrator , Quality Stage, Oracle 9i, Sybase, DB2, UNIX Shell Scripting, PL/SQL, MS Access, Erwin, Windows NT, Metadata.
Position: ETL Developer
Confidential
Roles Responsibilities
- Designed Data Warehouse Star Schema with Facts and Dimension Tables.
- Designed the mappings between sources external files and databases to targets.
- Developed ETL processes that will ensure conformity, compliance with standards and lack of redundancy, translating business rules, functionality requirements.
- Designed jobs in Designer that extract data from sequential and relational sources, perform reference lookups from hashed files, and load data into sequential and relational targets.
- Used DataStage Manager for creating and importing metadata definitions in repository, viewed and edited the contents of the repository.
- Used Lookup, Sort, Merge, Funnel, Filter, Transformer stages.
- Used the DataStage Director to scheduling, monitoring and debugging the jobs.
- Troubleshooted the jobs using the job log and the DataStage debugger.
- Developed jobs that use parallel processing techniques such as row buffering, Partitioning Collecting.
- Controlled the flow of job execution based on job status codes, link record counts, and user-defined conditions.
Environment
Data Stage 7.5, Oracle 10g, SQL Plus, SQL Loader, SQL Navigator, MS-SQL Server 2000, Erwin, Windows NT 4.0, Windows 2003, Autosys.