Senior ETL Consultant Resume
Norfolk, VA
SUMMARY:
- Around 9 years of experience in Information Technology with special emphasis on the design and development of Database/Data Warehousing/Client-Server applications.
- Experience in all phases of the data warehouse life cycle, including data analysis, design, development and testing, using ETL, Data Modeling, Online Analytical Processing and reporting tools.
- Extracted data from various sources such as Relational Sources (Oracle, SQL Server, Teradata and DB2) and File formats.
- Exposure to the AWS Cloud environment.
- Proficient in interacting with business users and in defining data loading strategies for the staging area and data marts.
- Strong hands-on experience developing scripts using Perl and UNIX scripting (FTP, SFTP, SCP and Shell Scripting).
- Good knowledge of the data integration tool Talend Open Studio 6.0 for implementing data integration methodologies.
- Involved in troubleshooting of DataStage jobs and addressing the production issues, performance tuning and enhancement.
- Good knowledge of the Teradata utilities FastLoad, MultiLoad, TPump and FastExport.
- Skilled in analyzing data dependencies using metadata stored in the DataStage Repository.
- Strong knowledge of DBMS concepts (views, foreign keys, primary keys, indexes, referential integrity, column constraints), SQL, PL/SQL, DDL generation and validation.
- Prepared job sequences according to data dependencies for existing and new jobs to facilitate scheduling of multiple jobs.
- Extensive experience with UNIX shell scripting to clean up prior run files, set up new UNIX directory structures, archive files, validate files, generate SQL scripts and run DataStage jobs.
- Used Version Control software to promote the DataStage jobs from Development to Testing and then to Production environment.
- Good knowledge of Data Warehouse concepts and principles (Kimball/Inmon) - Star Schema, Snowflake, SCD, Surrogate Keys, Normalization/De-normalization.
- Provided 24x7 production support for the data warehouse.
- Worked extensively on the development of large projects with complete end-to-end participation in all areas of the Software Development Life Cycle, and maintained documentation.
- Good knowledge of Hive and HBase tables.
- Good knowledge of big data technologies: Hadoop, Sqoop and Spark processing.
- Quick adaptability to new technologies and zeal to improve technical skills.
- Good analytical, programming, problem solving and troubleshooting skills.
- Strong team player, highly adaptable and a fast learner, with excellent interpersonal, written and oral communication skills.
TECHNICAL SKILLS:
IBM Software: IBM DataStage 11.x, 9.x, 8.x, 7.x, 6.x
Operating Systems: IBM AIX, Unix, Linux, Windows
Languages: Java, J2EE, C/C++, XML, XSL, SQL, Python, and Scala
Databases: Oracle 9i/11g, SQL Server 2005/2008/2012, DB2, Teradata, Netezza, Hive
Data Modelling Tools: Erwin 4.1/3.5, Microsoft Visio 2010
Other ETL Tools: SAS Data Integrator, Microsoft SSIS 2008 and 2012, Talend 6.2
Other Reporting Tools: Microsoft SSRS 2012, IBM Cognos 8.4 and 10.2
Other Applications: Toad, Oracle SQL Developer, HP AIM
Scheduling tools: IBM Tivoli, Control-M, Oozie
PROFESSIONAL EXPERIENCE:
Confidential, Norfolk, VA
Senior ETL Consultant
Responsibilities:
- Worked with Data Architects and Business Analysts to clarify ambiguous requirements and source-to-target mappings.
- Identified the sources as per the mapping document and analyzed the source data for better development of the code.
- Used the DataStage Designer to import table definitions, develop processes for extracting, cleansing, integrating, transforming and loading data into data warehouse and data marts.
- Designed the DataStage Parallel Jobs using different stages to load data from sources to Operational staging area and then to target Data warehouse/Data marts as per the Business Requirements Document.
- Extensively used different DataStage Stages like Database, Debug (Peek, Row Generator and Column Generator), File (Sequential File and Dataset) and Processing Stages (Copy, Change Capture, Aggregator, Surrogate Key Generator, Transformer, Lookup, Join, Sorter, Remove Duplicates, Pivot Enterprise and Funnel).
- Reduced the development effort by using the RCP (Run-time Column Propagation) option to load/unload the Staging, EDW and Mart tables.
- Used the DataStage Transformer stage looping feature to develop smaller jobs.
- Performed impact analysis on the metadata changes using DataStage metadata repository for better development.
- Tuned DataStage jobs to enhance their performance.
- Automated the DataStage project import and export process using the istool command-line client, in addition to manual import/export through Designer.
- Used DataStage Multiple Job compile option to compile the jobs at project level.
- Built utility jobs using UNIX shell scripts to clean up prior run/temp files, set up the new UNIX directory structure, archive files, back up DataStage projects and perform data management/validation.
- Wrote SQL scripts against Stage, EDW and Data Mart tables to validate results by counting rows, checking for NULL values and data truncation/rounding, and comparing data after resolving the ID columns.
- Imported and exported the environment variables using DataStage Administrator across the environments.
- Created job sequences for each dimension and fact, plus a performance-efficient Master Sequence to call the dimensions and facts in the correct order.
- Used DataStage Director and assisted with TWS to schedule jobs and sequences.
- Supported DataStage jobs migrated to production and resolved issues.
- Prepared the Test Case, Test Scripts and Test results documentation for the code migrations.
- Attended team meetings for status updates and prioritized work according to urgency.
- Involved in Unit, Integration, System and User Acceptance Testing (UAT).
- Documented the load process so that personnel can understand it and incorporate changes as and when necessary.
Environment: IBM InfoSphere DataStage 8.5.x EE (Parallel Extender, Designer, Director, Administrator), Oracle, Sequential Files (Fixed Width and Delimited), SQL, PL/SQL, UNIX Shell Scripts, Toad, IBM TWS, Linux, Windows.
Confidential, IL
Senior ETL Consultant
Responsibilities:
- Analyzed the existing application and business rules defined in Informatica mappings for quality development/enhancements.
- Extensively used SQL for Data Analysis and to understand the data behavior.
- Developed Informatica mappings, sessions and workflows to extract, transform and load data into different tables in Staging, Warehouse and Datamart environments.
- Extensively used transformations such as Source Qualifier, Aggregator, Expression, Lookup (connected and unconnected), Router, Filter, Sorter, Union, Normalizer, Update Strategy, Sequence Generator, Joiner, Transaction Control and Stored Procedure to implement the business logic.
- Migrated the Informatica code between environments using Informatica Repository Manager.
- Updated the job dependency document to incorporate the newly developed Dimension, Association and Fact table loading jobs.
- Defined the data loading strategies for the tables in the Staging, Warehouse and Mart environments.
- Developed job chains for the Control-M job scheduler to run jobs at the project level.
- Created the test plan and test strategy documentation to verify/validate the data.
- Validated the Testing results to make sure expected results were produced.
- Documented all the development activities for better understanding of the process and technical support.
- Participated in daily standup calls/status meetings to report design and development status and to discuss blockers.
- Wrote UNIX shell scripts to clean up prior run/temp files, archive files and run the Informatica workflow jobs.
Environment: IBM InfoSphere DataStage 8.1.x EE (Parallel Extender, Designer, Director, Administrator), Oracle, Teradata V12R2, Teradata SQL Assistant, Flat Files, Toad, SQL, PL/SQL, UNIX Shell Scripts, Putty, IBM AIX, Windows, Control-M.
Confidential, Bentonville, AR
ETL Datastage Developer
Responsibilities:
- Worked with Data Architects and Business Analysts to clarify ambiguous requirements and source-to-target mappings.
- Developed Parallel Jobs in DataStage Designer to extract data from Oracle and Complex Flat File sources, transform it by applying business rules and load it (Initial/Incremental) into Stage tables and then into warehouse and mart tables.
- Extensively used different DataStage Stages like Database, Debug (Peek, Row Generator and Column Generator), File (Sequential File and Dataset) and Processing Stages (Copy, Change Capture, Aggregator, Surrogate Key Generator, Transformer, Lookup, Join, Sorter, Remove Duplicates, Pivot Enterprise and Funnel).
- Reduced the development effort by using the Schema File and RCP (Run-time Column Propagation) options to load/unload the Staging, EDW and Mart tables.
- Built utility jobs using UNIX shell scripts to clean up prior run files, set up the new UNIX directory structure, archive files, and support data management and data loading.
- Prepared the job scheduling documents to run the Dimensions and Facts loading jobs in the correct order.
- Unit tested the jobs in Development environment by running manually using DataStage Director and verifying the job logs for warnings & errors.
- Exported the jobs from Development to Test environment and then to Production environment using DataStage Designer/Manager.
- Wrote unit test cases, tested a number of jobs and resolved defects in the developed jobs.
- Participated in team meetings to discuss issues and development status.
- Documented the load process so that personnel can understand it and incorporate changes as and when necessary.
- Wrote SQL queries to extract data for the ETL process.
Environment: IBM InfoSphere DataStage 8.5.x EE (Parallel Extender, Designer, Director, Administrator), Oracle 10g, MS SQL Server 2000/2005, Unix, Erwin, TOAD 8.6, Business Objects, Autosys, Windows XP Professional
Confidential, Malvern, PA
ETL/DataStage Developer
Responsibilities:
- Worked with Data Architects and Business Analysts to clarify ambiguous requirements and source-to-target mappings.
- Developed Parallel Jobs in DataStage Designer to extract data from Oracle and Complex Flat File sources, transform it by applying business rules and load it (Initial/Incremental) into Stage tables and then into warehouse and mart tables.
- Extensively used different DataStage Stages like Database, Debug (Peek, Row Generator and Column Generator), File (Sequential File and Dataset) and Processing Stages (Copy, Change Capture, Aggregator, Surrogate Key Generator, Transformer, Lookup, Join, Sorter, Remove Duplicates, Pivot Enterprise and Funnel).
- Reduced the development effort by using the Schema File and RCP (Run-time Column Propagation) options to load/unload the Staging, EDW and Mart tables.
- Built utility jobs using UNIX shell scripts to clean up prior run files, set up the new UNIX directory structure, archive files, and support data management and data loading.
- Prepared the job scheduling documents to run the Dimensions and Facts loading jobs in the correct order.
- Unit tested the jobs in Development environment by running manually using DataStage Director and verifying the job logs for warnings & errors.
- Exported the jobs from Development to Test environment and then to Production environment using DataStage Designer/Manager.
- Wrote unit test cases, tested a number of jobs and resolved defects in the developed jobs.
- Participated in team meetings to discuss issues and development status.
- Documented the load process so that personnel can understand it and incorporate changes as and when necessary.
Environment: IBM InfoSphere DataStage 8.5.x EE (Parallel Extender, Designer, Director, Administrator), Oracle, Toad, SQL, PL/SQL, UNIX Shell Scripts, IBM TWS, Linux, Windows 7 Enterprise.