Senior Etl Developer / Technical Lead Resume
Lebanon, NJ
SUMMARY
- 16+ years of total IT experience in analysis, design, development, testing and production support of business applications for Operational/Transactional systems and Data Warehousing systems.
- 3+ years of experience using Hadoop, Hive and IBM BigIntegrate
- 16+ years of experience using Oracle 11g/10g/9i/8i, SQL, PL/SQL, SQL*Plus, SQL*Loader, Sun Solaris, HP - UX,, Windows XP/2000/98 and Windows NT.
- 16+ years of experience in ETL Design, Development, Testing and Production rollover with DataStage 11.5/9.1/8.5/8.1 /7.5.2 (Designer/Director/Administrator) involving Star Schema, and Snowflake schema with Fact, Dimension and Aggregate tables.
- Designed Mapping documents, ETL architecture documents and specifications.
- Analyzed the Source Data and designed the source system documentation.
- Participated in discussions with Project Manager, Business Analysts and Team Members on any technical and/or Business Requirement issues.
- Performed and documented unit test plan and test case scenarios of developed code.
- Documented the developed code for promotion to production environment.
- Used the Datastage Designer to develop processes for extracting, cleansing, transforming, integrating and loading data into data warehouse database.
- Experience in Converting the Business Logic into Technical Specifications.
- Extensive experience in loading high volume data and performance tuning.
- Hands on experience as ETL Architecture design, ETL process, mapping document.
- Strong understanding of data warehouse concepts and best practices including: Dimensional modeling Star schemas and Snowflake schema.
- Clear understanding of relational databases in a Business Intelligence context.
- Strong analytical and organizational skills with ability to work on multiple project/assignments.
- Ability to troubleshoot, debug, problem solve at the highest level, under pressure and across multiple platforms
- Expertise in UNIX Shell Programming
- Experience in working with AGILE and SCRUM methodologies.
- Wide range of experience in executing Onshore-Offshore model.
TECHNICAL SKILLS
Big Data: Hadoop, Hive, Pig, MapReduce, Ambari, Kerberos Authentication
Databases: Oracle 11g/10g/9i/8i/7.3, MS SQL Server, DB2, Access
Tools: IBM BigIntegrate, IBM InfoSphere DataStage and QualityStage 11.5/9.1/8.5/8.1 , Ascential DataStage 7.5.2 Ascential QualityStage 7.5, SQL Loader, Toad, PL/SQL Developer, SQL Developer, Microsoft Office Business Objects 6.5
Languages: SQL, PL/SQL, Korn Shell Scripts, HTML
Operating Systems: Windows 2000/NT/XP, UNIX Sun Solaris, DOS
Industries: Insurance, Pharmaceutical, Financial Services, Automotives
PROFESSIONAL EXPERIENCE
Confidential, Lebanon, NJ
Senior ETL Developer / Technical Lead
Responsibilities:
- Data Ingestion from 17 different sources like Policy, Marketer, Client, Beneficiary etc into the data lake using Sqoop import.
- Data tokenization using Protegrity function to protect PI data (sensitive data).
- Use data wrangling tools like Trifacta, waterline to perform data analysis to identify valid values.
- Create HLD/LLD/Source to Target Mapping Documents for moving the data from source to processed to published zones.
- Identify and define data elements required from various domains for Data Governance and Lineage.
- Set up the Big Integrate environment for the Datastage jobs to run on Hadoop Cluster.
- Involved in code reviews and suggest if any improvements required and ensure that standards are maintained.
- Supervised or performed lead role in database or ETL related projects.
- Create Hive tables using Beeline commands on Hadoop Edge Node and Ambari.
- Import Hive table definitions using Infosphere Metadata Asset Manager.
- Create SCD Type-2 parallel jobs to load data in hive tables in the source zone to processed and published zones.
- Extensively used Join, Merge, Lookup, Surrogate key, Change Capture, Funnel, Sort, Transformer, Copy, Remove Duplicate, Filter, Pivot and Aggregator stages for various transformations needed on data to conform to business rules.
- Develop sequence jobs with proper job dependencies, job control stages, triggers and notifications.
- Extensively used Hive connectors, had file connectors in WebHDFS mode to read/write to Hive tables. Knowledge of configuring hive connectors to run in Kerberos enabled Hadoop Cluster.
- Create HDFS scripts for copying files, data file manipulation, schedule jobs, data validation etc.
- Built standard Error handling framework to capture operational statistics, exceptions in data and rejects to report or reprocess.
- Document ETL test plans, test cases, test scripts, and validations based on design specifications for unit testing, system testing, functional testing.
- Participated in daily standup calls and scrum meetings and was responsible to create User Stories and tasks in Rally.
Environment: IBM BigIntegrate, Oracle 11g, Toad, Linux, Hadoop, Hive, Pig, MapReduce, Ambari, Kerberos Authentication
Confidential, Wilmington, DE
Senior ETL Developer / Technical Lead
Responsibilities:
- Worked with business analysts to understand the requirement on various projects.
- Involved in various partner conversions, launches and campaign programs.
- Analyzed existing programs or system designs and locate any errors or changes required.
- Involved in code reviews and suggest if any improvements required and ensure that standards are maintained.
- Supervised or performed lead role in database or ETL related projects.
- Executed projects successfully by working closely with business and collaborate the people with different backgrounds and technical peer groups.
- Sell the concept by influencing peer technology owners and business by strong technical acumen and business know-how.
- Used Client components IBM InfoSphere DataStage Designer/Director/Administrator.
- Developed Parallel jobs using stages OCI, Funnel, Join, Copy, Modify, Lookup, Sort, Aggregator, Transformer,Open Command, DataSet andFileSet etc.
- Prepared design, technical, development test plans and solution review documents as per the organization standards.
- Worked and coordinate with different teams to implement the project in different environments.
- Worked with different teams to analyze and set up a data feeds between Barclays to partner and vice-versa.
- Wrote PL/SQL packages, stored procedure, and functions using user defined TYPES/OBJECTS.
- Wrote stored procedure to update millions of rows using BULK COLLECT and FOR ALL features.
- Used RANGE partitions to Partition the tables as per business requirement.
- Used various analytical functions like RANK, ROW NUMBER and COUNT using PARTITION/ORDER BY features in SQL queries.
- Performance tuning of queries using HINTS and EXPLAIN PLAN.
- Data migration to various data warehouse environments using PL/SQL stored procedures, functions and UNIX.
- Implementing business logics using Datastage ETL tool and load data into various warehouse environments.
- Designed and Developed Datastage jobs to implement the business requirement using Web Services and also used SOAP UI to test the Web Service calls.
- Worked with different development methodologies like Waterfall and Agile (Kanban and Scrum)
Environment: IBM InfoSphere DataStage 9.1, Oracle 11g, WinCVS, Toad, AIX
Confidential, East Hanover, NJ
Senior ETL/DataStage Developer
Responsibilities:
- Worked with Data Modeler, Architects and Business Analysts to Understand the Requirements.
- Developed ETL process for many tables includes Fact, Dimensional, Lookup and Summary using Dimensional Modeling.
- Used Client components IBM InfoSphere DataStage Designer/Director/Administrator.
- Developed Parallel jobs using stages OCI, Funnel, Join, Copy, Modify, Lookup, Sort, Aggregator, Transformer,Open Command, DataSet and FileSet etc.
- Optimized long running queries using Explain Plan, Oracle Hints to improve performance.
- Tuned Long Running DataStage Jobs for better performance.
- Developed multiple DS routines for automation of the DataStage Jobs.
- Mapped the source and target databases by studying the specifications and analyzing the required transformations.
- Planned and implemented ETL and Database Object Migrations in a version controlled environment.
- Involved in Unit Testing, Integration Testing to compare and validate data between source and target.
- Extensively worked on migrating DataStage jobs from development to test and to production environments.
- Gathered requirements and analyzed data from various sources/groups for ETL Design.
- Created high / low level design documents, technical/system architecture documents.
- Involved with Design/Development of ETL components and mappings from source to stage and stage to data mart.
- Reviewed and updated technical specification documents for ETL process.
- Worked closely with Business Users and Data Quality Analysts after loading data for accuracy and consistency of data.
Environment: IBM InfoSphere DataStage 8.5, Oracle 11g, Cognos10.1, PL/SQL Developer 7.11, SQL Developer Version Control, AIX
Confidential, PA
Senior ETL/DataStage Developer
Responsibilities:
- Gathered requirements and Analyzed data from various sources/groups for ETL Design
- Created high level design documents, technical/system architecture
- Involved with Design/Development of ETL components and mappings from source to stage and stage to data mart.
- Modified and redesigned the existing ETL processes to increase performance to support multiple batch schedules and meet the SLA requirements.
- Used Client components DataStage Designer/Director/Manager/Administrator
- Developed Server jobs using stages ODBC, Link Partitioner, Aggregator, Transformer, Link Collector, and Hash File etc..
- Mapped the source and target databases by studying the specifications and analyzing the required transforms.
- Involved in Unit Testing, Integration Testing to compare and validate data between source and target.
- Extensively worked on migrating DataStage jobs from development to test and to production environments.
- Reviewed and updated technical specification documents for ETL process.
- Created of high level and low level design documents.
- Worked closely with Business Users and Data Quality Analysts after loading data for accuracy and consistency of data.
Environment: IBM InfoSphere DataStage 8.1, Oracle 10g, MS SQLServer, Version Control, HP UNIX
Confidential, Fort Washington, PA
Senior ETL/DataStage Developer
Responsibilities:
- Served as Technical Lead for the ECR ETL processes and managed offshore/onsite resources
- Gathered requirements and Analyzed data from various sources/groups for ETL Design
- Created high level design documents, technical/system architecture & dimensional data models
- Designed the Oracle Table structures for data warehouse application that included Staging, Partitioned, Dimension and Fact Tables.
- Involved with Design/Development of ETL components and mappings from source to stage and stage to data mart.
- Created several Oracle PL/SQL stored procedures, views, materialized views and triggers.
- Used Oracle database partitioning, partition exchange to load data quickly with little impact on the users
- Used Explain Plan and Oracle Hints to tune queries to improve system performance
- Responsible for UNIT, System and Integration testing. Developed Test scripts, Test plan and Test Data.
- Loaded data from flat files into the Mortgage database using SQL*Loader.
- Modified and redesigned the existing ETL processes to increase performance to support multiple batch schedules and meet the SLA requirements.
- Used Client components DataStage Designer/Director/Manager/Administrator
- Used the Data Stage Designer to design and develop jobs for extracting, cleansing, transforming, integrating, and loading data into different Data Marts.
- Planned and implemented ETL and Database Object Migrations in a version controlled environment
- Performed Unit Testing, Created Unit/System Test Plans and conducted system/integration testing.
- Wrote UNIX shell scripts and PERL scripts to run the jobs for automated batch processing.
- Documented implementation plans and system support guides
Confidential
Responsibilities:
- Worked with business analysts and end users to identify requirements and enhancements to existing functionality.
- Involved with current state system/data analysis from different sources
- Created System Design documents and conducted design reviews.
- Involved in logical/physical database design for slowly changing data dimension, FACT/Dimensional tables.
- Designed/Developed various ETL mappings to extract data from Confidential database
- Created ORACLE PL/SQL stored procedures and triggers.
- Created Business Objects Universe/metadata from EBdata warehouse databases.
- Created various data marts from data warehouse and generated reports using Business Objects report designer.
- Created different types of reports like Master/Detail, Cross tab and chart.
- Wrote UNIX scripts to perform various operations and Scheduled UNIX shell scripts using Cron jobs
- Involved with unit/system testing, user acceptance testing and documentation of test plans.
- Responsible for performance tuning, implementation, production support and creation of support documents.
Environment: Ascential Datastage 7.5.2 (Server and EE edition), Ascential QualityStage 7.5 with CASS Version Control, Oracle 10g/9i, HP UX, Perl Scripts, Business Object 6.5, Windows NT