Project Lead/Data Analyst Resume
Hoboken, NJ
SUMMARY:
- Over 10.5 years of experience in Data Warehousing, Business Intelligence, Data Migration, and Data Integration projects, with exposure to ETL and BI tools such as IBM InfoSphere DataStage (server and parallel), Informatica PowerCenter, SSIS, and BO reporting, across business domains including Insurance, Manufacturing, Automotive, and Pharmaceutical. Worked with databases such as Oracle, SQL Server, IBM Red Brick, and Teradata.
- Involved in the complete Software Development Life Cycle (SDLC) of various projects, including requirements gathering, system design, data modeling, ETL design and development, production enhancements, and maintenance.
- Experience in Performance Tuning and Optimization of Parallel Jobs and Server Jobs.
- Experience in data modeling, dimensional Star/Snowflake schema modeling, Fact and Dimension tables, and reverse engineering using Erwin.
- Strong experience in designing DataStage Parallel jobs, Sequencers, and Batch jobs using DataStage Manager, Designer, Administrator, and Director. Extensively used Parallel Extender to load data into the data warehouse in SMP/MPP environments.
- Experience in UNIX shell scripting for various project needs (for example file manipulation, database connectivity, report generation, and FTP), and strong knowledge of scheduling DataStage jobs using the DataStage scheduler, as well as familiarity with crontab. High-level knowledge of Autosys JIL code.
- Hands-on experience in writing, testing, and implementing triggers, procedures, functions, indexes, and partitions at the database level. Experienced with Oracle PL/SQL. Used the SQL*Loader utility to load high-volume historical data.
- Experience in data profiling and in data quality analysis and implementation on projects as a data analyst.
- Extensive experience with heterogeneous source and target stages including file, database, XML, and unstructured data. Implemented complex mappings using processing stages such as Transformer, Sort, Join, Lookup, Merge, Remove Duplicates, Filter, Dataset, Aggregator, CDC, SCD, SFDC, and SAP BW.
- Experience uploading data to and extracting data from AWS S3 buckets using DataStage.
- Experience in analyzing the data generated by the business process, defining the granularity, source to target mapping of the data elements, creating Indexes and Aggregate tables for the data warehouse design and development.
- Experience in analyzing and estimating the feasibility, costs, time, and resources needed to develop and implement application systems for projects with enterprise-wide impact.
- Involved in client interaction to understand requirements and provide Data Warehouse and Data Mart solutions for reporting users.
- Managed complexities in client expectations, timelines, standards, staffing, onshore / offshore co-ordination.
- Developed Informatica mappings and workflows, with experience in advanced Informatica techniques (dynamic caching, memory management, and parallel processing to increase throughput). In-depth experience reading from and writing to heterogeneous sources and targets such as the Salesforce cloud application, files, databases, and web services.
- Involved in optimization and tuning of Informatica mappings and sessions by identifying and eliminating bottlenecks, and through memory management and parallel threading.
- Conceptual and theoretical knowledge of MDM applications. Attended MDM conceptual training to support project implementation, and took part in brainstorming discussions on MDM projects.
TECHNICAL SKILLS:
OS: UNIX, Unix Shell scripts, Win-XP
Tools: IBM InfoSphere DataStage 7.5/8.5/9.1/11.3 | Quality Stage | Informatica PowerCenter 9.x | BO XI 3.1 | Metadata Manager
Database: Oracle 11g/10g/9i/8i; Oracle SQL, PL/SQL, SQL*Loader; Red Brick; SQL Server; Teradata
Other Tools: ServiceNow | REMEDY 7.0 | TFS | HPQC | SSIS
Big Data Ecosystem: MapReduce, Hive, HDFS, Oozie (to automate data loading into HDFS), and Pig (to pre-process data)
PROFESSIONAL EXPERIENCE:
Project Lead/Data Analyst
Confidential, Hoboken, NJ
Responsibilities:
- Involved in the complete Software Development Life Cycle (SDLC) of the project, including requirements gathering, ETL design, development, testing, and UAT.
- Extensively used DataStage Designer to design Parallel jobs and perform complex mappings and data transformations as required, using various DataStage processing, debug, and file stages. Broke down query-based jobs into modular DataStage ETL jobs to improve modularity, ease of maintenance, and performance.
- Applied performance tuning techniques in DataStage and at the Oracle database level to handle complex ETL processes.
- Worked on UNIX shell scripting for various project needs (for example file manipulation, database connectivity, and report generation).
- Worked on DataStage and Informatica code during the code change and development phases. Used the Teradata TPT (Teradata Parallel Transporter) connection in ETL to load data into Teradata.
- Involved in BO report testing and the UAT phase. Took part in UAT with the client and successfully completed the project, participating in the application go-live and cut-over phases.
- Analyzed and estimated the feasibility, costs, time, and resources needed to develop and implement application systems for projects with enterprise-wide impact.
- Involved in discussions with business users and stakeholders to understand requirements and provide Data Warehouse and Data Mart solutions for SQL users and reporting users.
- Established ETL Best Practices and architecture guidelines for new developers and existing ETL team.
- Managed complexities in client expectations, timelines, standards, staffing, onshore / offshore co-ordination.
Environment: UNIX, Win-XP, InfoSphere DataStage 9.1, Informatica PowerCenter 9.5, Metadata Manager, IDQ, BO XI 3.1, Teradata, Oracle 10g (SQL, PL/SQL), Unix Shell scripts, Force.com (SOQL tool), Toad, SQL Developer, ClearQuest
Integration Analyst/ETL Tech Lead
Confidential, Irvine, CA
Responsibilities:
- Involved in the complete Software Development Life Cycle (SDLC) of the project, including requirements gathering, system design, data modeling, ETL design, development, testing, UAT, production enhancements, and maintenance.
- Extensively used DataStage Designer to design Parallel jobs and perform complex mappings, data transformations, and data aggregation based on user specifications. Designed jobs using processing stages such as Transformer, Aggregator, Lookup, Funnel, Merge, Remove Duplicates, Filter, CDC, SCD, SFDC, XML, and SAP BW. Extensively used debugging stages such as Row Generator, Column Generator, and Peek for unit testing.
- Used the DataStage unstructured stage to read Excel files and write .xlsx files.
- Applied performance tuning techniques in DataStage and at the Oracle database level to handle complex ETL processes.
- Added, deleted, and set up DataStage projects and managed users from DataStage Administrator.
- Worked on UNIX shell scripting for various project needs (for example file manipulation, database connectivity, report generation, and FTP).
- Worked on writing, testing and implementation of the Triggers, Packages, Procedures, functions, Indexes, Partitions at Database level.
- Worked with various teams to migrate code from DataStage to Informatica. Began participating in the DataStage migration to version 11.3.
- Uploaded data to and extracted data from AWS S3 buckets using DataStage.
- Involved in understanding the business process and coordinating with business users, and worked as a data modeler to create new tables and objects on the existing design.
- Analyzed and estimated the feasibility, costs, time, and resources needed to develop and implement application systems for projects with enterprise-wide impact.
- Involved in discussions with business users, brand managers, and stakeholders to understand requirements and provide Data Warehouse and Data Mart solutions for SQL users and reporting users.
- Established ETL Best Practices and architecture guidelines for new developers and existing ETL team.
- Managed complexities in client expectations, timelines, standards, staffing, onshore / offshore co-ordination.
- Partially involved in the architectural design of an Oracle PL/SQL interface that communicates with web services through SAP PI, enabling a real-time online registration process for the client.
Environment: InfoSphere DataStage and Quality Stage 9.1/11.3, BO XI 3.1, Informatica PowerCenter 9.0, Oracle 10g/11g (SQL, PL/SQL), SQL Server, Unix, Shell scripting, Force.com (SOQL tool), Toad, SQL Developer, ServiceNow, Remedy
Project Lead/Sr. Developer
Confidential, Hoboken, NJ
Responsibilities:
- Worked with business users and architects from different teams to implement an ETL framework using a combination of DataStage Server and PX jobs.
- Extensively used DataStage Designer to design Parallel jobs and perform complex mappings, data transformations, and data aggregation based on user specifications. Designed jobs using processing stages such as Transformer, Aggregator, Lookup, Funnel, Filter, CDC, Join, Merge, Remove Duplicates, Dataset, Switch, Modify, FTP, Sort, Hash File, and Lookup File Set. Extensively used debugging stages such as Row Generator, Column Generator, and Peek for unit testing.
- Applied performance tuning techniques in DataStage and at the Oracle database level to handle complex ETL processes.
- Added, deleted, and set up DataStage projects and managed users from DataStage Administrator. Provided Autosys JIL code details to the respective team for scheduling various processes.
- Worked on UNIX shell scripting for various project needs (for example file manipulation, database connectivity, and report generation).
- Worked on writing, testing and implementation of the Triggers, Packages, Procedures, functions, Indexes, Partitions at Database level.
- Worked with the SSIS ETL team and helped develop the necessary components. Developed SSIS packages, control flows, and data flows for the required ETL transformations.
- Involved in understanding the business process and coordinating with business users, data modelers, and DBAs to create Dimension and Fact tables based on the existing Data Warehouse design.
- Involved in the complete Software Development Life Cycle (SDLC) of the project, including requirements gathering, system design, data modeling, ETL design, development, testing, UAT, production enhancements, and maintenance.
- Established ETL Best Practices and architecture guidelines for new developers and existing ETL team.
- Managed complexities in client expectations, timelines, standards, staffing, onshore / offshore co-ordination.
Environment: InfoSphere DataStage and Quality Stage 8.0, Oracle 9i (SQL, PL/SQL), SQL Server, Unix, Shell scripting, Informatica PowerCenter, IDQ, SSIS, Autosys
Sr. Software Developer/Lead
Confidential
Responsibilities:
- Involved in a major one-time migration project in which a legacy mainframe warranty system was migrated to a new Java-based application. Data was transformed and cleansed while generating CSV files, and the process was carried out in three phases.
- Prepared high-level and low-level design documents for client review.
- Applied performance tuning techniques in DataStage and at the Oracle database level to handle complex ETL processes.
- Added, deleted, and set up DataStage projects and managed users from DataStage Administrator.
- Extensively used DataStage Designer to design Parallel jobs and perform complex mappings, data transformations, and data aggregation based on user specifications. Designed jobs using processing stages such as Transformer, Aggregator, Lookup, Funnel, Filter, CDC, Join, Merge, Remove Duplicates, Dataset, Switch, Modify, FTP, Sort, Hash File, and Lookup File Set. Extensively used debugging stages such as Row Generator, Column Generator, and Peek for unit testing.
- Worked on UNIX shell scripting for various project needs (for example file manipulation, database connectivity, and report generation).
- Created unit test cases and documented them for approvals.
- Worked on migrating projects from the development environment to the system testing environment.
- Developed Job Sequencers with restart capability for the designed jobs using Job Activity, Exec Command, E-Mail Notification Activities and Triggers.
- Edited the scripts and added new jobs that had to be run.
- Prepared migration process documents to move the jobs from development to system testing and then to production.
- Worked closely with Data Quality Analysts, Business Users, and Analysts to verify data accuracy and consistency after table loads.
- Managed complexities in client expectations, timelines, standards, staffing, onshore / offshore co-ordination.
- Coordinated the go-live and historical data migration process to ensure a smooth cut-over.
Environment: InfoSphere DataStage and Quality Stage 8.0, Oracle 9i (SQL, PL/SQL), SQL Server, Unix, Shell scripting
Software Engineer
Confidential, Alviso, CA
Responsibilities:
- Extensively used DataStage Designer to design Parallel jobs and perform complex mappings, data transformations, and data aggregation based on user specifications. Designed jobs using processing stages such as Transformer, Aggregator, Lookup, Funnel, Filter, Join, Merge, Remove Duplicates, Dataset, Switch, Modify, FTP, Sort, and MQ. Extensively used debugging stages such as Row Generator, Column Generator, and Peek for unit testing.
- Involved in extracting data from various sources such as mainframe and Java applications. The files were transferred by FTP to the DataStage server and loaded into staging tables, which in turn were loaded into the respective tables through DataStage according to the ETL rules.
- Involved in the design and development of the Data Warehouse.
- Involved in writing UNIX shell scripts for file validation and for scheduling DataStage jobs.
- Worked closely with the database architect during the design and development of the ETL technical specification document. Prepared and updated project documents such as the HLD, LLD, and UTC.
- Extensively worked with Parallel Extender using Parallel Processing (Pipeline and partition parallelism) techniques to improve job performance while working with bulk data sources.
- Used PL/SQL functions and stored procedures to load data into Data Marts.
- Used DataStage Designer to develop processes for extracting, cleansing, transforming, integrating, and loading data into the Data Warehouse database.
- Performed import and export of DataStage components and table definitions.
- Used DataStage Director to run and monitor jobs, and automated job control using batch logic to execute and schedule various DataStage jobs.
- Performed unit testing of the developed jobs to ensure that they met the requirements, and documented the unit test plan and test case scenarios for the developed code.
- Documented the Data Warehouse development process and performed knowledge transfer to the Business Intelligence developer.
Environment: UNIX, Win-XP, DataStage 7.5.2 Parallel Extender, Oracle SQL, PL/SQL, Unix Shell script, Toad, SQL Developer
Project Lead/Data Analyst
Confidential, Dallas, TX
Responsibilities:
- Involved in designing ETL jobs to extract data from the Red Brick warehouse and load it into the Oracle database after performing the necessary transformations, while also loading data into Oracle from other external feeds coming from mainframe and Windows-based machines.
- Involved in loading 7 years of historical data from the data warehouse into the Oracle database using Oracle SQL*Loader (sqlldr) scripts. Created scripts to run the load process in the background on the UNIX server and then reconcile the load.
- Involved in writing UNIX shell scripts for file validation and for scheduling DataStage jobs.
- Used DataStage Designer to develop processes for extracting, cleansing, transforming, integrating, and loading data into the Data Warehouse database.
- Designed jobs using DataStage stages such as Sequential File, Join, Merge, Lookup, Remove Duplicates, Filter, Lookup File Set, Hash File, Aggregator, Sort, Transformer, and ODBC.
- Developed batch jobs with DS routines to provide batch sequencing and notification activities.
- Used PL/SQL functions and stored procedures to load data into Data Marts.
- Performed import and export of DataStage components and table definitions.
- Used DataStage Director to run and monitor jobs, and automated job control using batch logic to execute and schedule DataStage jobs.
- Performed unit testing of the developed jobs to ensure that they met the requirements, and documented the unit test plan and test case scenarios for the developed code.
- Documented the Data Warehouse development process and performed knowledge transfer to the Business Intelligence developer.
Environment: UNIX, Win-XP, DataStage 6.0 Server Edition, Oracle SQL, PL/SQL, Red Brick Warehouse, Unix Shell scripts