DataStage/Talend Developer Resume
Chicago, IL
PROFESSIONAL SUMMARY
- 7+ years of total IT experience in analysis, design, development and implementation of Enterprise Data Warehousing and Business Intelligence systems across industries such as Healthcare, Pharma (Clinical Trial Management), Health Insurance, Life Sciences, Manufacturing, Pharmaceuticals, Learning and ERP.
- Responsible for all activities related to the development, implementation, administration and support of ETL and BI processes for large-scale data warehouses.
- Five plus (5+) years of diverse professional IT experience in analysis, design, development and implementation of software solutions using data warehousing tools such as DataStage 8.5, Informatica PowerCenter 9.1 and Talend 6.4.
- Three plus (3+) years of Business Intelligence experience in Business Objects XI R2/6.x, Crystal Reports XI/9.0/8.5.
- Five plus (5+) years of extensive Database experience using Oracle 10g/9i/8i, Sybase ASE 11.x, SQL Server 2000, PL/SQL, SQL*Plus on UNIX, LINUX, Windows Vista/NT/XP/2000/9X.
- Highly proficient in conceptual, logical and physical data modeling, including Star and Snowflake schemas, using ERwin and Visio.
- Sound knowledge of OLTP, OLAP systems and excellent knowledge in RDBMS principles.
- Proven track record in troubleshooting DataStage jobs and addressing production issues, including performance tuning, enhancements and job scheduling using DataStage Director.
- Ability to interact with Business partners to identify information needs & business requirements.
- Programming experience in deploying VBA Macros, PL/SQL and DTS Packages.
- Involved in various stages of testing (Black Box, White Box, Unit, System, Integration, User Acceptance Test) and automated testing using Mercury tools such as WinRunner.
- Held significant roles as a Process/Quality Facilitator and participated in Internal Quality Audits and PMRs (Project Management Reviews).
- Web programming using HTML/ASP and VBScript.
- Expertise in designing & coding Stored Procedures, Functions, Triggers using PL/SQL.
- Coordinated with the Off-shore Development Team in developing project specific tools like Issue Tracker, Trial Tracker and web based knowledge repositories.
- Played a lead role and successfully executed 5 offshore and 4 onsite projects with team sizes ranging from 7 to 10. Skilled in project management activities such as preparing project proposals, estimations, SOWs, project plans, design specs and master test strategies, and reviewing project deliverables.
- Participated in Corporate Academy activities like handling DataStage and Business Objects training sessions for Entry level trainees and associates across domains.
- Key strengths include the ability to learn on the job, teamwork, technical capability, customer interaction, work ethic and consulting skills.
PROFESSIONAL EXPERIENCE
Confidential, Chicago, IL
DataStage/Talend Developer
Responsibilities:
- Worked with Data mapping team to understand the source to target mapping rules.
- Analyzed the requirements, framed the business logic and implemented it using Talend.
- Involved in ETL design and documentation.
- Analyzed and performed data integration using Talend open integration suite.
- Worked on the design, development and testing of Talend mappings.
- Created ETL job infrastructure using Talend Open Studio.
- Worked on Talend components such as tReplace, tMap, tSortRow, tFilterColumn, tFilterRow, tJava, tJavaRow and tConvertType.
- Used database components such as tMSSqlInput, tMSSqlRow, tMSSqlOutput, tOracleInput and tOracleOutput.
- Worked with various file components such as tFileCopy, tFileCompare, tFileExist, tFileDelete and tFileRename.
- Worked on improving the performance of Talend jobs.
- Created triggers for a Talend job to run automatically on server.
- Worked on exporting and importing Talend jobs.
- Created jobs to pass parameters from child job to parent job.
- Exported jobs to Nexus and SVN repository.
- Implemented an update strategy on tables and used tJava and tJavaRow components to pull only newly inserted data from source tables.
- Monitored Talend job statistics in AMC (Activity Monitoring Console) to improve performance and identify the scenarios in which errors occur.
- Created Generic and Repository schemas.
- Developed a project-specific deployment job that packages the Talend jar files as a zip file on the Windows environment; the zip file is then unzipped and the files are deployed to the UNIX box.
- The same deployment job also maintains versioning of the Talend jobs deployed in the UNIX environment.
- Developed shell scripts in the UNIX environment to support scheduling of the Talend jobs.
- Monitored the daily, weekly and ad hoc runs that load data into the target systems.
Environment: Talend 5.5.2, UNIX, Shell script, SQL Server, Oracle, Business Objects, ERwin, SVN, Redgate, Capterra.
Confidential, Cleveland, OH
DataStage Developer
Responsibilities:
- Analyzed source data and gathered requirements from the business users.
- Worked with a Snowflake schema for building the data mart.
- Prepared technical specifications to develop ETL transformations that load data into various tables, conforming to the business rules.
- Designed and developed jobs for extracting, transforming, integrating and loading data into the staging and target tables using DataStage Designer; used DataStage Manager to import metadata from the repository, create new job categories and create new data elements.
- Solely responsible for the daily loads and handling the reject data.
- Developed interfaces using UNIX shell scripts to automate the bulk load and update processes.
- Extensively used the Sequential File, Aggregator, Join, Oracle Enterprise, Transformer, File Set, Sort, Teradata API, XML, Remove Duplicates, Peek, Surrogate Key, Data Set and Siebel plug-in stages to develop the server jobs.
- Generated surrogate keys for composite attributes using Key Management functions while loading data into the data warehouse.
- During the implementation phase, tuned DataStage jobs for optimum performance.
- Made performance improvements to the database by building partitioned tables, index-organized tables and bitmap indexes.
- Scheduled and monitored DataStage jobs, analyzed the performance of individual stages and ran multiple instances of a job using DataStage Director.
- Wrote and tuned DB2 SQL queries to improve performance at the query level.
- Worked on programs for scheduling jobs, using DataStage and the cron scheduler, that load data from the Siebel database to Oracle 9i.
- Extensively used Shared Containers and Job Sequencer to simplify complex jobs and to run jobs in sequence.
- Involved in the preparation of ETL documentation, following the business rules, procedures and naming conventions.
- Worked with the Cognos reporting team on extensive data mart reporting, including slice and dice, drill-down and drill-through.
- Wrote test plans, test scenarios, test cases and test scripts, and performed unit, integration, system and user acceptance testing.
- Provided 24/7 production support.
Environment: DataStage 8.5 (DataStage Manager, Designer, Director, Integrity and MetaRecon), ERwin 7.0, Teradata V2R5, Oracle 10g, PL/SQL, BO 6.0, MS Access 2000, Shell Programming, Autosys, INFORMIX database, db files, Control M, SQL*Loader, DB2 UDB 7.0, Mainframes, OS/390, C, C++, JDE, Sun Solaris, VB, Toad, SQL Navigator, Excel, Unix scripting, Win NT
Confidential, VA
DataStage Developer/Analyst
Responsibilities:
- Designed, developed and Unit tested using DataStage for Extraction, Transformation and Loading of data from source to target.
- Wrote test cases to validate code changes and documented the entire process.
- Extensively involved in deploying jobs and monitoring them after UAT approval.
- Analyzed business and accounting requirements from the detail-level accounting and business process design.
- Worked with end users and business analysts to understand their requirements and developed strategies for ETL processes.
- Responsible for the detailed design documentation.
- Provided technical solutions for the Process requests raised by Data team to fix the issues in the existing system.
- Extensively used database and dataset components such as Input File, Input Table and Output Table; transform components such as Join, Scan, Filter by Expression and Reformat; and other components such as Merge, Lookup and Sort.
- Implemented component level, pipeline and data parallelism using DataStage for ETL process.
- Troubleshot DataStage jobs, fixed bugs and addressed production issues such as performance tuning and enhancements.
- Extensively involved in the import and export of DataStage jobs.
- Used partition components such as Partition by Expression and Partition by Key to run the middle-layer processing in parallel.
- Extensively worked in the UNIX environment using Shell Scripts.
Environment: DataStage 7.5 (Manager, Designer, Director, Administrator), Parallel Extender, Oracle 9i, SQL, PL/SQL, TOAD, UNIX, Shell Scripts, Windows XP
Confidential, Minneapolis, MN
DataStage Developer/Analyst
Responsibilities:
- Started with enhancements and bug fixes in the production enterprise data warehouse (EDW), then moved on to developing the EDW using DataStage jobs.
- Extensively used DataStage Designer to develop processes for extracting, transforming, integrating and loading data from various sources into the data warehouse database.
- Created DataStage jobs to extract data from mainframe files to the staging area and from staging to the base database. Extensively used DataStage stages such as Sequential File, Copy, Aggregator, Surrogate Key, Transformer, Teradata Enterprise, MultiLoad, Data Set, Lookup, Join, Remove Duplicates, Sort, Column Generator and Funnel.
- Wrote SQL queries to extract data from different tables and used them in Teradata stages. Used the Teradata MultiLoad stage to create log and error tables for resolving issues during data loads.
- Performed performance tuning of jobs while developing them in DataStage.
- Created various complex DataStage jobs to load data from the staging area to the base database and from there to the data mart.
- Developed staging and data mart jobs using DataStage Designer in a parallel environment, wrote unit test cases, tested a number of jobs and resolved defects in the developed jobs.
- Involved in all phases including Requirement Analysis, Design, Coding, Testing, Support and Documentation.
- Extensively used SQL in DataStage jobs for processing data.
- Loaded data from several flat-file sources using Teradata MultiLoad (MLOAD).
- Transferred large volumes of data using Teradata FastExport, MultiLoad and TPump.
- Developed batches and sequencers in Designer to run and control sets of jobs.
- Developed several complex ETL jobs for Historical data loads and ongoing data loads using various active and passive stages of DataStage.
- Used DataStage Director and its run-time engine to schedule and run jobs, test and debug their components, and monitor the resulting executable versions.
- Used DataStage Manager to import metadata from the repository, create new job categories and create new data elements.
- Imported and exported jobs by category and maintained regular backups.
- Performed unit testing of the developed jobs to ensure that they meet the requirements.
- Used Parallel Extender to run jobs on SMP systems.
Environment: DataStage 7.5 (Administrator, Manager, Designer, Director), Teradata V2R6, Win 2000, SQL, C++, XML, AIX, UNIX scripting, ERwin 6.1.0, DB2 DB, IBM mainframe, Cobol flat files, Oracle 10g, ClearCase and Rational Quest.