We provide IT Staff Augmentation Services!

Datastage/talend Developer Resume

3.00/5 (Submit Your Rating)

Atlanta, GA

SUMMARY

  • Over 7 years of ETL experience in development with strong skills in data integration, data warehousing, and relational database systems.
  • Highly motivated, productive, and customer focused team player with excellent communication, analytical, and problem - solving skills.
  • Ability to grasp and adapt new skills quickly and ready for new challenges.
  • I have demonstrated excellent leadership skills in handling offshore and delivered all my deliverables on time.
  • Experience with multiple ETL tools - DataStage8.x/11.x, TalendEnterprise platform 7.x
  • Experience in converting from DataStage to Talend migration
  • Experience in creating mapping with stages like Aggregator, sort, funnel, lookup, join, transformer, xmlinput, Hierarchical, CDC etc.
  • Expertise in creating mappings in Talend using tMap, tjoin, tReplicate, tParallelize, tConvertType, tFlowToIterate, tAggregateRow, tSortRow, tFlowMeter, tLogCatcher, tRowGenerator, tNormalize, tDenormalize, tSetGlobalVar, tHashInput, tHashoutput, tJava, tLogCatcher, tFilterRow etc.
  • Experience with application development in implementing Data Warehouses and Data Marts
  • Designed and developed jobs using Parallel Extender for splitting bulk data into subsets and to dynamically distribute to all available nodes to achieve best Job performance.
  • Strong experience in Data Analysis, Data Profiling, Data Conversion, Data Quality, Data Governance and Metadata Management Services and Configuration Management.
  • Experience in Integration of various data sources like DB2, SQL Server, Oracle and Teradata into staging area.
  • Extensive experience in developing strategies for Extraction, Transformation, Loading (ETL) data from various sources into Data Warehouse and Data Marts.
  • Created derivations and business rules to be used by ETL for mapping source data for the population of Data Warehouse and Data Marts.
  • Expertise in creating Star/Snowflake Schema Designing and Dimensional Modeling.
  • Experience in UNIX Shell Scripting and Automation of ETL process
  • Extensive experience in writing SQL scripts, functions, stored procedures and packages using PL/SQL.
  • Excellent team member with problem-solving and trouble-shooting capabilities. Quick learner, highly motivated, result oriented and strong ability to meet deadlines.
  • Experience in Control-M, Autosys and Robert scheduling tools.
  • Experience in using GIT, SVN versioning tools.
  • Experience in working and creating incidents by using JIRA, CISM

TECHNICAL SKILLS

Data Warehouse (ETL) Tools: IBM Info sphere DataStage 8.5,11.3, 11.5 (Parallel Extender), COBOL, TalendEnterprise platform7.x

Languages: C, C++, SQL

RDBMS: DB2, Oracle 10g/11g, Teradata, MS SQL Server, DB2, Mongo dB

Scheduling Tools: Robot Scheduler, AutoSys, Control-M

Operating Systems: Windows, UNIX, Linux

Other tools: JIRA, CISM, GIT, SVN

PROFESSIONAL EXPERIENCE

Confidential, Atlanta, GA

DataStage/Talend Developer

Responsibilities:

  • Build DataStage jobs to read files from multiple sources using Complex Flat File and Sequential File stages and different operational DB's.
  • Developed and maintained accurate project documentation and worked with DA’s to get data model from Erwin tool.
  • Analyze users’ needs and then design, develop test and develop software to meet those needs
  • Prepared technical data flow proposals for enhancements and integration of existing code in production.
  • Worked with ITSME’s to understand requirement and prepared the Designed mapping document
  • Worked on conversion project from DataStage to talend and mapped for various Sources (.txt, .csv, xml, Json) and load the data from these sources into relational tables with Talend Enterprise Edition.
  • Used Talend’s most used components (tMap, tDie, tConvertType, tFlowMeter, tLogCatcher, tRowGenerator, tjobrun tSetGlobalVar, tHashInput & tHashOutput and many more).
  • Worked SCDs to populate Type I and Type II slowly changing dimension tables from several operational source files
  • Implemented Talend to extract data from XML, JSON & flat files and load data into SQL Server/Oracle Database for downstream process.
  • Worked on stages like Aggregator, sort, funnel, lookup, join, transformer, xmlinput, Hierarchical Stage
  • Automated SFTP process by exchanging SSH keys between UNIX servers. Worked Extensively on Talend Admin Console (TAC)and Schedule Jobs in Job Conductor.
  • Expertise in working on Star/Snowflake Schema data models and Created ETL DataStage parallel jobs to extract and reformat and loaded in DWH.
  • Worked in an environment with many other application dependencies like sending FTP flag files to trigger the jobs like Cognos cubes, mainframe dependent jobs.
  • Converted Cobol jobs to DataStage jobs.
  • Written UNIX shell scripts to retrieve data from various databases and file move scripts.
  • Used different types of DB connector stages like DB2, ODBC.
  • Used different File stages like XML, Sequential files, Data sets and unstructured stages.
  • Developed complex SQL’s in Data studio to retrieve the data from DWH.
  • Design and development of common jobs to update Batch Status, Control, File Info, App Parameter tables.
  • Developed Scheduling jobs in CTRL-M scheduler, which invoked from Data stage jobs.

Environment: IBM DataStage 11.5, TalendEnterprise platform7.x Windows, Linux, Oracle11g/10g, DB2, Teradata, Cobol,Mainframe,SQL, CTRL-M, Data studio, Jira, CISM, GIT

Confidential, Charlotte, NC

DataStage Consultant

Responsibilities:

  • Gathered and analyzed business requirements from users to prepare BRD.
  • Worked in capacity as a functional person, manage client requirements, issues & communications throughout project life cycle and rollout.
  • Worked alongside Data Architects, Data Modelers and other functional teams to understand and analyze the requirements in order to develop the workflow process of parallel jobs using DataStage.
  • Performed profiling on CDE’s to provide range and valid values from various source when starting new initiatives to provide data domain knowledge to business community.
  • Performed profiling on Primary and Natural keys to determine whether the candidate keys are suitable for forming a primary or natural key at the inception of projects.
  • Extensively worked on Teradata utility like Fast Load to load the data from file to a table while doing a history load. and used Multiload utility to load the incremental data into the tables.
  • Performed GAP analysis on various systems and produced IA reports to data analysts and business analysts to help them in requirements gathering.
  • Worked with business teams and technical teams to capture all metadata related information is captured and maintained in business glossary.
  • Worked on XML Hierarchical Stage to extract data from Jason file and load to DWH
  • Experienced with DataStage Parallel Extender for partitioning the data into subsets and load balancing across all available processors to achieve job performance
  • Worked on analyzing and maintaining metadata across applications and models throughout the organization which enables to provide data lineage across multiple platforms.
  • Prepared Best Practices document for developers to use when developing parallel jobs using DataStage Enterprise edition.
  • Creation of Unix shell scripts and developed and executed the Unit Test Cases
  • Prepared Development Design Standards document to establish consistent naming standards to improve development productivity and quality.
  • Involved in DataStage upgrade project to migrate parallel jobs from DataStage 8.5 to DataStage 11.3
  • Prepared mapping documents, conceptual design, and Technical design documents for loading extracts to ODS (Operational Data Store).
  • Interacted with Unix Admin to setup Project Structure, estimate space required for the project.
  • Involved in creating Infrastructure for new Projects to handle Inbound Audit, Outbound Audit process, Error Management and Batch Status details for DataStage jobs.
  • Involved in designing and development of common jobs to update common Batch Status, Control, File Info, App Parameter tables.
  • Developed and assisted other developers to create complex DataStage jobs.
  • Involved in moving DataStage code from development to Test and Production environment through CVS.

Environment: DataStage 11.3 (Designer, Manager, Director), Business Glossary, Visio, Windows NT, UNIX, Oracle10g/9i, DB2, SQL Server2000, Oracle Apps, Toad, PL/SQL, Autosys Scheduler

Confidential, Eagan, MN

DataStage Developer

Responsibilities:

  • Build DataStage jobs to read files from multiple vendors using Complex Flat File and Sequential File stages and different operational DB's
  • Perform reconciliation of the data by matching Row counts and Hash Totals
  • Cleanse and transform the data as per business logic
  • Design Unit Test Cases
  • Migrated data from DB2 to Teradata
  • Write Unix shell scripts to retrieve data from various databases
  • Work with various stages of parallel extender like sequential file and merge stage and Aggregator, Row Generator, Datasets, Lookup, Transformer and many more to design the job and load data into fact and dimension tables.
  • Load data into DB2 tables using DB2 connector stage
  • Create parallel and server Routines functions
  • Analyzing and fixing the code as part of SIT and UAT
  • Worked on creating mapping with stages like Aggregator, sort, funnel, lookup, join, transformer, xmlinput, Hierarchical, CDC etc.
  • Creating the data Stage parallel job to extract data from various sources and operation mainframe database, UDB database like ODBC, SQLserver and load it into target data warehouse like DB2, Oracle
  • Design and development of common jobs to update Batch Status, Control, File Info, App Parameter tables
  • Worked on Agile method Development
  • Involved in moving DataStage code from development to Test and Production environment through TSV Subversion tool
  • Developed Scheduling jobs in Robert schedular which invoked from Data stage jobs and ini files

Environment: DataStage 11.3 (Designer, Manager, Director) Windows UNIX, Oracle11g/10g, DB2, Teradata, SQL, Robot Scheduler, C++ Subroutines, HP Quality center

We'd love your feedback!