Sr DataStage Consultant Resume

MN

SUMMARY:

  • 11 years of IT experience providing technical leadership on Data Warehousing, Data Integration and Migration projects involving Profiling, Design, Data Modeling, Development, Integration, Implementation, Maintenance, Testing, and Production Support of Applications.
  • 2+ years of experience developing and implementing Big Data technologies in core and enterprise software development initiatives, and applications that perform large-scale distributed data processing for Big Data analytics, using Java and Big Data ecosystem tools: Hadoop, Hive, Pig, Sqoop, HBase. Hands-on experience with various Hadoop distributions (Cloudera, MapR).
  • Experience with the Talend Big Data Enterprise, Talend Data Integration, and Talend Data Quality platforms, performing different types of transformations, file and database processing, and exception handling; worked on the Talend Administration Center (TAC) for deployment, job scheduling, and adding users.
  • Strong skills in IBM-DataStage 9.1/8.5/7.5, Informatica, Talend 5.3/6.2, SQL Programming, IBM DB2, Teradata, Oracle PL/SQL, Debugging, Performance tuning and Shell Scripting.
  • Knowledge and experience in design, development and deployments of Big Data projects using Hadoop / Data Analytics / NoSQL / Distributed Machine Learning frameworks.
  • Experience using Sqoop to import data into HDFS from RDBMS and vice-versa and dealing with log files to extract data and to copy into HDFS using Flume.
  • Expertise in logical and physical modeling of Landing/Staging/Foundation and Mart layers.
  • Used Model Mart of Erwin for effective model management of sharing, dividing and reusing model information and design for productivity improvement.
  • Created the High-level design documents and Source to Target Mapping for ETL/ELT process.
  • Reviewed SQL to ensure efficient plans and performance CPU thresholds. Worked closely with Architects to adjust SQL as necessary.
  • Extensive experience with Data Extraction, Transformation, and Loading (ETL) from multiple data sources Oracle, DB2, SQL Server, XML, Flat files into Analytical and Enterprise Data Model.
  • Coded complex SQL to load data into Foundation/Aggregate tables.
  • Worked closely with the business on requirements analysis, pre-scoping, data profiling, and prototyping activities to understand the requirements and system design.
  • Developed Parallel jobs using various stages like Join, Merge, Lookup, Surrogate key, SCD, Funnel, Sort, Transformer, Copy, Remove Duplicate, Filter, Pivot, Java Integration, CDC, XML, MQ, Teradata, SAP ABAP Extract, BW Extract, BW Load, Web Services/API Integration and Aggregator stages for grouping and summarizing on key performance indicators.
  • Led a team of 5 to 10 people and guided developers on technical design, preparation of test data, and resolution of defects across all phases of testing.
  • Expert in fulfillment of data warehouse project tasks such as Planning, Requirement gathering, identifying sources, and execution of projects including planning of implementation activities.
  • Hands on experience in using Talend Data Integration and Talend for Bigdata and
  • Good knowledge of the Energy, Healthcare, Insurance, and Automobile domains.
  • Worked in both Waterfall and Agile/Scrum methodologies.

TECHNICAL SKILLS:

ETL Tools: IBM DataStage, Talend, Informatica, Cisco Data Virtualization

Database: DB2, Oracle, SQL Server, Teradata, Netezza, MySQL, MarkLogic

Big Data: Hadoop Ecosystem, HDFS, MapReduce, Pig, Hive, Sqoop, HBase

NoSQL Database: HBase

IDE/Build Tools: Eclipse, Maven

Data Modeling: Erwin (ER Modeling, Star Flake, Snowflake)

Testing Tools: Quality Center

Reporting Tools: Business Objects

Languages: SQL, PL/SQL, Shell Scripting, Java, Python, Scala

CDC Tool: IBM Infosphere CDC

Version Control: GIT, MSTFS, PVCS, SVN

Scheduling Tools: Control-M, Tivoli, Autosys

Platform: Linux/Unix, Windows

PROFESSIONAL EXPERIENCE

Confidential, MN

Sr DataStage Consultant

Responsibilities:

  • Integrated and loaded pharmacy and drug information from different source systems so that downstream systems can use it for reporting. Currently working on restricting Federal employee and VIP data that the downstream systems are not supposed to receive.
  • Gathered the business requirements from the Business Partners and SMEs.
  • Involved in data analysis, preparation of design documents and mapping documents.
  • Involved in code review meetings.
  • Created RBAC views from Teradata to restrict access based on role.
  • 27 downstream systems need to be re-sourced from non-FEP/VIP data.
  • Created an automated process for removing temporary parameter files using a shell script and automated deployment process.
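
The parameter-file cleanup mentioned in the last bullet was done with a shell script; the same idea is sketched here in Python instead, purely as an illustration. The file pattern `*.params.tmp`, the one-day age threshold, and the function name are all assumptions, not details from the actual project.

```python
import time
from pathlib import Path


def remove_stale_param_files(directory, pattern="*.params.tmp",
                             max_age_seconds=86400, now=None):
    """Delete files in `directory` that match `pattern` and are older
    than `max_age_seconds`. Returns the sorted names of removed files.

    The pattern and threshold are hypothetical defaults.
    """
    now = time.time() if now is None else now
    removed = []
    for path in Path(directory).glob(pattern):
        # Only plain files whose last modification is older than the threshold.
        if path.is_file() and now - path.stat().st_mtime > max_age_seconds:
            path.unlink()
            removed.append(path.name)
    return sorted(removed)
```

A job like this would typically run from a scheduler after each batch cycle, so that parameter files from failed or finished runs do not accumulate.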

Confidential, MN

Sr DataStage Consultant

Responsibilities:

  • Delivered data ingestion to United Data Lake and data integration to NHI, which is the single source for all analytics and reporting dashboards. Worked on multiple sources to bring data into the Data Lake, build daily snapshots of the data, and load the data into NHI.
  • Gathered the business requirements from the Business Partners and SMEs.
  • Involved in data analysis, preparation of design documents and mapping documents.
  • Worked extensively with Sqoop for importing data from SQL Server and Teradata.
  • Designed patterns for history/incremental loads using CDC and Sqoop process.
  • Designed ETL jobs for extracting from Hive snapshot tables, transforming, and loading data into different stage/Base tables using Talend jobs.
  • Developed the jobs for Customer, Member, Medical/Dental/Vision claims domains and implemented reconciliation, reprocessing, scheduling of jobs.
  • Extensively worked on Reusable components, Routines, contexts, Global Variables.
  • Responsible for assisting in the development, execution and documentation of system and integration test plans, UAT and Regression phases; reviewed test cases and test strategy.
  • Perform quality monitoring and trending on data standardization processes.
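
The history/incremental load pattern listed above can be illustrated with a small watermark-based merge. This is a minimal sketch in plain Python, not the Sqoop or InfoSphere CDC configuration used on the project; the record layout (`key`, `op`, `ts`, `row`) and the function name are assumptions made for the example.

```python
def incremental_merge(target, changes, watermark):
    """Apply change records newer than `watermark` to a copy of `target`.

    target:    dict mapping business key -> row (the current table state)
    changes:   iterable of dicts with hypothetical fields
               'key', 'op' ('I', 'U' or 'D'), 'ts', and 'row'
    watermark: highest change timestamp already loaded

    Returns (merged_target, new_watermark).
    """
    merged = dict(target)
    new_watermark = watermark
    # Replay changes in timestamp order so the latest change wins.
    for change in sorted(changes, key=lambda c: c["ts"]):
        if change["ts"] <= watermark:
            continue  # already applied by an earlier run
        if change["op"] == "D":
            merged.pop(change["key"], None)
        else:
            merged[change["key"]] = change["row"]
        new_watermark = max(new_watermark, change["ts"])
    return merged, new_watermark
```

Persisting the returned watermark between runs is what makes the load incremental: a rerun with the same change feed applies nothing twice.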

Environment: Hadoop 2.0, Talend 5.3, Infosphere CDC, HDFS, Sqoop, Hive, Pig, HBase, HiveQL, Teradata, Linux, and Oozie/Tivoli Workload Scheduler.

Confidential, MN

Senior Data Consultant

Responsibilities:

  • Delivered data ingestion to United Data Lake and data integration to the Unified Data Warehouse (UDW), which is the single source for all analytics and reporting dashboards. Worked on multiple sources to bring data into the Data Lake, build daily snapshots of the data, and load the data into UDW.
  • Gathered the business requirements from the Business Partners and SMEs.
  • Involved in data analysis, preparation of design documents and mapping documents.
  • Developed data ingestion from various data sources (CDB, ORx, Cirus) and loaded the data into UDW.
  • Worked extensively with Sqoop for importing data from SQL Server and Teradata.
  • Designed patterns for history/incremental loads using CDC process.
  • Designed ETL jobs for extracting from source, transforming, and loading data into different stage/Base tables using DataStage jobs.
  • Developed the jobs for Customer, Member, Medical/Dental/Vision claims domains and implemented reconciliation, reprocessing, scheduling of jobs.
  • Extensively worked on Reusable components, Routines, contexts, Global Variables.
  • Responsible for assisting in the development, execution and documentation of system and integration test plans, UAT and Regression phases; reviewed test cases and test strategy.
  • Perform quality monitoring and trending on data standardization processes.

Environment: IBM DataStage, Teradata, Unix/Perl, Tivoli Workload Scheduler

Confidential, TX

Tech Lead

Responsibilities:

  • This project mainly concentrated on Data Integration, a Customer Enterprise View for Account Management, a Centralized Email Opt-Out Repository, Web Customer Query & Drill Down Capability, and Ad hoc Analysis.
  • Designed patterns for history/incremental loads using CDC and Sqoop process.
  • Developed data ingestion from various data sources, including the marketing database (mdb), and loaded the data into HDFS/Data Lake.
  • Involved in deploying multiple modules using Talend TAC, Maven and Jenkins.
  • Created OLAP data models in ERwin.
  • Maintained 100% uptime of the production system.
  • Sole resource in migrating ETL jobs from DataStage 7.5 to 9.1.
  • Assist in defining requirements and designing applications to meet business process and application requirements.
  • Involved in extracting CISPlus data and transforming legacy data into an SAP-acceptable format using DataStage.
  • Provided production support after project went live.
  • Managed the work of 3 team members, which helped the project reduce the team size.

Environment: DataStage 9.1, SAP.
