We provide IT Staff Augmentation Services!

Etl/big Data Consultant Resume

SUMMARY

  • Around 11 years of experience in the field of system analysis, design, development, testing and implementation of Relational Databases and Data Warehousing systems and big data applications.
  • Extensive working experience in IBM Information Server DataStage and QualityStage 11.7/11.5/11.3/8. x/7x.
  • Experienced in Designing, Developing, Documenting, Testing and implementing the ETL jobs to populate tables in Data Warehouse and Data Marts.
  • Handsome Experience in Retail, Banking and Financial Services domains.
  • Performed Debugging, Troubleshooting, Monitoring and Performance Tuning for Data Stage jobs.
  • Expert in designing Datastage Parallel jobs using various stages like Join, Merge, Lookup, Remove duplicates, Dataset, Complex flat file, Aggregator, XML,ODBC Connector, Teradata Connector.
  • Extensive working experience with Big data and Hadoop Technologies like Hive, Sqoop,Flume,Pig,HDFS and Oozie.
  • In depth knowledge of Dimensional Data Modeling, Star Schema & Snowflake.
  • Extensive knowledge of Teradata Bulk/Immediate load methods and BTEQ scripts.
  • Experience working on Quality Stage for data cleansing and data standardization process.
  • RDBMS experience with Teradata, Oracle, SQL Server, DB2 and Sybase including database development.
  • Handsome working experience in CA WorkStation ESP, Autosys, Zena, Control - M tool for scheduling Datastage jobs.
  • Worked with project teams and information architects to develop business process.
  • Strong expertise in database development using client tool like Teradata SQL Assistant, SQL Server Management Studio, TOAD, Sybase Interactive SQL.
  • Having extensive experience in creating unix scripts and FTP process.
  • Handsome working experience in creating test requirement, test plan and test cases.
  • Extensive experience in planning, designing and conducting Unit and System Integration Tests, correcting errors and re-testing to deliver an error-free product.
  • Proposing options and preparing cost estimates to enable the business area to make informed decisions.
  • Possess good knowledge on Onsite-Offshore Model, proven problem solving skills, good conceptual foundation, and ability to handle pressure.
  • Raising awareness when existing code, systems or processes do not meet current quality expectations and standards.
  • Maintaining code and all related artifacts in source control, perform code merges and resolve conflicts as defined by development process, update documentation and automated tests.
  • Highly motivated team player with ability to lead, manage and work in all environments to meet deadlines.

TECHNICAL SKILLS

ETL Tools: DataStage 11.7/11.5/11.3/9.1/8.5/8.1/7.5 , Informatica Power Centre

Modelling Tools: Erwin, ER Studio, Visio

Languages: SQL,Unix Scripting,PL/SQL,Python

RDBMS: Teradata, Oracle, SQL Server, DB2,Sybase, MS-Access

Version Controllers: TFS, PVCS, Visual Source Safe

Reporting tools: Micro strategy, Business Objects, Power BI

Change Management: BMC Remedy

Operating Systems: Windows, Unix, Linux

Big Data Platform: Hadoop 2.6, HDP Framework 3.0

Big Data Tools: Hadoop, Hive, Sqoop, Flume, Kafka, Pig, HDFS, Oozie,Spark

NoSql DBs: Cassandra, HBase

PROFESSIONAL EXPERIENCE

Confidential

ETL/Big Data Consultant

Responsibilities:

  • Working closely with business analysts in requirements gathering, reviewing business rules and identifying data sources.
  • Involving in designing and development of Tablet Usage Data(TUD), Remedy(RMD), Customer Behavior Datawarehouse(CBD) and ATM Hardware(AHA) projects.
  • Extensively used Data Stage Designer, Quality Stage, and Administrator Director for creating and implementing jobs to load the Data Marts/Datawarehouse.
  • Created logical and physical data models, business rules and data mapping for the Enterprise Data Warehouse system.
  • Involved in the Dimensional modeling of the Data warehouse, successfully implemented the slowly changing dimensions.
  • Developing DataStage jobs extensively using Join, Aggregator, Sort, Merge and Data Set in Parallel Extender to achieve better job performance.
  • Coordinate with Data scientists and Business Analyst and responsible for writing the complex Hive queries and get insights that convert the potential value of big data into real, tangible business value.
  • Involving in migrating the data using Sqoop from HDFS to Relational Database System and vice-versa according to client's requirement.
  • Developed pig scripts for analyzing large data sets in the HDFS.
  • Responsible for creating Hive tables, loading the structured data resulted from MapReduce jobs into the tables and writing hive queries to further analyze the logs to identify issues and behavioral patterns.
  • Used Hive to form an abstraction on top of structured data that resides in HDFS and implemented Partitions, Dynamic Partitions, Buckets on HIVE tables.
  • Responsible for performing extensive data validation using Hive.
  • Sqoop jobs, PIG and Hive scripts were created for data ingestion from relational databases to compare with historical data.
  • Involved in converting Hive/SQL queries into Spark transformation using Spark RDD in Pyspark.
  • Knowledge on handling Hive queries using Spark SQL that integrate with Spark environment.
  • Involved in creating Oozie workflow and Coordinator jobs to kick off the jobs on time for data availability.
  • Coordinating with Functional, System and UAT testing.
  • Monitoring and fixing production job failure.

Environment: DataStage 11.7/11.5/11.3 (IBM Infosphere Information Server Console, Designer, Quality Stage, Director), Oracle 11g, SQL Server 2012, Linux, Windows 7, Big Data, Hive, Spark,Sqoop, Pig, Flume, Ooozie, HDFS,Python.

Confidential

ETL Consultant

Responsibilities:

  • Worked closely with business analysts in requirements gathering, reviewing business rules and identifying data sources.
  • Work with the project and business teams to understand the business processes involved in the applications.
  • Involving in designing and development of Supply Chain,Logistic,Space Management,Predictix,mPerks,Market Basket,ICAP,HR-PeopleSoft,Finance,Pricing & Promo, Digital Account, Digital Transformation and Marketing & Analytical projects.
  • Extensively used Data Stage Designer, Quality Stage, and Administrator Director for creating and implementing jobs to load the Data Marts/Datawarehouse.
  • Used Quality Stage to analyzed Address/Name validation, Phone Number Verification and duplicate records.
  • Worked with Data Modelling team for building the dimensional model for the proposed system.
  • Aided in the design and development of the logical and physical data models, business rules and data mapping for the Enterprise Data Warehouse system.
  • Involved in the Dimensional modeling of the Data warehouse, successfully implemented the slowly changing dimensions.
  • Interacted with Datastage Administrator to fix the environmental issues.
  • Developed DataStage jobs extensively using Join, Aggregator, Sort, Merge and Data Set in Parallel Extender to achieve better job performance.
  • Developed DataStage Quality jobs using Investigate, Standardize, Data Rules, Survive, Match Frequency stages to perform source data analysis.
  • Used DataStage sequencer jobs extensively to take care of inter dependencies and to run datastage parallel jobs in load order.
  • Created ESP jobs to schedule the ETL jobs in CA WorkStation.
  • Created implementation plans and Change Management Requests.
  • Involved to prepare mapping document and designing the data model.
  • Created TFS(Team Foundation Server) requirement, test cases and test plan
  • Executed test cases in Test Case Manager (TCM).
  • Reviewed and approved Detailed Design Document.
  • Coordinated with Functional, System and UAT testing.
  • Monitored and fixed production job failure.
  • Prepared production readiness Review documents.
  • Coordinated with Development team, SME, Customer and Data Modeler to deliver the project on time.

Environment: DataStage11.3 (IBM Infosphere Information Server Console, Designer, Quality Stage, Director), Alteryx,Teradata,DB2,Sybase, Oracle 11g, SQL Server 2012, Unix, CA WorkStation, Windows 7,Microstrategy.

Confidential

Team Lead

Responsibilities:

  • Involved in designing, development, testing and implementing of Supply Chain, Logistic, Space Management, mPerks, Market Basket, ICAP, HR-PeopleSoft, Finance, Digital Transformation and Marketing & Analytical projects.
  • Reviewed mapping documents and TFS requirements.
  • Prepared Detailed Design Documents.
  • Extensively used Data Stage Designer, Quality Stage, and Administrator Director for creating and implementing jobs to load the Data Marts/Datawarehouse.
  • Used IBM Infosphere Information Server Console, IBM WebSphere QualityStage and Datastage to create Industry Standard to the address for Address Data Standardization Project.
  • Involved in the Dimensional modeling of the Data warehouse, successfully implemented the slowly changing dimensions.
  • Developing DataStage jobs extensively using Join, Aggregator, Sort, Merge and Data Set in Parallel Extender to achieve better job performance.
  • Using DataStage sequencer jobs extensively to take care of inter dependencies and to run datastage parallel jobs in load order.
  • Creating ESP jobs to schedule the ETL jobs in CA WorkStation.
  • Involving to prepare mapping document and designing the data model.
  • Executing test cases in Test Case Manager (TCM).
  • Coordinating with unit and System testing.
  • Monitoring and fixing production job failure.
  • Leading and providing technical solutions to team members.

Environment: DataStage 11.3/8.5 (IBM Infosphere Information Server Console, Designer, Quality Stage, Director), Teradata,DB2,Sybase, Oracle 11g, SQL Server 2012/2014, Unix, CA WorkStation, Windows 7,Microstrategy.

Confidential

Sr.DataStage Developer

Responsibilities:

  • Involved in Develop ETL jobs using datastage to extract, transform and load the data into data warehouse.
  • Extensively used Data Stage Designer, and Administrator Director for creating and implementing jobs to load the Data Marts/Data warehouse.
  • Involved to prepare HLD and LLD documents.
  • Involved in Designing Jobs using various stages like Sequential File, Dataset file, Filter, Lookup, Join, Aggregate etc.
  • Used DataStage sequencer jobs extensively to take care of inter dependencies and to run datastage parallel jobs in load order.
  • Prepared Unit Test Cases and System Test Cases.
  • Involved to create unix scripts.
  • Coordinated the team members & assist them to make delivery on time

Environment: DataStage 8 .1, Informatica Power Centre, DB2, Oracle, SQL Server 2010, Unix, Control-M, HP Quality Centre.

Confidential

Sr.DataStage Developer

Responsibilities:

  • Involved in Develop ETL jobs using datastage to extract, transform and load the data into data warehouse.
  • Extensively used Data Stage Designer, and Administrator Director for creating and implementing jobs to load the Data Marts/Data warehouse.
  • Involved to prepare HLD and LLD documents.
  • Involved in Designing Jobs using various stages like Sequential File, Dataset file, Filter, Lookup, Join, Aggregate etc.
  • Used DataStage sequencer jobs extensively to take care of inter dependencies and to run datastage parallel jobs in load order.
  • Prepared and executed Unit Test Cases and System Test Cases.
  • Involved to create unix scripts.
  • Coordinated the team members & assist them to make delivery on time

Environment: Data Stage 8x, DB2, SQL Server, Oracle 9i, Toad, Zena, and Windows.

Hire Now