We provide IT Staff Augmentation Services!

Data Engineer Resume

4.00/5 (Submit Your Rating)

Bothell, WA

OBJECTIVE

  • Seeking for an Opportunity where I can utilize experience and skills involved with system that effectively contribute to the growth of an organization.

SUMMARY

  • More than 3 years of IT experience with special emphasis on Analysis, Design, Development and Testing of ETLmethodologies in all the phases of the Data Warehousing.
  • 2 Years of strong experience in writing SQL queries and optimizing the queries in Oracle, MS SQL Server and Teradata.
  • Hands on experience in implementing Slowly Changing dimension types (I, II &III) Methodologies, Incremental Loads and Change Data Capture (CDC).
  • In - depth understanding of Star Schema, Snow Flake Schema, Normalization, 1st NF, 2nd NF, 3rd NF, Fact tables, Dimension tables.
  • Experience in optimizing and performance tuning of Mappings and implementing the complex business rules by creating re-usable Transformations, Mapplets and Tasks.
  • Hands-on Expertise in Data Warehouse programming concepts such as SQL Server Stored Procedures, PL/SQL, Tableau, Teradata, JavaScript and HTML.
  • Extensively used SQL and PL/SQL for development of Procedures, Functions, Packages and Triggers.
  • Good knowledge of Normalization, Fact Tables and Dimension Tables, also dealing with OLAP and OLTP systems.
  • Experience in implementing Data Warehouse, Datamart, ODS, OLTP and OLAP, teamed with project scope. Familiar with Top-Down & Bottom-Up Data Warehouse approaches.
  • Experience working with data modelers to translate business rules/requirements into conceptual/logical Dimensional model and worked with Complex Denormalized and Normalized data models.
  • Extensive experience with ETL tool Informatica in Designing the Workflows, Worklets, Tasks and Mappings, scheduling and monitoring the Workflows and sessions using Informatica Power Center 9.1/8.x/7.x.
  • Experienced on Tableau Desktop, Tableau Server and good understanding of tableau architecture.
  • Expertise in Java/J2EE technologies such as Core Java, JDBC, HTML
  • Experience in designing both time driven and data driven automated workflows using Oozie.
  • Experience in ETL operation on Sqoop, Hive, Pig, HBase, Teradata, Tableau.

TECHNICAL SKILLS

Data Warehousing: Informatica Power Center, Power Connect, Power Exchange, Informatica PowerMart, Informatica Web services, Informatica MDM 10.1/9.X, Oracle Data.

Business Tools: MS Access, Tableau.

Big Data: Hadoop, Map Reduce 1.0/2.0, Pig, Hive, Hbase, Sqoop, Flume,oozie,spark.

Databases and Related Tools: MySQL, Oracle 10g/9i/8i/8/7.x, Teradata, PL/SQL, Hive, HDFS, TOAD 8.5.1/7.5/6.2.

Languages: Java / J2EE, SQL, JDBC.

Operating System: Mac OS, Unix, Linux (Various Versions), Windows 2003/7/8/8.1/XP

Web Development: HTML

Application Server: Apache Tomcat, WebLogic, WebSphere Tools Eclipse, NetBeans

PROFESSIONAL EXPERIENCE

Confidential, Bothell, WA

DATA ENGINEER

Responsibilities:

  • Developed the services to run the Map-Reduce jobs as per the requirement basis.
  • Importing and exporting data into HDFS and HIVE, PIG using Sqoop.
  • Responsible to manage data coming from different sources.
  • Built data pipeline using Pig and Java/Scala Map Reduce to store onto HDFS.
  • Designing ETL Data Pipeline flow to ingest the data from RDBMS source to Hadoop using shell script, sqoop.
  • Responsible for loading data from UNIX file systems to HDFS. Installed and configured Hive and written Pig/Hive UDFs.
  • Involved in creating Hive Tables, loading with data and writing Hive queries which will invoke and run MapReduce jobs in the backend.
  • Worked with NoSQL databases like HBase in creating HBase tables to load large sets of semi structured data coming from various sources.
  • Implemented the workflows using Apache oozie framework to automate tasks.
  • Developing design documents considering all possible approaches and identifying best of them.
  • Loading Data into HBase using Bulk Load and Non-bulk load.
  • Involved in converting Hive/SQL queries into Spark transformations using Spark RDD, Scala and Python.
  • Involved in gathering the requirements, designing, development and testing.
  • Ability to work with onsite and offshore team members.
  • Able to work on own initiative, highly proactive, self-motivated commitment towards work and resourceful.
  • Strong debugging and critical thinking ability with good understanding of frameworks advancement in methodologies and strategies.

Environment: Cloudera, MySQL, Apache HBase, HDFS, MapReduce, Hive, PIG, Sqoop, SQL, Windows, Linux.

Confidential, Chicago, IL

ETL Developer

Responsibilities:

  • Extensive experience with Data Extraction, Transformation, and Loading (ETL) from heterogeneous Data sources of Multiple Relational Databases like Oracle, Teradata, DB2, SQL Server, MS Access and Worked on integrating data from flat files like fixed width and delimited, CSV, XML into a common reporting and analytical Data Model using Informatica.
  • Experience with Teradata utilities like Fast load, Fast Export, Multi Load, TPUMP & TPT. Have experience in creating BTEQ scripts.
  • Used Unix Command and Unix Shell Scripting to interact with the server and to move flat files and to load the files in the server.
  • Worked extensively in tuning the current ETL processes for improving the performance by implementing database partitioning and increasing block size, data cache size and SQL overrides.
  • Worked on tuning ETL dimension and fact table loads by using optimization techniques and tuning mappings and database objects and improved performance, availability and throughput.
  • Experience in optimizing and performance tuning of Mappings and implementing the complex business rules by creating re-usable transformations, Mapplets and Tasks.
  • Involved in designing, developing and documenting of the ETL(Extract, Transformation and Load) strategy to populate the Data Warehouse from various source systems feeds using Informatica, PL/SQL scripts.
  • Loaded the data into the Teradata database using Load utilities like (Fast Export, Fast Load, and Multiload). Experience on Oracle utilities like SQL Loader, TOAD and Worked extensively on PL/SQL as part of the process to develop several scripts to handle different scenarios.
  • Extensively used Transformations like Router, Aggregator, Normalizer, Joiner, Expression and Lookup, Update strategy and Sequence generator and Stored Procedure.
  • Created and used Filters, Quick Filters, table Calculations and parameters on Tableau reports.Published TableauDashboards into TableauServer.

Environment: Informatica Power Center 9.1, InformaticaData Studio, Windows, DB visualizer, Putty, WinSCP, Teradata 13.x, Teradata SQL Assistant, Tableau, UNIX Shell Scripting, PL-SQL, Oracle 10g/ 9i/11g, MS-SQL Server, Windows XP and MS Office Suite, MS Office and Delimited Flat files.

Confidential

Oracle PL/SQL Developer

Responsibilities:

  • Worked for commerce clients in designing and developing databases for POS systems. The system was developed to maintain the entire information about the material availability and finished goods, delivery details, purchases, vendor’s information and catalog information.
  • Creating PL/SQL Procedures, Packages, Functions for billing module.
  • Developing Forms and Reports.
  • Developed custom reports for various modules as per the client requirement.
  • Query Optimization, Tuning SQL queries.
  • Unit Testing on Forms and Reports and PL/SQL Stored Procedures, Functions, Triggers, Packages.
  • Developed Oracle Stored procedures and database objects for the whole billing database.
  • Created Stored Procedures, Functions and Triggers using SQL*Plus to be invoked by shell scripts and forms.
  • Tuned the billing module to improve the performance, because of growing number of transactions.
  • Developed the Oracle Reports to show the users various employee reports, Salary slabs, promotion, tax, loans and advances.

Environment: Oracle 9i RDBMS Enterprise Edition, Microsoft windows server 2003, PL/SQL, Oracle 9i Application Server Enterprise Edition, SQL loader.

We'd love your feedback!