Senior Data Engineer Resume

SUMMARY

  • A self-motivated, creative Data Engineer who is passionate about Big Data, with strong analytical and debugging skills. Quick learner and good team player with a proven ability to drive initiatives to completion.

TECHNICAL SKILLS

  • Teradata
  • Shell Scripting
  • Big Data
  • Teradata Utilities
  • AWS
  • Splunk
  • PySpark
  • Python
  • Hive

PROFESSIONAL EXPERIENCE

SENIOR DATA ENGINEER

Confidential

Responsibilities:

  • Architected and delivered a logistics data summary layer for Confidential that helped raise on-time delivery of promotional shipments to 99%.
  • Implemented the first data summary layer for World Wide Business Intelligence on the GBI Enterprise Data Warehouse in the reseller domain as part of a feasibility study, strengthening the partnership between two Confidential organizations.
  • Architected and delivered Confidential's first metadata-driven framework for handling Personally Identifiable Information, with an integrated feed from the fraud detection engine.
  • Built multiple self-service ETL pipelines to transfer data between data platforms across multiple data centers, facilitating data analysis and avoiding recurring development effort.
  • Optimized the Naive Bayes classifier computation on the EDW Teradata platform, saving 2 million CPU cycles per day.
  • Improved data quality and availability in the reporting layer and staging area by designing and building ETL monitoring solutions for one of the world's largest enterprise data warehouses.
  • Deployed a metadata-driven framework to meet the immediate data needs of data scientists discovering patterns to combat sudden fraud attacks and system abuse.
  • Implemented an intelligent clean-up process to purge unused objects in the Teradata EDW by parsing data lineage and leveraging SQL usage logs and the latest ETL status, significantly reducing migration and maintenance effort.
  • Optimized data ingestion for the largest table in the Confidential Teradata EDW, which receives terabytes of data per day.
  • Worked with data scientists to improve accuracy of fraud detection data models by effective data profiling.
  • Optimized ad hoc computation of features and labels owned by the Analytic Insight team and migrated it into documented, highly scalable, process-oriented semantic layers.
  • Worked in a fast-paced agile development environment to quickly analyze, develop, and test potential use cases for the business.
  • Partnered with the Confidential Data Insight team to build multiple scalable semantic layers that data scientists use to build models for detecting suspicious transactions, train data models, and extract SQL rules to feed the fraud decisioning engine.
  • Provided round-the-clock support for the Analytic Insight team during sudden fraud attacks and system abuse.
  • Experienced in interpreting data to draw conclusions for senior management and drive key strategies.
  • Mentored a 16-member team supporting the Confidential Analytics Insight team.
  • 5+ years of experience in the fraud analytics domain, with knowledge of data science algorithms, predictive analytics, and statistical computing.
  • Extensive debugging experience on Teradata and Hadoop platforms.
  • Implemented time series analysis to capture data gaps in fraud detection engine data.
  • Used Oracle extensively as a metadata repository for building data pipelines.
  • Debugged data scientists' Python code bases to ensure data accuracy.

SENIOR SOFTWARE ENGINEER

Confidential

Responsibilities:

  • Completed impact and data gap analysis for consolidating multiple BI reporting solutions for Confidential Consulting Services and implemented the consolidated solution with minimal issues.
  • Supported existing data mart solutions and identified areas of inconsistency to establish new best practices for future development.
  • Developed strategies to improve productivity and optimized processes for the support model.
  • Led a 12-member offshore team.

DATA CORPUS ENGINEER

Confidential

Responsibilities:

  • Automated a web data extraction process spanning 30 servers and more than 25 languages, with failure and progress notifications, cutting the effort required to track the efficiency and progress of data collation by 90%.
  • Optimized the Mega Corpus cleansing process for generating Speller and Thesaurus components using SQL Server, reducing effort by 20%.
  • Worked with native speakers across the globe to validate and verify the data corpus for more than 25 languages, facilitating the data acquisition process.
  • Implemented an automated text data quality evaluator that shortened the time window for data acquisition deals to 40% of the total timeframe.
  • Proactively worked with the third-party vendor of a PDF-to-text conversion utility to customize the code, improving lexicon data quality from 45% to 90%. This produced immediate cost savings, laid the foundation for future acquisition of PDF content, and reduced future data acquisition costs.
  • Worked with worldwide lexicon providers and developed C# web services to connect to and download data from their databases.
  • Wrote multi-threaded C# modules to generate lexicons from hundreds of gigabytes of text content.
  • Wrote SQL scripts in Confidential SQL Server to reduce data noise and eliminate abusive words for the speller component.

SOFTWARE ENGINEER

Confidential

Responsibilities:

  • Worked in a 35-member team to deliver a smart client application in C# and SQL Server for generating customer pricing sheets.
  • Optimized SQL Server procedures to improve data retrieval throughput in the UI layer.
  • Developed test cases and captured many boundary-case issues.
