We provide IT Staff Augmentation Services!

Data Engineer Resume

0/5 (Submit Your Rating)

New, YorK

SUMMARY

  • 8+ years of experience in building data - oriented systems and supporting projects in ETL development, Data Integration, Data Analysis, Data Modeling, Data Warehousing, Data Management, Business Intelligence and Machine Learning.

TECHNICAL SKILLS

Programming: Python, SQL, PL/SQL, C/C++, Cplex, XML, Json, UNIX Shell Scripting, HTML, CSS, Pig, Hive, Java, Julia

Databases: Oracle 11g/10i/9i/8i, MongoDB 2.6.1, MySQL, PostgreSQL, MS Access 2000, Netezza

ETL Tools: Ab initio 3.1/3.2, Clover ETL 3.2/3.4/3.5, Informatica 7/9, Alteryx

Data Modeling: Erwin 7.x, ER studio, ArchiMate

BI and data mining: Datameer, SAS, Mathematica, Excel, Tableau, Cognos, MINITAB, SPSS

Other tools: Airflow, Collibra, MS Azure, AWS, Toad, SQL Developer, Remedy, Bugzilla, Rational Team Concert 4.0, Putty, SVN, Docker

PROFESSIONAL EXPERIENCE

Data Engineer

Confidential, New York

Responsibilities:

  • Designed and developed datalake and data pipelines to ingest data from various data sources and supporting analytics platform and data warehouse
  • Designed and developed a data encryption/decryption framework to secure sensitive data throughout the analytics platform
  • Implemented a metadata and data governance platform to provide data security and visibility throughout the analytics platform

Tech: Python, Meltano, Singer, Airflow, Snowflake, AWS, Great Expectations, Collibra, Docker, dbt

Data Engineer/Architect

Confidential, New York

Responsibilities:

  • Architected and designed data and analytics platforms and processes as part of enterprise digital transformation.
  • Migrated data from legacy systems to newly architected platforms; Netezza data warehouse, EDH and Oracle to Azure Synapse and Kudu Cloudera
  • Designed and developed ML proof of concept including Confidential Could Pak platform and customer segmentation use case
  • Designed and developed Client360 proof of concept including architecture, back-end and front-end

Tech: Django, Python, PySpark, Shell, Hive, HDFS, Oracle, MS Azure, Alteryx, Netezza, Mysql, SQL/PLSQL, Confidential Cloud Pak for Data, Cloudera Data Platform, Hue

Data Engineer

Confidential, New York

Responsibilities:

  • Market Analytics and Classification Machine Learning support processes
  • BI Dashboards and reporting and ETL jobs and processes
  • Data Lake in Hadoop environment and Hive injection processes
  • Datameer big data analytics processes and promotion scripts
  • Python scripts for data transformation and as utility functions
  • Shell script utility function

Tech: Python, Shell, Scala, Oracle, Hive, HDFS, Ab Initio, Datameer, SQL/PLSQL, Sybase, R, SVN

Data Engineer

Confidential - Washington DC

Responsibilities:

  • Data Engineer for a multi domain custom master data management solution.
  • Built data pipelines that ingested and migrated data from various sources in various formats and was consumed by 6000 different federal and commercial organizations
  • Developed Java code for data transformation and shell scripts for file management and operation
  • Implemented data governance by creating 4000 data quality rules
  • Performed extensive performance tuning and improvement of ETL jobs and processes

Tech: Shell, Java, Python, Oracle, SQL/PLSQL, Clover ETL

Graduate Research Assistant

Confidential, Mississippi State, MS

Responsibilities:

  • Areas of research and study: Statistics and Time Series, Metaheuristics and Simulation, Network Optimization and Graph Theory, Mathematical Programming, Stochastic Programming
  • Areas of application: Financial Engineering and Portfolio Optimization, Healthcare, Supply Chain and Contract Management.

Tech: C++, SAS, Cplex

We'd love your feedback!