We provide IT Staff Augmentation Services!

Data Architect Resume

4.00/5 (Submit Your Rating)

ChicagO

SUMMARY:

  • Accomplished Data warehousing professional with a strong work experience Data science and BI Datawarehousing domain with.
  • Strong hands on experience in design, development, implementation and performance tuning on Large Scale Production systems.I strongly believe, with my ability to quickly learn and contribute; backed by my strong work experience in this area, I would be able to make a difference and take my skills to the next level.
  • Over 10 years of experience of working in Business Intelligence & datawarehousing.
  • Expertise in converting business needs into technical requirements and designing/ Architecting solutions.
  • Proficient in Teradata Query optimization , Performance Tuning.
  • Expert in Teradata tools & Utilities
  • Proven track record in planning, building, managing successful large - scale Data Warehouse and decision support systems.
  • 7+ years of solution design of ETL solutions using various tools like Datastage parallel edition on platforms such as Teradata, Oracle & Netezza.
  • Advanced scripting scripting skills like SQL & shell scripting.
  • Experience in dimensional data modeling, schema design, modeling tasks on small and medium sized projects.
  • Expertise in implementing SCD, surrogate key generation and other complex jobs in Datastage .
  • Vast experience in Data Integration, Data ingestion and datamart creation.
  • Good Hands on with BigQuery , Pub/Sub, Dataflow and other products on Google Cloud Platform .
  • Over 3 years of strong hands on experience in data science and machine learning projects.
  • Strong hands on in statistical programming frameworks such as R , Python, spark and tools such as SAS, Gretl etc.
  • Well versed with machine learning packages like nltk, sklearn, mllib, forecast, randomForest and visualization packages like ggplot2, seaborn etc.
  • Proficient in techniques such as linear and logistic regression , time series forecasting , clustering , classification, random Forrest etc.
  • Certified Teradata professional.
  • 3+ years of work experience in Agile Methodology
  • Good understanding and experience on Hadoop and spark ecosystem & related technologies.
  • Well versed with predictive modeling life-cycle and processes.
  • Good domain knowledge in Retail & Telecom.
  • Several Data Science, Machine learning and visualization courses. Ongoing Coursera Data Science specialization.

TECHNICAL SKILLS:

Analytics: SQL, R, python, SAS, spark, Tableau

ETL Tools: Data Stage

Scripting Languages: SQL, Unix Shell

Version Control: SVN, git

Cloud/Cluster: Google Cloud Platform( GCP ), spark

Database: Teradata, Netezza, BigQuery

Scheduling Tools: Control M, cron

OS: Linux, Windows, HDFS

Visualization: Tableau, R, Python

PROFESSIONAL EXPERIENCE:

Confidential, Chicago

Data Architect

Responsibilities:

  • Developed ETL pipeline for creating independent variables on demand in Teradata. interpretation of bandit algorithm to issue right offers to members. used twitter feeds and text analytics to build a purchase propensity model.
  • Developed a set of algorithms in R to find the best time series forecasting algorithm for a product line . setup Machine learning environment on google cloud. Setup jupyter notebooks, install kernels for R, python, spark. Confidential on MLLIB in cloud.

Confidential, Chicago,

Lead

Responsibilities:

  • Data munging, validation, pre & post processing large datasets, designing complex flows in Teradata to maintain consistency of the data.
  • A part of my role is to verify the technological approaches to a solution and come up with the best fit.
  • I have done Confidential work for tools like TDWM ( Teradata Warehouse Miner) & Hadoop multi-node system in the past.

Confidential, Chicago

ETL Architect, ETL Lead

Responsibilities:

  • I led the QA effort in its early stage and later on designed the production schedule using control M scheduler.
  • As an ETL Lead , implementation by designing ETL code & overseeing development in Datastage , Teradata & Hadoop.

Confidential, Chicago

Technical Lead

Responsibilities:

  • As a Technical Lead , Mainly responsible for high level technical design and functional design for small and medium scale projects in Confidential .
  • Estimating project resources and effort.
  • Level 4 point of contact for production issues.
  • Lead the offshore team consisting of developers & quality analyst.
  • Development of scripts for loading the data into the base tables in EDW and to load the data from source to staging and staging area to target tables using FastLoad , MultiLoad , TPT and BTEQ utilities of Teradata.
  • Writing scripts for data cleansing, data validation , data transformation for the data coming from different source systems.

Confidential

Senior Software Engineer

Responsibilities:

  • Involved in Requirement gathering, business Analysis, Design and Development, testing and implementation of business rules.
  • Analyzing the data model and designing the ETL Jobs.
  • Part of the team that developed internal audit system for EDW.
  • Proposing as well as implementing the ETL solutions.
  • Involved in designing the data flow diagram. Documented the mappings used in ETL processes
  • Logical and Physical Data modeling.

We'd love your feedback!