We provide IT Staff Augmentation Services!

Data Architect/big Data Consultant Resume

2.00/5 (Submit Your Rating)

SUMMARY

  • Data professional with over twenty years of experience in relational and MPP columnar databases, data architecture and cloud technologies.

AREAS OF EXPERTISE

  • MPP database SME
  • Data Architecture and Database Design
  • Vertica, Redshift, Snowflake, Oracle, Netezza
  • Cloud Computing, Big Data, HDFS
  • Data Pipelines, ETL, ELT
  • Data migration and data loading
  • Agile methodologies
  • Large - scale, enterprise level environments
  • Backup and Disaster Recovery
  • User training and mentoring

PROFESSIONAL EXPERIENCE

Confidential

Data Architect/Big Data Consultant

Responsibilities:

  • Investigated, compared and tested several MPP databases in order to identify the one best suited for the needs of the company. Ran several POC to help finalize the selection.
  • Worked with engineers, data scientists and analysts to identify required data structures, pipelines and workloads in order to source, load and transform data while preserving data integrity and security.
  • Created several multi-node Vertica clusters using AWS EC2 instances with attached EBS storage volumes. Eventually migrated the clusters to EON mode on AWS using S3 as storage area. Enabled AWS Athena to directly query data on S3.
  • Built and maintained physical structures, including external tables based on Parquet files. Created and maintained users, roles, access policies and resource allocation. Enabled LAP synchronization to control database access.
  • Migrated data from the legacy database to the new cluster using automation and a repeatable process. Developed scripts to load data from S3 and HDFS data lakes.
  • Created data distribution policies. Created and maintained data dictionary and naming conventions.
  • Tuned database and tables as well as user queries. Offered guidance and advice on best practices to developers and analysts.
  • Developed and implemented a backup and disaster recovery strategy. Established cluster monitoring and issue notification process.
  • Vertica, Snowflake, Redshift, Impala, Hive, HDFS, AWS S3, Python, Shell scripts

Confidential, Austin, TX

Data Architect, Vertica and Big Data Consultant

Responsibilities:

  • Executed health check on several Vertica clusters to identify issues.
  • Tuned resource pools and redesigned poorly performing projections.
  • Identified and tuned inefficient queries.
  • Worked with users and engineers to optimize ETL pipelines.
  • Established Vertica to Hadoop connectivity and Kerberized the databases.
  • Setup LDAPLink for user maintenance.
  • Contacted training sessions on Vertica basics and best practices.
  • Automated cluster maintenance using scripts and cron jobs.
  • Monitored clusters and recovered down nodes. Performed root cause analysis on failures.
  • Trained developers and users. Published Vertica best practices.

Confidential

Data Architect/Big Data Consultant

Responsibilities:

  • Hired as a Vertica SME to help with underperforming clusters.
  • Executed Vertica health checks, identified and fixed issues.
  • Modified resource pools to optimize resource utilization.
  • Tuned under-performing user and Tableau queries.
  • Re-designed table projections to use proper segmentation and partitioning.
  • Upgraded the cluster to the latest version and added new nodes to double the capacity.
  • LDAPLink, so Vertica users can be managed using LDAP.
  • Kerberized Vertica and setup loading from HDFS.
  • Established and maintained data dictionary and naming conventions.
  • Worked with users to optimize ETL processes.
  • Implemented data loading using partition/table swapping.
  • Automated maintenance tasks using shell scripts and cron jobs.
  • Offered training sessions to all users and DBAs on MPP database basics and best practices, including hands on labs.

Confidential, Atlanta, GA

Data Architect/Vertica SME

Responsibilities:

  • Vertica performance tuning
  • Projection Cleanup/Redesign
  • Proper segmentation and partitioning
  • Vertica cluster monitoring and maintenance
  • Vertica cluster health check
  • Add/replace/remove nodes
  • Vertica upgrades
  • Review existing code and provide recommendations
  • Vertica Best Practices
  • Training and mentoring developers and DBAs

We'd love your feedback!