We provide IT Staff Augmentation Services!

Lead Big-data Engineer / Database Architect  Resume

4.00/5 (Submit Your Rating)

San Ramon, CA

SUMMARY:

  • 15+ years of professional experience as Database architect, Administrator/Developer related to Financial, Mortgage, Health care and Manufacturing databases on SQL Server 2008 R 2 / 2005 / 2000 / 7.0, Greenplum 3.3.4/4.1,Oracle 10g, PostgreSQL 9.0/9.1
  • Expertise and Interest include Administration, Database Design, Performance Analysis, and Production Support for Large (VLDB) and Complex Databases.
  • Having good expertise in data modeling a metadata services.
  • Specific hands on experience include administering large and complex Databases, Database Design, Normalization, Replication, Log Shipping, Full - text indexing, and Database Application Development using varied development tools.
  • Extensively worked on ETL design and development.
  • Expert in dimension modeling OLAP structures.
  • Performance Tuning & Application Design.
  • Hot and Cold Backups, Disaster Recovery Planning, Security Management
  • Expert in writing PSQL/T-SQL/PL-SQL Functions
  • Expert in handling Memory configurations for database servers.
  • Implemented Polymorphic data storage, Multi-storage/SSD support in Greenplum databases
  • Worked on implementing Multi level Data partitioning with dynamic partition elimination in Greenplum.
  • Worked on Out-of-the-Box Support for Big Data Analytics.
  • Worked on High Performance gNet for Hadoop.
  • Data modeling Experience like Physical data modeling and logical by using ERWIN.
  • Worked on Replication and Log shipping for SQL Server High Availability
  • Extensively worked on Active-Active and Active-Passive Clustering
  • Having hands on experience in Database Mirroring.
  • Expert in parallel data loading services gpdist, gpload.
  • Having Experience working on Data center servers and handling multi terabyte databases.
  • Experience in Batch processes, Import, Export, Backup, Database Monitoring tools and Application support.
  • Proficiency and expertise in Backup/Recovery, Disaster recovery planning and Security Management.
  • Troubleshoot performance problems, fine-tuning of databases and index Analysis.
  • Monitoring tools and Application support.
  • Experience in Oracle Database 9i/10g Administration.
  • Created PL/SQL packages, procedures and functions.
  • Having good experience in configuring and troubleshooting IIS and XML.
  • 24 X 7 Production Database on Call Support

TECHNICAL SKILLS:

Project Domain: Application Development, Systems Integration and Database Administration of MSSQL Server 2005/ 2000/7.0/6.5, PostgreSQL 9.0/9.1

Languages: Java, Visual Basic 6/5, ASP.NET, C, C++, C#, XML, HTML

Database: SQL Server 2005, 2000/7.0/6.5, Greenplum 3.3.4/4.2/4.3.3.1, PostgreSQL 9.0/9.1, DB2 8x UDB, Access and Sybase

Tools: Active X, IIS, SSIS, DTS, MTS, litespeed, Spotlight.

ETL Tools: Informatica 7.1.2/6.1.X.

Business Intelligence: Crystal Reports XI/10/9, Business Objects XI/6.5.1.

Operating System: Windows 2003/2000 Advanced Server, UNIX, Linux, NT & Windows 9x.

Other: Microsoft Office (word, excel, power point, outlook), MS Visio, Visual Studio 2005, 2003.

PROFESSIONAL EXPERIENCE:

Confidential, San Ramon, CA

Lead Big-Data Engineer / Database architect

Responsibilities:

  • Extensively worked on HAWQ-Data Lake(Greenplum 4.2.0 build 1)
  • Created external tables in Greenplum using HDFS, PXF.
  • Fine-tuned data models to improve Green plum report functions/queries.
  • Worked on changing existing distribution keys to avoid data skew and improve performance.
  • Extensively worked on ELT performance tuning using multiple Gpfdist instances using GPLOAD utility.
  • Worked on Greenplum features, benefits, and understanding architecture in terms of shared nothing and the MPP design and how Greenplum supports redundancy and high availability
  • Install, initialize, validate, and configure Greenplum Databases.
  • Implemented Greenplum’s workload management for dynamic query prioritization.
  • Designed and implemented table partitioning for handling very large tables.
  • Load data into a Greenplum database instance using external tables, copy command, insert commands, and parallel load utilities.
  • Extensively worked on gpfdist and gpload utilities for parallel data load and unload.
  • Worked on system administrative tasks, managing and checking the state of the Greenplum databases and its distribution of data.
  • Used EXPLAIN and EXPLAIN ANALYZE to identify and fine tune sql queries for performance improvements.
  • Worked on Improving query performance by updating statistics and tuning the database for sampling size and error conditions.
  • Developed and automated regular database maintenance Jobs.
  • Planned and implemented disaster recovery of Greenplum databases using EMC data domain.
  • Installed, implemented sqoop to pull data from various RDBMS to Hadoop
  • Expert in writing complex PSQL queries.
  • Solid experience in fine tuning PSQL/Functions for better performance.
  • Worked on setting up Cron jobs for various data loads into Greenplum
  • Performed Linux Administration.
  • Worked on Python scripts.
  • Worked on writing MR jobs to parse data in Hadoop.
  • Good understanding and experience working on Hive, Pig and HBASE.

Confidential, Foster City, CA

Sr Greenplum /PostgreSQL Consultant

Responsibilities:
  • Production Database Administration and Programming.
  • Successfully completed 40 hr. certified Greenplum and Hadoop training at pivotal (EMC) location sponsored by client.
  • Extensively worked on ELT performance tuning using multiple Gpfdist instances using GPLOAD utility.
  • Worked on Greenplum features, benefits, and understanding architecture in terms of shared nothing and the MPP design and how Greenplum supports redundancy and high availability
  • Install, initialize, validate, and configure Greenplum Databases.
  • Worked on upgrading DCA with enterprise level standards and best practices.
  • Worked on End to End migration from DCA to software ONLY Greenplum platform.
  • Implemented Greenplum’s workload management for dynamic query prioritization.
  • Designed and implemented table partitioning for handling very large tables.
  • Load data into a Greenplum database instance using external tables, copy command, insert commands, and parallel load utilities.
  • Extensively worked on gpfdist and gpload utilities for parallel data load and unload.
  • Worked on system administrative tasks, managing and checking the state of the Greenplum databases and its distribution of data.
  • Used EXPLAIN and EXPLAIN ANALYZE to identify and fine tune sql queries for performance improvements.
  • Perform backup and restoration of Greenplum databases.
  • Expert in writing complex PSQL queries.
  • Setting up cron jobs and trouble shoot.
  • Performed Linux Administration.
  • Developed Python scripts to parse data.
  • Won Big-Data Impact award from cloud era.

Confidential, Dallas TX

Sr Greenplum / PostgreSQL Consultant

Responsibilities:
  • Database Administration and Programming.
  • Troubleshooting performance issues using postgres and UNIX level commands to gather data.
  • Shell scripting for database maintenance functions
  • Closely worked with SME to understand requirements.
  • Reverse engineered data models to improve Green plum function queries.
  • Worked on changing distribution keys to avoid data skew and improve performance.
  • Worked on end to end application level performance tuning re designing data models, choosing proper distribution keys re write Greenplum function code.
  • Extensively worked on ELT performance tuning using multiple Gpfdist instances and available hardware.
  • Worked on Greenplum features, benefits, and understanding architecture in terms of shared nothing and the MPP design and how Greenplum supports redundancy and high availability
  • Install, initialize, validate, and configure Greenplum Databases.
  • Manage database objects and workload management processes by defining and managing roles, privileges, and resource queues.
  • Implemented Greenplum’s advanced workload management for dynamic query prioritization.
  • Designed and implemented table partitioning for handling very large tables.
  • Load data into a Greenplum database instance using external tables, SQL copy and insert commands, and parallel load utilities.
  • Extensively used gpfdist and gpload utilities for data load and unload.
  • Extensively used data manipulation language and data query language to access, manage, and query data.
  • Perform system administrative tasks, managing and checking the state of the Greenplum databases and its distribution of data.
  • Used EXPLAIN and EXPLAIN ANALYZE to identify and fine tune sql queries for performance improvements.
  • Worked on Improving query performance by keeping statistics up to date and tuning the database for sampling size and error conditions.
  • Planned and implemented disaster recovery of Greenplum databases using EMC data domain
  • Perform backup and restoration of Greenplum databases using gpcrondump, gp backup, gp restore utilities on EMC Data Domain.
  • Improve query performance by following performance enhancement tips and database best practices.
  • Expert in writing complex SQL queries.
  • Expert in shell scripting to automate, monitor and perform administrative activities.

Confidential, IL

Greenplum / Postgres DBA / SQL Developer

Responsibilities:
  • Database Administrator, Database Programming.
  • Played a key role in data modeling Data warehouse for reporting needs.
  • Install, initialize, validate, and configure Greenplum Databases
  • Manage database objects and workload management processes by defining and managing roles, privileges, and resource queues
  • Designed and implemented table partitioning for handling large tables
  • Load data into a Greenplum database instance using external tables, SQL copy and insert commands, and parallel load utilities.
  • Extensively used data manipulation language and data query language to access, manage, and query data.
  • Perform system administrative tasks, including managing and checking the state of the Greenplum database and its data and checking the distribution of data.
  • Used EXPLAIN and EXPLAIN ANALYZE to identify and fine tune sql queries for performance improvements.
  • Worked on Improving query performance by keeping statistics up to date and tuning the database for sampling size and error conditions.
  • Perform backup and restoration of Greenplum databases
  • Distribute and store data in Greenplum using a distribution key and partitioning
  • Worked on optimizing indexes for better performance and maintainability
  • Improve query performance by following performance enhancement tips and database best practices.

Confidential, Dallas, TX

LEAD DBA/Developer

Responsibilities:
  • Worked closely with end-users to Define Requirements, Resolve Issues, Troubleshoot problems and Provide Solutions for various applications and day to day business needs.
  • Actively involved in Data Modeling a new star-schema based database from legacy flat tables to improve performance, redundancy and scalability.
  • Designed and Developed mortgage models, pricing models, tax reporting models.
  • Wrote complex stored procedures, triggers, functions and T-SQL.
  • Tuned stored procedures for performance improvement.
  • Daily administration and tuning of SQL Server 2005 and 2008 R2 servers
  • Designed, developed and implemented Databases from scratch.
  • Involved in logical and physical data modeling.
  • Suggested and configured High availability solutions like Database Mirroring, Clustering and log shipping.
  • Designed, Created and Implemented complex SSIS packages for data warehouse
  • Good knowledge in Mortgage data analysis and elements.
  • Planned and Configured Team Foundation Server (TFS) for maintaining source code and version control.
  • Created “Release Process” Documentation to establish a staged migration method from the Development to the Production environment.
  • Release deployment Lead responsible for mentoring other developers and ensuring best practice for coding and deployment were followed.
  • Responsible management of offshore DBA and developers.
  • Designed, Developed and Tested automation of Tax Reporting.
  • Worked on Oracle OBIEE for designing, building complex reports.
  • Responsible for performing a Proof of Concept using Microsoft Windows AZURE to demonstrate cloud technology to the client.
  • Have good experience with SSIS, SSRS
  • Extensive use of Red-Gate SQL Server and Developer Tools

Confidential, Los Angeles, CA

SQL Server DBA/Developer

Responsibilities:
  • Closely worked with business users in gathering requirements to provide key data for critical business decisions.
  • Developed logical design and physical design
  • Administered and maintained Production/Development servers with more than 2 TB of database size with complex structures.
  • Designed a report on the monitoring of SQL Server Performance. This report gives a detail report to the management on the Performance of the server.
  • Worked with SQL server Profiler to generate traces to find long running stored procedures and queries to tune for peak performance
  • Integrated SQL server profiler traces with Performance monitor to have better graphical analysis to find system bottlenecks.
  • Expertise in Fine tuning complex queries
  • Expertise in setting up and troubleshooting Database mirroring.
  • Optimized working process.
  • Rebuilding the indexes at regular intervals for better performance
  • Extensively working on IMS performance issues like update statistics, rebuilding the indexes at regular intervals for better performance.
  • Created Database Maintenance Planner for the Performance of SQL Server which covers Database Integrity Checks.
  • Configured up to 32 GB of memory.
  • Was responsible to develop customize reports requested by clients.
  • Deploy/Design cubes in SSAS environment Snowflake/ and Star Schema designs.
  • Generated reports by using Repots using SSRS.
  • Redesign/ Deploy reports using Report Services 2008.
  • Used Reporting Services (SSRS) to schedule reports to be generated on predetermined time.
  • Created report models for Report Builder and managed reports using Report Manager.
  • Created ad-hoc queries from SQL Server databases for custom reporting needs.
  • Created sub reports, cross tabs, charts, ad-hoc reports using SSRS.
  • Tuned servers to the peak for maximum performance by configuring memory and various server level parameters.
  • Involved in developing C#, C++ code as a part of various transition projects.
  • Hands of experience in writing batch files.
  • Extensively worked on extraction, transformation and loading of data from various data sources to SQL Server using SSIS.

We'd love your feedback!