Lead Big-data Engineer / Database Architect Resume
San Ramon, CA
SUMMARY:
- 15+ years of professional experience as a Database Architect and Administrator/Developer for Financial, Mortgage, Healthcare and Manufacturing databases on SQL Server 2008 R2 / 2005 / 2000 / 7.0, Greenplum 3.3.4/4.1, Oracle 10g, PostgreSQL 9.0/9.1.
- Areas of expertise and interest include Administration, Database Design, Performance Analysis, and Production Support for Large (VLDB) and Complex Databases.
- Good expertise in data modeling and metadata services.
- Hands-on experience includes administering large and complex Databases, Database Design, Normalization, Replication, Log Shipping, Full-text indexing, and Database Application Development using varied development tools.
- Extensively worked on ETL design and development.
- Expert in dimensional modeling of OLAP structures.
- Performance Tuning & Application Design.
- Hot and Cold Backups, Disaster Recovery Planning, Security Management
- Expert in writing PSQL/T-SQL/PL/SQL functions.
- Expert in handling Memory configurations for database servers.
- Implemented Polymorphic data storage, Multi-storage/SSD support in Greenplum databases
- Worked on implementing multi-level data partitioning with dynamic partition elimination in Greenplum.
- Worked on Out-of-the-Box Support for Big Data Analytics.
- Worked on High Performance gNet for Hadoop.
- Data modeling experience, including logical and physical data modeling using ERwin.
- Worked on Replication and Log shipping for SQL Server High Availability
- Extensively worked on Active-Active and Active-Passive Clustering
- Hands-on experience in Database Mirroring.
- Expert in parallel data loading services gpfdist, gpload.
- Experience working on data center servers and handling multi-terabyte databases.
- Experience in Batch processes, Import, Export, Backup, Database Monitoring tools and Application support.
- Proficiency and expertise in Backup/Recovery, Disaster recovery planning and Security Management.
- Troubleshoot performance problems, fine-tuning of databases and index Analysis.
- Monitoring tools and Application support.
- Experience in Oracle Database 9i/10g Administration.
- Created PL/SQL packages, procedures and functions.
- Good experience in configuring and troubleshooting IIS and XML.
- 24x7 on-call production database support.
TECHNICAL SKILLS:
Project Domain: Application Development, Systems Integration and Database Administration of MS SQL Server 2005/2000/7.0/6.5, PostgreSQL 9.0/9.1
Languages: Java, Visual Basic 6/5, ASP.NET, C, C++, C#, XML, HTML
Database: SQL Server 2005/2000/7.0/6.5, Greenplum 3.3.4/4.2/4.3.3.1, PostgreSQL 9.0/9.1, DB2 UDB 8.x, Access and Sybase
Tools: ActiveX, IIS, SSIS, DTS, MTS, LiteSpeed, Spotlight.
ETL Tools: Informatica 7.1.2/6.1.X.
Business Intelligence: Crystal Reports XI/10/9, Business Objects XI/6.5.1.
Operating System: Windows 2003/2000 Advanced Server, UNIX, Linux, NT & Windows 9x.
Other: Microsoft Office (Word, Excel, PowerPoint, Outlook), MS Visio, Visual Studio 2005, 2003.
PROFESSIONAL EXPERIENCE:
Confidential, San Ramon, CA
Lead Big-Data Engineer / Database Architect
Responsibilities:
- Extensively worked on HAWQ-Data Lake (Greenplum 4.2.0 build 1).
- Created external tables in Greenplum using HDFS, PXF.
- Fine-tuned data models to improve Greenplum report functions/queries.
- Worked on changing existing distribution keys to avoid data skew and improve performance.
- Extensively worked on ELT performance tuning using multiple gpfdist instances with the gpload utility.
- Worked with Greenplum features and architecture, including the shared-nothing MPP design and how Greenplum supports redundancy and high availability.
- Installed, initialized, validated, and configured Greenplum databases.
- Implemented Greenplum’s workload management for dynamic query prioritization.
- Designed and implemented table partitioning for handling very large tables.
- Loaded data into a Greenplum database instance using external tables, the COPY command, INSERT commands, and parallel load utilities.
- Extensively worked on gpfdist and gpload utilities for parallel data load and unload.
- Worked on system administrative tasks, managing and checking the state of the Greenplum databases and its distribution of data.
- Used EXPLAIN and EXPLAIN ANALYZE to identify and fine-tune SQL queries for performance improvements.
- Improved query performance by updating statistics and tuning the database for sampling size and error conditions.
- Developed and automated regular database maintenance jobs.
- Planned and implemented disaster recovery of Greenplum databases using EMC Data Domain.
- Installed and implemented Sqoop to pull data from various RDBMSs into Hadoop.
- Expert in writing complex PSQL queries.
- Solid experience in fine-tuning PSQL/Functions for better performance.
- Worked on setting up cron jobs for various data loads into Greenplum.
- Performed Linux Administration.
- Worked on Python scripts.
- Worked on writing MapReduce (MR) jobs to parse data in Hadoop.
- Good understanding of and experience working with Hive, Pig and HBase.
Confidential, Foster City, CA
Sr Greenplum / PostgreSQL Consultant
Responsibilities:
- Production Database Administration and Programming.
- Successfully completed 40-hour certified Greenplum and Hadoop training at a Pivotal (EMC) location, sponsored by the client.
- Extensively worked on ELT performance tuning using multiple gpfdist instances with the gpload utility.
- Worked with Greenplum features and architecture, including the shared-nothing MPP design and how Greenplum supports redundancy and high availability.
- Installed, initialized, validated, and configured Greenplum databases.
- Worked on upgrading DCA with enterprise level standards and best practices.
- Worked on end-to-end migration from DCA to a software-only Greenplum platform.
- Implemented Greenplum’s workload management for dynamic query prioritization.
- Designed and implemented table partitioning for handling very large tables.
- Loaded data into a Greenplum database instance using external tables, the COPY command, INSERT commands, and parallel load utilities.
- Extensively worked on gpfdist and gpload utilities for parallel data load and unload.
- Worked on system administrative tasks, managing and checking the state of the Greenplum databases and its distribution of data.
- Used EXPLAIN and EXPLAIN ANALYZE to identify and fine-tune SQL queries for performance improvements.
- Performed backup and restoration of Greenplum databases.
- Expert in writing complex PSQL queries.
- Set up and troubleshot cron jobs.
- Performed Linux Administration.
- Developed Python scripts to parse data.
- Won Big-Data Impact award from Cloudera.
Confidential, Dallas, TX
Sr Greenplum / PostgreSQL Consultant
Responsibilities:
- Database Administration and Programming.
- Troubleshooting performance issues using postgres and UNIX level commands to gather data.
- Shell scripting for database maintenance functions
- Closely worked with SME to understand requirements.
- Reverse-engineered data models to improve Greenplum function queries.
- Worked on changing distribution keys to avoid data skew and improve performance.
- Worked on end-to-end application-level performance tuning: redesigning data models, choosing proper distribution keys, and rewriting Greenplum function code.
- Extensively worked on ELT performance tuning using multiple gpfdist instances and available hardware.
- Worked with Greenplum features and architecture, including the shared-nothing MPP design and how Greenplum supports redundancy and high availability.
- Installed, initialized, validated, and configured Greenplum databases.
- Managed database objects and workload management processes by defining and managing roles, privileges, and resource queues.
- Implemented Greenplum’s advanced workload management for dynamic query prioritization.
- Designed and implemented table partitioning for handling very large tables.
- Loaded data into a Greenplum database instance using external tables, SQL COPY and INSERT commands, and parallel load utilities.
- Extensively used gpfdist and gpload utilities for data load and unload.
- Extensively used data manipulation language and data query language to access, manage, and query data.
- Performed system administrative tasks, managing and checking the state of the Greenplum databases and their distribution of data.
- Used EXPLAIN and EXPLAIN ANALYZE to identify and fine-tune SQL queries for performance improvements.
- Improved query performance by keeping statistics up to date and tuning the database for sampling size and error conditions.
- Planned and implemented disaster recovery of Greenplum databases using EMC Data Domain.
- Performed backup and restoration of Greenplum databases using gpcrondump, gp backup, gp restore utilities on EMC Data Domain.
- Improved query performance by following performance enhancement tips and database best practices.
- Expert in writing complex SQL queries.
- Expert in shell scripting to automate, monitor and perform administrative activities.
Confidential, IL
Greenplum / Postgres DBA / SQL Developer
Responsibilities:
- Database Administration and Database Programming.
- Played a key role in data modeling Data warehouse for reporting needs.
- Installed, initialized, validated, and configured Greenplum databases.
- Managed database objects and workload management processes by defining and managing roles, privileges, and resource queues.
- Designed and implemented table partitioning for handling large tables.
- Loaded data into a Greenplum database instance using external tables, SQL COPY and INSERT commands, and parallel load utilities.
- Extensively used data manipulation language and data query language to access, manage, and query data.
- Performed system administrative tasks, including managing and checking the state of the Greenplum database and the distribution of its data.
- Used EXPLAIN and EXPLAIN ANALYZE to identify and fine-tune SQL queries for performance improvements.
- Improved query performance by keeping statistics up to date and tuning the database for sampling size and error conditions.
- Performed backup and restoration of Greenplum databases.
- Distributed and stored data in Greenplum using a distribution key and partitioning.
- Worked on optimizing indexes for better performance and maintainability.
- Improved query performance by following performance enhancement tips and database best practices.
Confidential, Dallas, TX
Lead DBA/Developer
Responsibilities:
- Worked closely with end users to define requirements, resolve issues, troubleshoot problems and provide solutions for various applications and day-to-day business needs.
- Actively involved in data modeling a new star-schema-based database from legacy flat tables to improve performance, redundancy and scalability.
- Designed and Developed mortgage models, pricing models, tax reporting models.
- Wrote complex stored procedures, triggers, functions and T-SQL.
- Tuned stored procedures for performance improvement.
- Daily administration and tuning of SQL Server 2005 and 2008 R2 servers
- Designed, developed and implemented Databases from scratch.
- Involved in logical and physical data modeling.
- Suggested and configured High availability solutions like Database Mirroring, Clustering and log shipping.
- Designed, Created and Implemented complex SSIS packages for data warehouse
- Good knowledge in Mortgage data analysis and elements.
- Planned and Configured Team Foundation Server (TFS) for maintaining source code and version control.
- Created “Release Process” Documentation to establish a staged migration method from the Development to the Production environment.
- Release deployment lead responsible for mentoring other developers and ensuring best practices for coding and deployment were followed.
- Responsible for managing offshore DBAs and developers.
- Designed, Developed and Tested automation of Tax Reporting.
- Worked on Oracle OBIEE for designing and building complex reports.
- Responsible for performing a Proof of Concept using Microsoft Windows Azure to demonstrate cloud technology to the client.
- Good experience with SSIS and SSRS.
- Extensive use of Red-Gate SQL Server and Developer Tools
Confidential, Los Angeles, CA
SQL Server DBA/Developer
Responsibilities:
- Closely worked with business users in gathering requirements to provide key data for critical business decisions.
- Developed logical design and physical design
- Administered and maintained Production/Development servers with more than 2 TB of database size with complex structures.
- Designed a SQL Server performance monitoring report that gives management a detailed view of server performance.
- Worked with SQL Server Profiler to generate traces to find long-running stored procedures and queries to tune for peak performance.
- Integrated SQL Server Profiler traces with Performance Monitor for better graphical analysis to find system bottlenecks.
- Expertise in Fine tuning complex queries
- Expertise in setting up and troubleshooting Database mirroring.
- Optimized working process.
- Worked extensively on IMS performance issues, such as updating statistics and rebuilding indexes at regular intervals for better performance.
- Created database maintenance plans for SQL Server performance, covering database integrity checks.
- Configured up to 32 GB of memory.
- Developed customized reports requested by clients.
- Designed and deployed cubes in an SSAS environment using Snowflake and Star Schema designs.
- Generated reports using SSRS.
- Redesigned and deployed reports using Reporting Services 2008.
- Used Reporting Services (SSRS) to schedule reports to be generated at predetermined times.
- Created report models for Report Builder and managed reports using Report Manager.
- Created ad-hoc queries from SQL Server databases for custom reporting needs.
- Created sub reports, cross tabs, charts, ad-hoc reports using SSRS.
- Tuned servers to the peak for maximum performance by configuring memory and various server level parameters.
- Involved in developing C#, C++ code as a part of various transition projects.
- Hands-on experience in writing batch files.
- Extensively worked on extraction, transformation and loading of data from various data sources to SQL Server using SSIS.