We provide IT Staff Augmentation Services!

Principle Data Engineer Resume

2.00/5 (Submit Your Rating)

San Francisco, CA

SUMMARY:

  • 14 years business intelligence, data engineering, and data warehousing experience in the full SDLC.
  • Hands - on experience in Distributed systems, Cloud Computing, Big Data, NoSQL, Business Intelligence, ETL/ELT, and Social Network.
  • Experience working on very large database (VLDB) with high scalability / availability and implementing Best Practices in SQL Server (SSIS, SSAS, SSRS) /PostgreSQL /MySQL / Oracle databases.
  • Perform database monitoring, query/SP optimization, profiling and performance tuning using profiler, trace, DMV, Perfmon, index analysis, Execution Plan, Database Engine Tuning Advisor, and aggregation / demoralization etc .
  • Develop configurable / parameterized / modularized / asynchronous SSIS / DTS packages and C# / VB.Net / VBA programs, APIs, scripts, batch commands for ETL between databases and operational data stores/ various data sources (excel, csv, text, JSON, XML, flat files).
  • Data modeling and database and development for OLTP, OLAP (Star Schema, Snowflake Schema, Data Warehouse, Data Marts, Multi-Dimensional Modeling and Cube design), Business Intelligence and data mining.
  • Perform conceptual, logical, physical database design, schema design, and entity relationships diagram design using ER Studio, Visio, and Enterprise Architect.
  • Analyze / translate business needs and create functional/technical design document and develop in agile / SCRUM in regular or virtual environment with source control such as Git, SVN, TFS, and SharePoint.
  • Design and develop complex, highly scalable, N-tier (business logics, data access layer, SQL Server or Oracle back-end databases), mission critical web applications using C#, ASP.Net, JavaScript, jQuery, CSS, etc.
  • Apply advanced mathematical knowledge and skills learned during my two master’s programs that involved heavy math/statistical applications such as Multivariate Optimization, Metaheuristic, Simulated Annealing, Tabu Search, Genetic Algorithms, Markov Decision Process, etc.

SKILLS:

Languages/programming: SQL/T-SQL, PL/SQL, C#, C, Objective-C, C++, VC++ 6.0, VB6, VB.NET, VBA, Java, JavaScript, VBScript, HTML, DHTML, XML.

Database: SQL Server 7/2000/2005/2008/2012 , PostgreSQL,

Oracle, MySQL, SSIS, SSRS, SSAS, ER Studio.

Cloud Computing / Big Data: Hadoop, HDFS, Hortonworks, MapReduce, Pig, Hive, Scoop, AWS EC2,

Azure, NoSQL, Hbase, Cassandra, Neo4j Graph Database, Pentaho

Web technologies: JavaScript, ASP.Net, AJAX, .Net Framework, IIS, jQuery.

Network protocols: HTTP, FTP, Telnet, TCP/IP.

Operating systems: Windows 7/8/XP/2008/2012, Ubuntu Linux, Mac, iOS, MS-DOS.

Applications: SalesForce, TWiki, JIRA, Confluence, Alfresco, ArcGIS, Trapeze, Projects, SalesLogix, WebEx, ScriptLogic, Arena, Lindo, GNATS.

Source Control: Git, GitHub, SVN, TFS, VSS.

EXPERIENCE:

Confidential, San Francisco, CA

Principle Data Engineer

Responsibilities:
  • Took initiatives of data engineering efforts from beta to official product launch (won 2014 CES Award).
  • Designed data ETL solutions for multiple data source in a distributed computing infrastructure.
  • Analyzed large datasets and delivered summaries with clear graphical dashboards.
  • Designed/developed data warehouse and BI solution for sport data analytics and social media sharing.
  • Wrote complex algorithms and SQL queries to handle 100+ million records.

Tools and technologies: Amazon AWS, EC2, SQL Server, Azure Cloud, SSIS, SSRS, SSAS, BI, PHP, PostgreSQL, PostGIS, S3, Elastic Search, EMR, Talend, JAVA, VirtualBox, Big Data, Github, Conflunce, JIRA, Vagrant, Cloudformation, Nagios, New Relic, pgpl/sql, database replication, iOS, MacOS, Ubuntu, shell scripting, Powershell.

Confidential, Cupertino CA

Data Architect / DBA

Responsibilities:
  • Developed production data reports and analytics with automatic export to email or web dashboard.
  • Developed business intelligence system and data warehouse for usage analytics.
  • Developed two different data archiving strategies and archived data offloading by up to 94%.
  • Designed a database performance tuning framework and automated index defrag/rebuild.
  • Designed/developed the database for secured User Auto-enrollment and Self -recovery online system.
  • Designed/developed automated rolling window data migration/ archiving scripts and SQL Jobs with database partitioning and data profiling.
  • Designed/developed database versioning mechanism to ease software release management and upgradability.
  • Initiated and created Database Development Coding Standard and Naming Conventions.
  • Created db security objects including asymmetric/symmetric keys, certificates, and backups with Transparent Data Encryption (TDE).

Tools and Environment: SQL Server 2005/2008 R2/2012, Cassandra, Pig, Hive, Pentaho, AWS, SSIS, SSRS, SSAS, Mirrored Database / Cluster, data warehouse, OLTP, Visual Studio 2008/2010, ER Studio, Visio, Subversion, TFS, GNATS, Agile, C#, ADO.NET, XML, HTML, T-SQL, SharePoint, Beyond Compare, HDFS, VMware.

Confidential, Tampa FL

Data Architect / Technical Lead

Responsibilities:
  • Designed data systems for the social-local-mobile eCommerce system.
  • Developed ETL processes to move data between RDBMS and NoSQL data storage.
  • Designed social interaction based and content similarity based recommendation engine and user behavior modeling/analysis with machine learning.
  • Initiated the development and implementation of website user clickstream data analytics in Hadoop/Hive.
  • Designed, developed, and implemented complex SSIS packages, asynchronous ETL processing, Ad hoc reporting, and SSRS report server, and data mining in SSAS.
  • Developed / contributed to the coding standard, advocated database best practices and naming conventions.

Tools and Environment: Cassandra, PostgreSQL, Hive, Pig, Pentaho, AWS, HBase, SQL Server 2005/2008/2008 R2, Azure Cloud, VLDB, Oracle 10g, SSIS, SSAS, SSRS, OLTP, OLAP, BI, Visual Studio 2008/2010, TFS, Agile, TDD, BDD, ER Studio, Toad, ASP.NET, C#, JavaScript, jQuery, AJAX, XML, T-SQL,Visio, iPhone, iOS, Ubuntu Linux.

Confidential, Tampa FL

Data Architect / Programmer Analyst

Responsibilities:
  • Evaluated and proposed technical solutions with existing and emerging technologies such as Cloud Computing, Virtual Collaboration, and Social Networking.
  • Developed data warehouse, data marts, and data mining modules for the data analytics platform.
  • Developed web-based and Windows-based applications for Team Collaboration and Knowledge Sharing, Project Management, Budget Management, Task & Ticketing, and Grant Management System.
  • Designed/developed C# ASP.Net web applications for online survey, data collection, data-sharing, and data analysis.
  • Led or participated as key team member in the functional design, technical design, data modeling, data analysis, and performed technical writing and documentation.

Tools and Environment: SQL Server 2005/2008/2008 R2, PostgreSQL, ER Studio, T-SQL, SSIS, SSRS, SSAS, BI, ASP.NET, C#, jQuery, ADO/ADO.Net, TDD, Visual Studio, VBA, Visio, Access, Excel, XML, Windows Server 2003/XP, Dropbox, DimDim, Webex, VPN, Virtual Collaboration.

Confidential, Tampa, FL.

Sr. Software Engineer / Database Developer

Responsibilities:
  • Developed and managed multiple concurrent versions of code and product releases.
  • Designed/developed BI data models using star and snow flake schema for loan products.
  • Contributed to coding standards, best practices, functional and technical specifications design.
  • Initiated/conducted performance tuning and achieved 20-80% performance improvement for key systems.
  • Developed and/or optimized over 500 Stored Procedures, TSQL Queries, Indexes, Views, Functions, Triggers, scripts, DTS and SSIS Packages.
  • Designed complex SSIS Packages for Extract, Transform and Load (ETL) with data from different sources.

Tools and Environment: SQL Server 2005, SSIS, OLAP, OLTP, T-SQL, C#.NET, Visual Studio, .Net 2.0 Framework, VS Source Safe, ADO/ADO.Net, TDD, XML, N-Unit, Mercury, Visio, Novell Network, Windows Server 2003/XP.

Confidential, Tampa FL.

Programmer Analyst

Responsibilities:
  • I collaborated with Google on the innovative Google Transit and won the company award for the 1st successful implementation in the world. (Google Transit provides real time online trip planning and information around the globe, a cloud technology impacting our everyday life.
  • Contributed in a 10 million dollar project - the Intelligent Transportation Systems (CAD/AVL) project and developed ETL programs for data sharing among different systems such as Trapeze, MS Dynamics, and GFI.
  • Led a team effort to build ArcGIS 9.2 infrastructure and deployed the system in a distributed network topology.
  • Designed, developed, and improved 12 production databases-driven applications including Customer Service, Order/event Management, Fleet Management, Customer Survey, Employee Performance, HR, Performance Statistics, and Drug Test System.
  • Designed/developed an in-house software application for Outlook message importing/exporting/archiving.

Tools and Environment: SQL Server 7/2000/2005, Access, VBA, DTS/SSIS, Data Warehouse, Trapeze, ASP.NET, C#, VB.Net, Sybase, Windows API, OOP, IIS, APS, Fleet Watch, MS Dynamics, Windows Server 2000/2003/XP.

Confidential, Tampa FL, USA

Project Manager, Programmer Analyst

Responsibilities:
  • Managed project timelines and applied resource allocation optimization to meet priorities and deadlines.
  • Monitored status reports and tracked project progress and milestones on daily and weekly basis.
  • Managed globally distributed engineering team using IM, email, video conferencing, and Webex, etc.

Tools and Environment: MS Access, Project, Visio, ASP.NET, VB 6.0, Java Applets, JavaScript, HTML, CSS, IIS 5.0, COM, ADO, SQL Server 2000, Windows 2000/XP, VIM, Exceed(Unix/Windows), SalesLogix, and Visual Source Safe.

Confidential, Tampa FL.

Graduate Research/Teaching Assistant

Responsibilities:
  • Invented several new SQC algorithms in C using Regression Analysis and Multivariate Optimization.
  • Designed new mathematical models and conducted statistical analysis using Simulated Annealing/Tabu Search, Markov Decision Process, DOE, and ANOVA on Windows 2000 and Sun Solaris.
  • Redesigned and rebuilt Pilgrim’s (ranked Top 20 of Software Magazine's Annual Software 500 Companies) online customer service center database and GUI system in a master’s course project at USF.

Tools and Environment: VB, VC, C, SQL Server, Access, XML, Visio, Matlab, Arena, Lindo, UNIX and Windows.

Confidential

Graduate Research Assistant

Responsibilities:
  • Designed and developed a real-time Intelligent RSP Inspection System using Machine Vision, Image Processing, Dynamic Sampling, Computer Simulation, and Statistical Analysis in C programs;
  • Designed/developed a SPC software with graphic user interface for the automated quality inspection system using OOP, VC++6, and real-time image capture /processing.
  • Published several research papers in world top-ranked journals and conference proceedings.

Tools and Environment: MFC, Visual C++ 6, C, SQL, Access, Java, Windows NT 4.0/95/98.

We'd love your feedback!