Consultant/Big Data Architect Resume

San Francisco, CA

SUMMARY

  • I have more than twenty years of experience on top of twenty years of education.
  • I earned my first Computer and Systems Engineering (OR) Master’s degree in 1987 and then became a professor in the same department on the strength of outstanding achievements. After years of work on DSS systems and analytics projects for banks, post offices and other organizations, I spent four more years in several graduate programs in the Department of Math (Ph.D. in Numerical Optimization) and the Departments of CIS and ECE (MS in Digital Signal Processing and Distributed Computing, etc.) at the University of Florida (a top college in the Relational Database specialty).
  • I am a data expert and technical lead for Enterprise Data Strategy, Architecture and Data Modeling, Enterprise Data Integration (EDI), Enterprise Data Warehouse (EDW), Business Intelligence (BI), Metadata, Data Quality and MDM (Master Data Management), and Data Mining/Analytics/Data Science, with expertise from theory to practice, from database to applications, from OLTP to OLAP, from ETL to Reporting, from Analytics and Machine Learning to Data Mining, from MPP systems to the Big Data ecosystem (Hadoop, Yarn, MapReduce, Kafka, Spark, etc.), from architecture and DBA work to database development and performance tuning, and from development support to production support.
  • I am very experienced in large-scale projects in all phases of the development lifecycle including needs assessment, requirements gathering and definition, architecture/design and development, etc. I have worked on projects in Enterprise Data Strategy (including VLDB and Big Data) and Data Governance Roadmap, EDI/EDW and Data Marts architecture and development, MDM and Metadata Repository, ETL/BI tools selection, DW/ETL Best Practices, BI/enterprise reporting and analytical applications, etc.
  • Strategies, roadmaps, blueprints, architecture, security, modeling and development of Data/Database systems, MDM/CDI, EDW, Data Marts and Enterprise Metadata, Cloud/AWS EC2 and EMR, and the Big Data ecosystem, especially their overall system quality, security, performance and optimization.
  • RDBMS overall expertise. I have worked on almost all kinds of RDBMS, such as Oracle, Teradata, RedShift, Greenplum, SAP Hana, Netezza, DB2/UDB, MS SQL Server and Sybase.
  • NoSQL and Big Data Ecosystem Infrastructure. Hadoop Cluster, MapReduce and Spark, Kafka and Spark Streaming, Hive, Cassandra and HBase.
  • ETL systems. I worked on most of the main ETL tools like Informatica, Pentaho, DataStage, Talend, Ab Initio and SSIS/DTS with expertise on everything in Informatica 5 to 10 including BDE/BDM (Big Data Edition/Big Data Management) and cloud version: Designer/Developer, tasks/workflows, repository, server administration and Metadata Manager.
  • Business Intelligence, especially on Cognos and Business Objects.
  • Data profiling, data quality, data standardization, data security, data stewardship and governance.
  • Problem solving and troubleshooting with very strong analytic and logic ability.
  • Deliver projects with high quality on time and within the budgeted resources.
  • Top Skills on Oracle (up to version 11g) Architect/Data Modeling, DBA, and Database Development (SQL*Plus and PL/SQL).
  • Good training and solid skills on Teradata Data Warehouse including Teradata design, development and utilities like FastLoad, MultiLoad (MLoad), Teradata Parallel Data Pump (TPump) and Teradata Parallel Transport (TPT), etc. Fluent in Teradata SQL including Analytic functions.
  • Very experienced in MS SQL Server 6.5 to 2005 administration and T-SQL development as well as MS SQL Server Analysis Services (SSAS)/Analysis Manager/MDX.
  • Hands-on DB2/UDB (7.2 to 8.2) support for Data Warehouse and its source databases for PeopleSoft Finance/Oracle Financials, HRMS/HCM, EPM and CRM, and Ventive, etc. Experienced DB2/UDB Data Warehouse architect and ETL engineer, including DB2/UDB SQL and its analytic functions.
  • Fluent in ANSI SQL including the Analytic functions.
  • Solid experience on Hadoop/Big Data, Kafka, Spark and NoSQL databases like HBase, Hive and Cassandra, etc.
  • Efficient with all kinds of database tools (ERwin, Embarcadero ER/Studio and DBArtisan, Oracle Enterprise Manager, Oracle Management Package, SQL Navigator and Toad, DB2/UDB Control Center/Command Center, MS SQL Server Enterprise Manager and Sybase PowerDesigner, etc.).
  • Excellent ETL architecture, design and development with several tools like Informatica PowerCenter (version 5 to 9.6.x), Pentaho/Kettle, BODI and Talend, etc. Informatica PowerExchange, Repository Manager, Metadata Manager and Repository Server administration.
  • Data sources include, but are not limited to, mainframe VSAM files, flat files, XML files and data stored in all kinds of databases like Oracle, DB2/UDB, Sybase and MS SQL Server.
  • Data Warehouse Multi-Dimensional/Star Schema design and performance special skills.
  • Data Warehouse Metadata and Enterprise Metadata Repository design and administration experience.
  • MDM leadership on Modeling, data quality, data standardization, data security, stewardship and governance.
  • Business Objects design (Designer/Universe), Web Intelligence and SAP Hana Information Modeling; Cognos modeling with Catalog, Transformer/Cubes and Framework Manager, OBIEE/Siebel Analytics and Implementation, etc.

PROFESSIONAL EXPERIENCE

Confidential, San Francisco, CA

Consultant/Big Data Architect

Responsibilities:

  • Lead the overall project planning, design and development lifecycle in Scrum and other methodologies.
  • Roadmaps, POCs, Architect, Design, Data Modeling and Solutions of Data Security, Data Governance, Enterprise Database, Data Warehouse, Data Mart, and ODS (Operational Data Store), Data Lake, Cloud/AWS/EC2 and EMR Systems.
  • Big Data Processing and Analytics (with Python, Hadoop, Yarn, MapReduce, Kafka, Spark (with Python and Scala), Pig and NoSQLs like Cassandra, HBase, Hive, etc.)
  • Teradata, Oracle and Exadata, MySQL, RedShift, Vertica, Greenplum/PostgreSQL, SAP Hana, MS SQL Server, etc.: database design and development (SQL, stored procedures/packages, database utilities, etc.), logical and physical modeling, performance tuning and production support.
  • Architecture, ETL and Data Integration of all kinds of data sources, from in-house Customer, Account, Product/Service applications to commercial applications like OBIEE/Siebel Analytics/DAC, Oracle E-Business Suite/Applications (Financials (AP, AR, GL), Order Management (PO), HRMS/HCM), etc.
  • Design and development of the ETL (Informatica PowerCenter, BDE/BDM and Cloud, DataStage, Talend, Pentaho Kettle/PDI and BODI/SAP BusinessObjects Data Services) processes with type 2 and type 3 slowly changing dimensions.
  • Establish ETL Best Practices on all kinds of RDBMS and other data sources.
  • Performance tuning of the SQL statements used by ETL and BI, including analytic functions.
  • Serve as the consulting resource for SQL and database performance.
  • Lead the whole lifecycle of the Data Governance, Data Security, Master Data Management, Metadata Management, Enterprise Data Integration and BI implementation.
  • Provide MDM, Data Integration and BI Roadmap, Maturity Analysis/Assessment and solution strategies.
  • Provide solution with D&B MDM technologies together with RDBMS, ETL and BI technologies.
  • Lead Architecting, Designing and Data Modeling of all the related systems.
  • Provide expertise on database, ETL and BI best practice.

The technologies included, but were not limited to, Oracle 10g-11g, Teradata, Data Warehouse, NoSQL, OBIEE, ETL (Pentaho/Kettle, Informatica and BODI/SAP BusinessObjects Data Services), SAP Information Steward, Metadata Management, Data Service and Data Abstraction for SOA and MDM/CDI, etc.

Confidential

Sr. Manager, Enterprise Data Architecture/Data Warehouse/BI/MDM.

Responsibilities:

  • Lead overall project planning, design and development lifecycle.
  • Meet with end users and BAs to collect and understand requirements.
  • Author Enterprise Data Modeling Best Practice Standards and Guidelines.
  • Database Systems Architecture, Data Integration and Data Modeling of Enterprise Data, (Active) Data Warehouse, and Enterprise Metadata Management. Data modeling of the dimensional data warehouse, data mart, and ODS (Operational Data Store).
  • Establish Data Strategy, Policies, Stewardship, and Governance.
  • MDM/CDI and ETL Best Practice.
  • Design, deployment and implementation of the MDM Data Hub.
  • Data Quality and Data Anomaly Analysis.
  • Architecture, ETL/Data Integration of all kinds of data sources, from in-house Customer, Account, Product/Service applications to commercial applications like SAP, OBIEE/Siebel Analytics/DAC, PeopleSoft, Oracle E-Business Suite/Applications (Financials (AP, AR, GL), HRMS/HCM, Order Management (PO), Procurement), etc.
  • Design and development of the ETL processes with type 2, type 3 and hybrid slowly changing dimensions.
  • Data source analysis and profiling covering flat files, XML sources, Oracle, DB2/UDB, Sybase, MS SQL, Mainframe/VSAM and Teradata, applications including PeopleSoft Finance/Oracle ERP/Applications, Ventive CRM, SAP CRM, SAP Financial and SAP BW, etc., and self-developed applications.
  • Migration of Oracle, Informatica and Cognos, OBIEE/Siebel Analytics and in-house applications to Teradata.
  • Oracle queries migration to Teradata BTEQ Scripts.
  • Standardize ETL design, development, QA, implementation and production support.
  • Performance tuning of the SQL statements, including analytic functions, used by ETL, OBIEE/Siebel Analytics and BI (Cognos, Hyperion/Brio and Business Objects).
  • Oracle, Teradata, Greenplum, UDB/DB2, MS SQL Server, HBase, etc.: database design and development, logical and physical design and performance tuning.
  • Provide guidelines on the architecture of the BI tools and how to leverage the BI functions in the overall enterprise data warehouse systems.
  • Unix/Korn Shell scripting for database and Informatica command calls.
  • Production support, troubleshooting and problem solving for the above listed systems, etc.
  • Serve as the consulting resource for SQL and database performance.
  • Erwin Data Modeler and Model Manager/ModelMart, ER/Studio, PowerDesigner, and Rochade/Adaptive/Informatica MM (Metadata tools).
  • All kinds of database (Oracle, Teradata, Greenplum, MS SQL, etc.) utilities and tools.
  • Hadoop, HBase, and Hive.
  • Rational Data Architect/Rational Rose/UML, etc.
  • All kinds of ETL tools (Informatica, Ab Initio, DataStage, Talend, and SSIS, etc.)
  • All kinds of BI and OLAP tools (Business Objects, Cognos, MicroStrategy, Hyperion/Brio and SSAS, etc.)
  • IBM Information Server (including WebSphere DataStage, WebSphere Information Analyzer, etc.)
  • Siperian and Purisma/D&B MDM, etc., and FirstLogic for data cleansing.
  • SuperGlue/Metadata Manager and Rochade, etc.

Consultant/Data Warehouse Specialist

Confidential

Responsibilities:

  • Lead database and data warehouse architect, DBA and developer across the full lifecycle.
  • Redesign, remodel and development of the pre-built data warehouse/BI products.
  • Multi-dimensional data warehouse architecture design with conformed dimensions, facts, data marts, etc (Kimball Methodology).
  • ETL design and development for the above products for the clients.
  • Architect, modeler and database developer of the Next Generation QoS on Voice over IP system.
  • Provide consulting on the other next generation products on database design and performance improvement.
  • Implemented BI tools for clients all over the world (Cognos, Business Objects, Brio, MS Analysis Services, etc.) and created relational and dimensional OLAP models with all kinds of BI/OLAP tools.
  • Worked on Teradata database design and data loading for international telecom clients with FastLoad, MultiLoad (MLoad), TPump, etc.
  • ERwin data modeler and PowerDesigner.
  • ETL tools (Informatica, Oracle Warehouse Builder and Data Junction).
  • Database tools including Oracle and Teradata tools and utilities and Toad, etc.
  • BI tools like Cognos, Business Objects, Brio and MS Analysis Services/MDX, etc.
  • PL/SQL package, trigger and SQL*Plus.
  • Operating systems: Windows NT, Windows 2000 Server. Unix/Sun Solaris.

Confidential

DBA, Data Warehouse Architect and ETL Developer.

Responsibilities:

  • Data modeling (both relational/OLTP and OLAP/dimensional including dimensions, facts and data marts) and Data Architecture with ERwin, Designer/2000 and PowerDesigner (Kimball Methodology).
  • Data Warehouse Metadata and Enterprise Data Repository administration.
  • Korn Shell Scripts, PL/SQL, SQL*Plus and Oracle Utilities.
  • ETL with Informatica and Oracle plus Korn shell scripts.
  • Database logical and physical design and performance tuning (Oracle and MS SQL).
  • DBA on Oracle 7.3 to 8.1.6 and SQL Server 6.5 to 2000.
  • MS OLAP/Analysis Services/MDX dimensions and cubes design and development.
  • ERwin data modeler, Designer/2000 and PowerDesigner.
  • ETL tools (Informatica and Data Junction).
  • Database tools including Oracle Enterprise Manager, Oracle Management Package and Toad.
  • PL/SQL package, trigger and SQL*Plus.
  • Database tools including Oracle Enterprise Manager, Oracle Management Package and SharePlex database replication tool.
  • MS SQL Server 6.5 to 2000 and MS Analysis Services/Analysis Manager.
  • Windows NT Server. Unix/Sun Solaris.
  • Change control/management with VSS, etc.

Confidential

Sr. Software Engineer and DBA.

Responsibilities:

  • Collecting requirements and business rules from bank clients.
  • Retail banking operational data modeling with ERwin data modeler and PowerDesigner.
  • Data modeling and project coordination for the internal CRM and Marketing data warehouse with Epiphany.
  • Database logical and physical design, configuration, problem solving and troubleshooting.
  • Transact-SQL (T-SQL) development.
  • Database and application development support and performance tuning.
  • Database tools: MS SQL Server Enterprise Manager.
  • Operating systems: NT/MS Cluster Server administration.
  • Supported programming tools and language: VC++ 5 and Forte.
  • Change control/management with VSS, etc.
  • In charge of a Budgeting and Planning product that was sold worldwide including Confidential (large paper company in Spain), Ford, etc.
  • The next generation product design and development.
  • Data modeling and Oracle database DBA for the next generation product.
  • Database performance tuning, problem solving and troubleshooting.
  • ERwin data modeler and Oracle Designer.
  • PL/SQL package, trigger and SQL*Plus.
  • UML, OOD using Rational Rose and OOP using VC++ 5.
  • UNIX on HP, AIX and Sun Solaris.

Galaxy Institute of Computer and MIS (04/90-01/94)

(A top computer research organization and one of the earliest government contract service companies in China.)

Systems Development Manager for government (mainly banks) and industrial DSS systems.

  • Projects were all on DSS and operational information systems for Construction Bank, Commerce Bank, Post Office, etc.
  • Collect requirements and evaluate scope and cost for projects.
  • Project and systems development management on Decision Support Systems (DSS) and Business Operational Systems (BOS).
  • Operations and Reporting System modeling for Financial services.
  • Database systems design and data modeling.
  • Oracle 6 to 7.2 management and development with SQL*Plus, Pro*C, etc.
  • Programming with C/C++.
  • Operating systems: FACOM Mainframe, UNIX, DOS and Apple/Macintosh.

Xiamen University, Department of Computer and Systems Engineering (1987-1990)

Assistant Professor/Systems Development Manager, government and industrial contracts.

  • Projects were mainly on corporate and government operational and DSS systems.
  • Collect requirements and feasibility studies.
  • Project Development and Team Management.
  • Databases: dBASE III and IV, FoxBase and Oracle/SQL*Plus.
  • Application development with Pascal and C/Pro*C.
  • Operating Systems: FACOM Mainframe, UNIX and DOS.
