- Azure Data Lake, U-SQL, COSMOS, SCOPE
- All versions of SQL Server, SQL Azure
- SQL Server R Services
- SSAS, SSIS, SSRS
- Azure Application Insights Analytics (Kusto)
- Design, mentor, and implement a database solution for a real-time pipeline. Populate data from memory to the database in real time (1-5 seconds).
- Use Cosmos heavily for multiple batch-processing pipelines; analyze clickstream data (20 TB/day), e.g. top-N search words.
- Use Kusto to analyze logs, extract insights, and generate dashboards.
- Prototype integrating htmlwidgets with rWidget using different sources (e.g. Kusto interactive queries against big data). Mentor on and compare different visualization tools.
- Build end-to-end data pipelines: ETL, aggregation, business logic, and data flow through to visualization.
- Implement automated jobs that launch recurring big data jobs.
- Write a tool (CosmosSqlPopulator) to interactively browse and query big data in an SSMS-like way. Populate data from Cosmos to SQL Server or Excel.
- Performance tuning on OLTP and OLAP databases.
- Use Power BI and other internal visualization tools to generate dashboards for data pipelines.
- Windows Phone Business Intelligence projects on the MS BI stack: design, coding, and performance tuning.
- Visualize digraphs with DGML.
- Fix a long-standing issue in proactive caching.
- Write a C# tool to partition and process the cube. Write TDD.
- Performance tuning of ETL. Reduce average duration from 320s to less than 10s.
- Performance tuning of MDX. Optimize some queries from 12s to 2s, and others from 5s to millisecond level.
- Optimize DW and cube design for a real-time scenario (SLA of 5 minutes from source to cube).
- Write a workflow tool.
- Write stored procedures to populate data from the SSAS cube to a SQL Server database.
- Write a C# tool to export data from the database to Excel.
- Design and implement the database and data warehouse. Design and implement a scale-out sharding architecture to ETL heterogeneous feeds with differing granularities, formats, and duplication.
- Write SPs and UDFs; implement automatic partitioned table management, etc.
- Dimension, aggregation and partition design.
- Design and implement the SCD dimension.
- Cube performance analysis and improvement.
- Implement SSIS packages.
- Implement SSRS reports; write the MDX used in reports.
- Design and implement an automatic partitioned table management system.
- Design and implement an automatic system for cube partition management and for dimension and cube processing.
- Design and implement a workflow engine that automatically checks COSMOS stream and batch job status, submits batch jobs, downloads the aggregated files, and launches a workflow to merge heterogeneous sources into the main data flow.