We provide IT Staff Augmentation Services!

Data Scientist Ii Resume

Redmond, WA

SUMMARY:

  • Around 8+ years of extensive experience in IT industry, spanning across areas like Data Mining, Data Warehousing and Data Analysis, Data Modeling, Design, Development and Implementation of Business Intelligence solutions on Windows and UNIX environments in retail, media, telecom and financial domain.
  • Oracle certified associate in SQL and PL/SQL.
  • Strong expertise in Data Warehouse Architecture, Fact and Dimensional Data Modeling, Data Mining, BI Strategy, Performance Management Scorecards / Dashboards, Master Data Management, Conversions/ Migrations, ETL Architecture (Data Integration, Data Migration), OLAP / BI Reporting Tools, Data Quality / Metadata Tool
  • Strong expertise in Data Modeling (conceptual/local/physical) using Kimball Methodology.
  • Proficient in Dimensional modeling - Star/Snowflake modeling using ER-Studio and ERwin, Slowly changing dimension modeling, 3NF data modeling, OLAP, OLTP queries.
  • Expertise in Performance Tuning of the SQL queries & Optimization management.
  • Strong understanding of Relational Databases like Oracle, SQL Server and DB2.
  • Experience in data analysis and developing predictive models using statistical methods/ data mining methods linear regression, logistic regression, clustering, classification, decision trees, Support Vector Machine (SVM), naïve Bayes Classifier (NBC).
  • Extensive knowledge in Development, Analysis and Design of ETL methodologies in all the phases of Data Warehousing life cycle.
  • Extensive knowledge in Datastage Designer, Oracle Warehouse Builder, SQL queries, Oracle PL/SQL programming functions and procedures and packages.
  • Extensive experience in OBIEE repository (Physical, Business Model and Mapping and Presentation layers) for both Stand-Alone and Integrated OBIEE implementations
  • Experience in OBIEE providing end-to-end business intelligence solutions by dimensional modeling design, building business models, configuring metadata, creating reports and using dashboards.
  • Expertise in Data/Interface architecture for large scale telecom and financial OLTP/OLAP application environments.
  • Strong experience in Agile methodology(scrum calls, test driven development, continuous integration, sprint plan)
  • Strong understanding of Object Oriented Programming languages like Java
  • Expertise in assessing requirements both functional and non-functional
  • Driven by new challenges and desire to be successful in all endeavors.
  • Immensely enjoy navigating all aspects of complex projects.
  • Proficient in understanding business processes / requirements and translating them into technical requirements.
  • Team Player with excellent communication, analytical, verbal, writing and interpersonal skills.
  • Worked as a module lead and led a team of 5 members. Preparation of management reports, participated in sprint planning, burn down charts, capacity planning etc.

TECHNICAL SKILLS:

DATABASE: Oracle 11g, MS SQL SERVER 2012, DB2, Redshift, Cosmos

DATA MODELING: ER-Studio 9.5, Erwin

STATISTICAL ANALYSIS: R, Python, SAS E-miner 7.1, SAS Programming, Matlab, Jmp 8.0, Minitab

DATAWAREHOUSEING ETL: DataStage 8.1, Oracle Data Integrator (ODI) (11G), Oracle Warehouse Builder (OWB) 11g, Ab Initio

BUSINESS INTELLIGENCE: Oracle BI Foundation Suite 11g (dashboards, Answers, BI Office, BI Publisher Reports)LANGUAGES/SCRIPTS SQL, PL/SQL, JAVA, C, C++, Servlet/J2EE, UNIX Shell Scripts

DOCUMENT MGMT: Microsoft Office Products

CONFIGURE MGMT: SVM, Star Team

DB TOOLS: TOAD 7.2, SQL Developer

Other Tools: Altova XML spy

OPERATING SYSTEMS: UNIX, SCO UNIX, LINUX, VMS, Windows 95/98/NT, Windows 2000, XP

PROFESSIONAL EXPERIENCE:

Confidential, Redmond, WA

Data Scientist II

Responsibilities:

  • Responsible for designing predictive/ML models and implementing cutting edge algorithms for Microsoft Office data. (Clustering/ Classification / Regression)
  • Worked on seasonal sales trend analysis of Microsoft office products by using Time series, ARIMA models.
  • Determined the key factors/ leading indicators of user retention by analyzing usage pattern using collaborative filtering, logistic regression.
  • Develop Strategy for client to move on-prem to Azure. Develop Data Factories to extract data from various sources and populate Data Lake, Azure SQL and Cosmos DB
  • Determined the magic number of usage for user retention— A certain number of document actions of a new user can convert that user to a frequent user.
  • Acquire knowledge of innovative methods/ algorithms, tools from scientific journals and apply with scalability and portability.
  • Discover new data sources and improve data quality using normalization etc.

Technologies: R, Python, Cosmos

Confidential, Seattle, WA

Senior Data Modeler / Data Analyst

Responsibilities:

  • Responsible for data warehouse design, requirement analysis, data modeling and data mapping, complex SQL query writing, query tuning, report building(RPD) for projects in retail domain.
  • Worked on analyzing and creating reports on financial data for Amazon.com Hardlines business.
  • Worked with cloud data services such as Cosmos DB/Document DB and Azure Tables My responsibility includes data profiling and cleansing, ensure efficient data storage and retrieval in the data warehouse (Fact and Dimensional modeling) and apply different reporting methods (business intelligence) /statistical methods (Clustering, Classification) to generate vendor scorecard based on the sales, revenue, profit and other retail supply chain KPI metrics. Wrote complex SQL queries for data extraction, transformation.
  • Designed star schema (Fact and Dimension model) for item sales, revenue, profit, inventory, supply chain, order, demand-supply forecasting and search impression.
  • Worked on normalization and standardization of item attributes for consumer electronics products at amazon. Developed solution to improve item data quality by identifying items with bad item description, missing image, brand etc.
  • Worked on identification and fixing of wrongly categorized items (consumer electronics) in the catalog using keyword based probability scoring.
  • Worked on amazon streamlined vendor onboarding by creating a report on weekly progress of vendor onboarding and nudging them accordingly.
  • Communicating with business and gather requirements
  • Collecting data from various sources in the form of oracle, XML, flat files and bringing those into uniform format and normalization.
  • Create fact and dimensional data modeling for easy data retrieval.
  • Determine partition strategy and indexing for fact and dimensions involving TB of data.
  • Design the distribution and sorting for amazon redshift tables.
  • Configuration of redshift DB clusters and OBI
  • Design slowly changing dimension (SCD 2, 3, 4, 6) to store product deal information with audit tracking.
  • Create reporting for 300 users (including vendor managers, supply chain managers and brand specialists in amazon) on weekly sales, profit, revenue, inventory, product demand supply, search impression etc.

Technologies: Oracle 11g, Redshift, Oracle Business Intelligence, UNIX shell scripting, ER Studio

Confidential, Kansas City, MO

Data Modeler /Data Analyst

Responsibilities:

  • Worked on analyzing customer demographic data and their purchase pattern for creating a recommendation system.
  • My responsibility includes data profiling and cleansing, ensure efficient data storage and retrieval in the data warehouse (Fact and Dimensional modeling) and apply different statistical methods (Clustering, Classification) to generate a score against every customer.
  • Wrote complex SQL queries for data extraction.
  • Designed star schema (Fact and Dimension model) for ticket sales and revenue data.
  • My responsibilities include
  • Communicating with business and gather requirements
  • Collecting data from various sources in the form of XML, flat files and bringing those into uniform format.
  • Create fact and dimensional data modeling for easy data retrieval.
  • Design slowly changing dimension (SCD 2, 3, 4, 6) to store film booking information with audit tracking.

Technologies: Microsoft SQL Server 10.5, ER-Studio 9.5, SAS E-miner 7.1, Microsoft Excel, Oracle 11g, Altova XML spy

Confidential, Chandler, AZ

Data Modeler/Data Analyst

Responsibilities:

  • Responsible for Information Extraction and ensuring data quality and data integrity.
  • Data Analysis of large datasets in distributed database environments.
  • Preparation of predictive model for future forecasts using statistical/data mining methods-- linear regression, logistic regression, clustering, classification, decision trees, Support Vector Machine (SVM).
  • Optimized code development using SAS Eminer, Matlab and Jmp.
  • Extensive data analysis using SQL in MS SQL Server
  • Representation of bulk data using MS Excel power pivot.

Technologies: Statistical/data mining tools - SAS Eminer 7.1, Jmp 8.0, Matlab, Microsoft SQL Server, Microsoft Excel with Power Pivot

Confidential

Data Modeler/Data Analyst

Responsibilities:

  • Responsible for data modeling, ETL processes design/development, technical documentations, using Datastage 8.1/8.5, Unix scripts, Oracle, DB2,SQL scripts
  • Wrote complex SQL queries in Oracle to test ETL job functionality; Enhanced the performance of DataStage jobs by breaking down the complex jobs if necessary.
  • Wrote several complex SQL queries to extensively test the ETL process and user-defined Functions, Routines and Transforms in Datastage Basic to be used in derivations
  • Extensively used most of the Oracle Analytic functions (LISTAGG, PIVOT, UNPIVOT, LEAD, LAG and All Rank functions). Developed PL/SQL stored procedures, packages and triggers. Implemented DBMS ERRLOG model.
  • Managing Schema, Schema objects & Database changes for the application team
  • Developing views/SQL queries for Oracle Business Intelligence (OBI) reporting.
  • Performance monitoring, Performance analysis and tuning queries.
  • Database Tuning/Stats pack/AWR Reports & Application SQL Tuning (EXPLAIN PLAN), PLSQL profiling with stats collection techniques.
  • Optimized code development to confirm the data using object relational features, analytic SQL functions and bulk load arrays.

Technologies: Oracle 10G, TOAD, Datastage 8.1(ETL), DB2, Erwin, Oracle 10g (Database)

Confidential

Software/System Engineer

Responsibilities:

  • Data Modeling Logical/Physical (Erwin), creating de-normalized star and snowflake schema.
  • Designed the source and integration/staging layer for migrating the large volume of CRM and MDM Data in Oracle Database 10g using ETL (ODI/OWB), SQL, PLSQL, SQL*LOADER, UNIX script from various different sources.
  • Analysis and ETL of large datasets in distributed database environments (Oracle RAC). Applied concepts of partitioning, parallelism, global/local concept.
  • Optimized code development to confirm the data using object relational features, analytic SQL functions and bulk load arrays.
  • Developed PL/SQL stored procedures, packages and triggers.
  • Database Tuning/Stats pack/AWR Reports & Application SQL Tuning, analysis of EXPLAIN PLAN).
  • Developed custom reports/Ad-hoc queries using OBI Answers and Filters. Designed customized OBI reports/dashboards using aggregate (hierarchical) navigation, content filtering on user ID /user groups/user roles, guided navigation, drill down reports.
  • Configured OBIEE Repository, set connection pools, implement physical data model as per requirements, importing the tables and having the physical joins/keys required and changed the physical joins/key on the existing tables.
  • Designed Oracle Data Integrator ETL processes to collect and summarize data for reporting (Aggregate facts design).

Technologies: Oracle 10g, SQL, Oracle Data Integrator (ODI), OBIEE, Oracle Warehouse Builder (OWB) 10g, PL/SQL, TOAD, SQLLOADER Oracle business Intelligence (OBIEE), UNIX shell scripting was a part of software development team for one of the world’s largest data warehousing project --Enterprise Data-warehouse of British Telecom - Largest UK-based Telecom Company

Hire Now