Lead Data Scientist Resume
Helena, MT
PROFESSIONAL SUMMARY:
- Python, Tableau and R certified, highly accomplished Data Scientist with 10 years of experience, who drives data science projects by developing Prescriptive, Descriptive, Predictive solutions using analytical, statistical and machine learning strategies, with a result centric approach. Highly skilled in machine learning, data visualization and creative thinking.
TECHNICAL SKILLS
Programming Languages: R | Python | SAS | UNIX Shell | C++ | Java
Machine learning algorithms: Linear Regression | Logistic Regression | Random Forest | Decision Tree
Statistical Techniques: t - test | Chi-Square Test | ANOVA | Correlation | Hypothesis Testing
Applications: MS OFFICE
Tools: Teradata Studio Express | Google Analytics
Data Infrastructure: Cassandra | Teradata | Oracle | AWS EC2 S3 | Hadoop | Hive | HDFS MongoDB | MS SQL | MySQL | NoSQL
PROFESSIONAL EXPERIENCE
Confidential, HELENA, MT
Lead Data Scientist
Responsibilities:
- Developed multiple dashboards using Tableau for the business to interpret the effect of COVID-19 on non-payments and non-reporting of premiums and payrolls and also dashboards to identify Policy fraud, claim fraud and Medical Provider fraud.
- Developed a webs crapper in Jupyter using Python that can fetch information about upcoming events for the Insurance firm to build a strategy.
- Responsible for building machine learning models (Classification) to identify claims having high potential for fraud based on flags that are handpicked by the model as part of feature selection.
- Lead a team of data scientist, tableau developer and BDM engineer to create an end to end solution on prediction models, data visualization and data modeling.
- Managed complete client communications, expectations, and ensured on-time delivery of results.
Confidential, MCLEAN, VA
Data Scientist
Responsibilities:
- Successfully implemented a forecast model (ARIMA) in Python - Jupyter Notebook which will predict the revenue for channels that is used by the stakeholders to make an evaluation of the business performance.
- Implemented a LightGBM Regressor model with 90% accuracy to predict the revenue for each property.
- Created effective dashboards and stories quantifying growth and variance in revenue for the executives.
- Collected, compiled and created SQL Queries to join multiple tables, cross database joins to build dataset for analysis and modeling.
Confidential, SILVER SPRING, MD
Data Scientist
Responsibilities:
- Successfully implemented a dispatch reduction model which will predict the number of hours required for dispatch event.
- Increased the accuracy by 5% compared to the existing model and helped identifying slow, moderate and fast dispatches to route jobs based on dispatch efficiency.
- Worked closely with the business team and dispatch team to understand the complexities and addressed those using visual presentations.
- Used R programming, Python- Jupyter notebook and Tableau for creating models and visualizing data using matplotlib and seaborn.
- Collected and compiled data from MS SQL SERVER, ORACLE and Teradata into a single source for data analysis, data visualization and building models.
- Collaborated with the business team and Executives to work on Cancel Analytics for FiOS.
- Gathered requirements and data from CRM teams, worked with other cross functional teams to create visualizations on cancel trend analysis.
- Identified multiple factors affecting cancels and helped reduce cancels by 4% overall.
- Responsible for creating a data classification model to predict whether a customer would cancel service.
- Developed topic model using text mining to help implement/automate the topics discussed in chat bot.
- Used Tableau to generate cancel trend reports and R- programming and Python- Jupyter notebook for building train and test datasets and creating models.
Confidential, FAIRFAX, VA
Research Data Analyst
Responsibilities:
- Gathered data from Golden Corral and Ovation brands to analyze and understand the factors used to assess restaurant locations.
- Explored and manipulated data and created visualizations in Python - Jupyter Notebook using NumPy, pandas, matplotlib and seaborn.
- Created classification machine learning models and used the results to interpret the restaurants location for better profits.
- Identified 4 prime locations where the restaurant could garner more customers.
- Communicated the results to the executives in the form of Tableau reports and also provided visual representation of current mode of restaurant placements.
- Developed and executed predictive models to detect patterns and forecast currency exchange rate by analyzing time series data.
- Identified key factors influencing currency trends; performed exploratory data analysis; data pre-processing, classification, time series analysis; and K fold CV in R studio to identify the future currency trends.
- Used regression trees, decision tree, random forest, ensemble methods, and time series analysis.
- Conducted performance metrics analysis to find best model to forecast trends, along with generating reports in R studio, performing analysis using Tableau, and developing tidy dataset and data visualization in R using dplyr, tidyr, and ggplot2 packages.
- Worked on AdWords campaign management to consolidate data reports using SQL Server Management studio.
- Developed and modified database using advanced SQL queries
- Performed modeling, as well as extracting, transforming, and loading data from various source systems using SAS / SQL and SAS / macros.
- Maximized the ROI for AdWords campaign clients by 20% by optimizing the best fit keywords.
Confidential
Senior Project Engineer
Responsibilities:
- Worked on Google AdWords project for consolidating reports from internal data and ETL frameworks.
- Used SAS and SQL to perform ETL from Oracle database and created SAS datasets, including transferring and migrating data from source systems to SAS datasets for further statistical analysis using SAS EG.
- Created test structures in SQL and implemented performance tuning and query optimization by reviewing SQL Server / Instance Level settings of server and tuning it based on system workload.
- Served as ETL Developer and Quality Assurance Engineer to maintain claims data for the state government of AP.
- Participated in database design development, developed data structures to standardize data, generated reports, and created ETL specifications and SQL scripts for automating data processing.
- Loaded data files into SQL Server 2008 database tables; migrated Oracle tables into SQL Server database tables and generated reports with SSRS; and created ad-hoc reports, sub reports, linked reports, charts, and drill through and drill down reports.
