Data Scientist Intern (Analytics Group) Resume
SUMMARY:
- Web-scraped Amazon reviews and predicted customer ratings with 90% accuracy, using Python (NLTK, Sklearn) to parse the HTML source code and a Naïve Bayes model to make the predictions
- Reached the top position in a loan prediction competition (150 GB data set) by implementing a machine learning ensemble of a Random Forest model and a Neural Network model
- Evaluated the relationship between news publishing times and stock price jumps, using Python (Beautiful Soup) for web scraping and R for the machine learning prediction algorithm
- Launched the back end of a dating website by implementing a MySQL database holding 100 GB of data and developing a MapReduce script to perform profile matching
- Created a regression model for a human health data set after performing outlier detection and testing for multicollinearity and heteroscedasticity, then used stepwise regression to improve the model
- Executed a SARIMA time series model for a stock index and incorporated a GARCH model after testing for stationarity, heteroscedasticity, and white noise
TECHNICAL SKILLS:
Programming Languages: Python, R, MATLAB, SQL, SAS
Analytics Techniques: Machine Learning, Time Series, Linear Regression, Logistic Regression, Text Mining, MapReduce, Teradata, MongoDB, Tableau, Hadoop
PROFESSIONAL EXPERIENCE:
Confidential
Data Scientist Intern (Analytics Group)
Responsibilities:
- Analyzed the customer retention rate for one of Williams-Sonoma’s brands by modeling customer behavior with machine learning algorithms, using Python’s Sklearn and R’s glm function
- Worked in a Teradata (SQL) environment for data gathering (the data set consisted of 60 tables, 300 fields, and one million records)
- Performed statistical analysis using R’s statistical packages (car, ggplot2, e1071)
- Optimized search engine performance by improving recall and ranking using machine learning algorithms
Confidential
Test Engineer (Contract)
Responsibilities:
- Generated material characterizations for solar coatings using spectroscopy (FTIR, Vis-IR, LIBS)
- Implemented and maintained a SQL database of all coating-related test results, performed data analysis using MATLAB, and created data visualizations with Tableau
- Worked in a cross-functional, international research and manufacturing team consisting of French scientists, a Chinese solar manufacturer, and American scientists
Confidential
Product Development Engineer
Responsibilities:
- Served as engineering lead in the development of a consumer product ($10M project)
- Worked in an international, cross-functional team consisting of Procurement, Marketing, Sales, and Finance
- Communicated in Mandarin Chinese with Chinese manufacturers and in English with the U.S. marketing team
Confidential
Intern
Responsibilities:
- Worked with a femtosecond laser to develop an autocorrelator for measuring pulse length, and conducted helium ion imaging experiments