Data Science Intern Resume
0/5 (Submit Your Rating)
NyC
SUMMARY:
- Built a decision support system (DSS) that extracts topics from online customer reviews and segments the market based on these review topics, other quantitative ratings and businessIDs. This DSS serves as a tool for (1) making targeted advertisements and personalized recommendations to customers and (2) competitor mining - shows which other businesses (hotels) the users/customers write similar reviews about. Used MongoDB, Python, Django and data from TripAdvisor.
- Used text network analysis to visualize large volumes of user-reviews (text) on Yelp.
- Identified network’s structural properties and quantitative metrics: Detected most central terms within user-reviews.
- Ascertained what kinds of terms are most important and influential in deriving meaning out of the other terms in the network.
- Used this data visualization for preliminary exploratory analysis of topics before running a topic modelling algorithm (LDA).
- Predicted creditworthiness of loan applicants using random forest, logistic regression and principal component regression classification algorithms in R programming language.
- Wrote scripts in Python to scrape and parse HTML data from Autotrader website to predict car prices.
- Designed and modelled relational databases, wrote SQL queries, developed ER data models for business operations of a bank, an online market place and an airline database.
TECHNICAL SKILLS:
SKILLS: R *Python *SAS *SPSS *Weka *Gephi *Matlab *MySQL *MongoDB *HDFS *Hive *Pig *AWS *Machine Learning algorithms *Statistics *NLP *Text Mining *Web Scraping using Python *JavaScript *Tableau *HTML *CSS *Excel
PROFESSIONAL EXPERIENCE:
Confidential, NYC
Data Science Intern
Responsibilities:
- Used Computer Vision and Machine Learning algorithms for Object (Logo) Detection and Recognition in Images.
- Used the Bag of Visual Words algorithm to predict if a given image contains a client's trademark logo.
- Brought in significant savings as this model will be used to automate the company’s Trademark Infringement Analysis and Prediction workflow, replacing a third party vendor the job was outsourced to.
Confidential, Hoboken, NJ
Research Assistant, Business Intelligence and Analytics
Responsibilities:
- Mining competitor information from user reviews on Trip Advisor using MongoDB and Python (PyMongo, NLTK)
- Wrote Python programs to scrape and parse data on user-profiles from Odesk website to extract users’ technical skills.
- Derived association rules between user-skill itemsets.
Confidential
Research Associate
Responsibilities:
- Designed statistically reliable questionnaires for large-scale data collection.
- Built and analyzed quantitative and qualitative (audio/text) databases on cognitive capacities of young children in India.
- Prepared research based technical reports for the funding agency.
- Handled parametric and non-parametric (ordered) statistical procedures.
- Conducted 150 semi-structured interviews to contribute to a database of 500 interviews.