We provide IT Staff Augmentation Services!

Data Science Intern Resume

0/5 (Submit Your Rating)

NyC

SUMMARY:

  • Built a decision support system (DSS) that extracts topics from online customer reviews and segments the market based on these review topics, other quantitative ratings and businessIDs. This DSS serves as a tool for (1) making targeted advertisements and personalized recommendations to customers and (2) competitor mining - shows which other businesses (hotels) the users/customers write similar reviews about. Used MongoDB, Python, Django and data from TripAdvisor.
  • Used text network analysis to visualize large volumes of user-reviews (text) on Yelp.
  • Identified network’s structural properties and quantitative metrics: Detected most central terms within user-reviews.
  • Ascertained what kinds of terms are most important and influential in deriving meaning out of the other terms in the network.
  • Used this data visualization for preliminary exploratory analysis of topics before running a topic modelling algorithm (LDA).
  • Predicted creditworthiness of loan applicants using random forest, logistic regression and principal component regression classification algorithms in R programming language.
  • Wrote scripts in Python to scrape and parse HTML data from Autotrader website to predict car prices.
  • Designed and modelled relational databases, wrote SQL queries, developed ER data models for business operations of a bank, an online market place and an airline database.

TECHNICAL SKILLS:

SKILLS: R *Python *SAS *SPSS *Weka *Gephi *Matlab *MySQL *MongoDB *HDFS *Hive *Pig *AWS *Machine Learning algorithms *Statistics *NLP *Text Mining *Web Scraping using Python *JavaScript *Tableau *HTML *CSS *Excel

PROFESSIONAL EXPERIENCE:

Confidential, NYC

Data Science Intern

Responsibilities:

  • Used Computer Vision and Machine Learning algorithms for Object (Logo) Detection and Recognition in Images.
  • Used the Bag of Visual Words algorithm to predict if a given image contains a client's trademark logo.
  • Brought in significant savings as this model will be used to automate the company’s Trademark Infringement Analysis and Prediction workflow, replacing a third party vendor the job was outsourced to.

Confidential, Hoboken, NJ

Research Assistant, Business Intelligence and Analytics

Responsibilities:

  • Mining competitor information from user reviews on Trip Advisor using MongoDB and Python (PyMongo, NLTK)
  • Wrote Python programs to scrape and parse data on user-profiles from Odesk website to extract users’ technical skills.
  • Derived association rules between user-skill itemsets.

Confidential

Research Associate

Responsibilities:

  • Designed statistically reliable questionnaires for large-scale data collection.
  • Built and analyzed quantitative and qualitative (audio/text) databases on cognitive capacities of young children in India.
  • Prepared research based technical reports for the funding agency.
  • Handled parametric and non-parametric (ordered) statistical procedures.
  • Conducted 150 semi-structured interviews to contribute to a database of 500 interviews.

We'd love your feedback!