We provide IT Staff Augmentation Services!

Data Scientist Resume

5.00/5 (Submit Your Rating)

Cambridge, MA

SUMMARY

  • Data scientist wif extensive experience in application of statistical and machine learning methods to data analysis, predictive modeling, and natural language processing

TECHNICAL SKILLS

Statistical languages: R/S - PLUS, Matlab/Octave, SPSS Clementine/Text Miner

Programming languages/tools: Perl, C/C++, Hadoop, Pig, bash/ksh

Operating systems: Ubuntu/Linux, FreeBSD, Solaris, Windows XP/NT

Database systems: MySQL, Microsoft SQL Server, Oracle PL/SQL

Mathematical/statistical techniques: Hidden Markov models, k-means cluster analysis, time series analysis, neural networks, naïve Bayes classifiers, text mining, OLS/logistic regression, linear programming, simulated annealing, constraint satisfaction

PROFESSIONAL EXPERIENCE

Confidential, Cambridge, MA

Data Scientist

Responsibilities:

  • Analysis of reach data from cable TV subscribers for audience forecasting (Perl/MySQL)
  • Extension of Gurobi IP model for optimization of ad placement

Confidential, Roseville, CA

Senior Scientist

Responsibilities:

  • Analysis of e-commerce Web site metrics for incorporation into price optimization algorithms (Matlab/R/Perl/MS SQL Server)
  • Implementation of affinity analysis for retail transaction log data
  • Implementation of Web scraper for competitive price tracking

Confidential, San Francisco, CA

Data Scientist

Responsibilities:

  • Statistical prediction of stock market price movements from sentiment analysis of relevant Twitter messages
  • Sentiment analysis NLP algorithm wif SVM classifier for social media text
  • Trend detection algorithm for Twitter stream topics
  • Statistical NLP algorithm for topic extraction
  • Simulated annealing optimization of parameters of search result ranking function
  • Time window selection algorithm for search result presentation
  • Eigenvector centrality algorithm for Twitter and Google+ user influence ranking (implemented in Amazon Web Services Elastic MapReduce)
  • Scheduling algorithm for Google+ profile site crawler

Confidential, Sunnyvale, CA

Senior Researcher

Responsibilities:

  • Social network analysis for Web advertising behavioral targeting models
  • Cluster-based customer segmentation and user dynamics analysis from Web interaction data, using k-means and multinomial mixture probabilistic clustering
  • Model-based time series classifier for IP addresses used in spammer detection

Confidential, Santa Clara, CA

Senior Staff Engineer

Responsibilities:

  • Naïve Bayes text mining classifier for predicting risk of customer escalations of software defects from software defect reports
  • Neural network model for predicting defect escalation risk from defect characteristic data and product history
  • Design of model for prediction of system outage risk from semi-structured configuration data and maintenance and outage history

Confidential,Seattle, WA

Technical group leader

Responsibilities:

  • Hidden Markov model for predicting customer propensities from Web responses
  • Rasch adaptive-response model for Web-based interactive survey system based on response-curve parameterization and dynamic Bayesian question selection algorithms
  • Recommender system for cross-selling of consumer financial products

We'd love your feedback!