Data Scientist Resume
Cambridge, MA
SUMMARY
- Data scientist wif extensive experience in application of statistical and machine learning methods to data analysis, predictive modeling, and natural language processing
TECHNICAL SKILLS
Statistical languages: R/S - PLUS, Matlab/Octave, SPSS Clementine/Text Miner
Programming languages/tools: Perl, C/C++, Hadoop, Pig, bash/ksh
Operating systems: Ubuntu/Linux, FreeBSD, Solaris, Windows XP/NT
Database systems: MySQL, Microsoft SQL Server, Oracle PL/SQL
Mathematical/statistical techniques: Hidden Markov models, k-means cluster analysis, time series analysis, neural networks, naïve Bayes classifiers, text mining, OLS/logistic regression, linear programming, simulated annealing, constraint satisfaction
PROFESSIONAL EXPERIENCE
Confidential, Cambridge, MA
Data Scientist
Responsibilities:
- Analysis of reach data from cable TV subscribers for audience forecasting (Perl/MySQL)
- Extension of Gurobi IP model for optimization of ad placement
Confidential, Roseville, CA
Senior Scientist
Responsibilities:
- Analysis of e-commerce Web site metrics for incorporation into price optimization algorithms (Matlab/R/Perl/MS SQL Server)
- Implementation of affinity analysis for retail transaction log data
- Implementation of Web scraper for competitive price tracking
Confidential, San Francisco, CA
Data Scientist
Responsibilities:
- Statistical prediction of stock market price movements from sentiment analysis of relevant Twitter messages
- Sentiment analysis NLP algorithm wif SVM classifier for social media text
- Trend detection algorithm for Twitter stream topics
- Statistical NLP algorithm for topic extraction
- Simulated annealing optimization of parameters of search result ranking function
- Time window selection algorithm for search result presentation
- Eigenvector centrality algorithm for Twitter and Google+ user influence ranking (implemented in Amazon Web Services Elastic MapReduce)
- Scheduling algorithm for Google+ profile site crawler
Confidential, Sunnyvale, CA
Senior Researcher
Responsibilities:
- Social network analysis for Web advertising behavioral targeting models
- Cluster-based customer segmentation and user dynamics analysis from Web interaction data, using k-means and multinomial mixture probabilistic clustering
- Model-based time series classifier for IP addresses used in spammer detection
Confidential, Santa Clara, CA
Senior Staff Engineer
Responsibilities:
- Naïve Bayes text mining classifier for predicting risk of customer escalations of software defects from software defect reports
- Neural network model for predicting defect escalation risk from defect characteristic data and product history
- Design of model for prediction of system outage risk from semi-structured configuration data and maintenance and outage history
Confidential,Seattle, WA
Technical group leader
Responsibilities:
- Hidden Markov model for predicting customer propensities from Web responses
- Rasch adaptive-response model for Web-based interactive survey system based on response-curve parameterization and dynamic Bayesian question selection algorithms
- Recommender system for cross-selling of consumer financial products