Data Analyst Intern Resume
Sacramento, CA
SUMMARY
Seeking a full-time opportunity in data analysis, data science, or statistical consulting after graduation. Passionate about data and excited to explore the insights it holds. Extensive experience in data analysis and predictive modeling with R and Python; hands-on experience with big data tools and machine learning algorithms; expertise in database management, from database design to querying with SQL; skilled with data integration and BI tools such as Pentaho and Tableau; programming skills in C++.
TECHNICAL SKILLS
R: Three years of experience with Statistical Models, Machine Learning, Data Mining, and Data Visualization
Database: Microsoft Access, Oracle SQL Developer, MySQL, Database Design
Data Warehouse: ETL, Pentaho, KNIME
Big Data: Hadoop, MapReduce, HDFS, Hive, Spark, MongoDB
Python: NumPy, SciPy, Pandas, BeautifulSoup, Scikit-learn
Programming Tools: C++, Unix
Statistics Models: Hypothesis Testing, ANOVA, Regression, Predictive Modeling, DOE, etc.
Machine Learning: Regression, Decision Tree, Random Forest, Boosting, KNN, SVM, Neural Network, Clustering
Operations Research: Linear/Nonlinear Programming, Dynamic Programming, Stochastic Processes
Other: Tableau, SPSS, MS Office, GIS, CAD, Data Structures, Algorithms
PROFESSIONAL EXPERIENCE
Data Analyst Intern
Confidential, Sacramento, CA
Responsibilities:
- Proposed a machine learning algorithm, implemented in R, for predicting travel modes and activities from smartphone accelerometer and GPS data, lowering surveying costs by 30%
- Updated the 2012 CHTS database and improved query efficiency by 20%
- Designed and built the Linked Trip database and queried it using SQL
Research Intern
Confidential, Davis, CA
Responsibilities:
- Integrated and analyzed over 5 GB of data from the Census Bureau and the California State Government to predict election behavior with R and SPSS