Data Scientist/ Programmer Resume
4.00/5 (Submit Your Rating)
CaliforniA
SUMMARY:
- Excellent analytical and communications skills with ability to explain complex data to others. Skilled at translating business requirements to working code.
- R statistical programming:(AB Testing, MVT, multivariate statistics, statistical methods, data munging and regular expressions; design develop, maintain, and test code; 5 years experience)
- Predictive analytics: applied statistics, regression and classification, cluster analysis/market segmentation, classification, tree methods, advanced statistical learning/data mining
- SQL: T - SQL, sqldf, joins and subqueries, 4 years experience
- Excel, including pivot tables, VLOOKUP, and complex formulas; Microsoft Certified Expert (10+ years experience), and VBA macros
- Passed first actuarial exam (statistics)
- Basic knowledge of Spark. BerkeleyX: CS105x Introduction to Apache Spark and BerkeleyX: CS120x Distributed Machine Learning with Apache Spark courses
- Basic knowledge of SAS. Passed the advanced SAS programmer exam seven years ago but have not used SAS professionally. Currently reviewing SAS on an Oracle virtual machine
- Basic knowledge of Shiny (interactive R) and LaTeX
- See my web site for an expository statistics paper and an R coding data mining paper
PROFESSIONAL EXPERIENCE:
Confidential, California
Data Scientist/ Programmer
Responsibilities:
- Helped with editing of his upcoming book on regression, including running R code, checking statistical formulas, and organizing book chapters.
- Used R, SQL, Excel and Tableau to manipulate and analyze computer repair data
Confidential
Responsibilities:
- Primary data analyst at the company, responsible for data and statistical analysis.
- Designed developed, maintained, and tested R code
- Performed numerous SQL queries in SQL Server and discovered many actionable insights
- Main Project: Statistical Analysis and model building for the Search Engine Marketing (SEM)/Pay Per Clinic (PPC)/Google AdWords campaign which has a budget of several million dollars per year. Database is 100 million rows. Statistical analysis combined with outside the box thinking and actionable insights will save company over 1 million dollars per year.
- Statistical Methods and models: AB Testing/MVT, Binomial test, Wald Sequential Probability Ratio Test, Multiple Regression, Time Series, optimization for economic analysis
- Main tools for analyses and reporting: R, SQL, Excel, PowerPoint
- Performed a cluster analysis/market segmentation on domestic and foreign customers (millions of rows) so that firm can do targeted email campaigns
- Performed a time series trend analysis to measure the effectiveness of a TV campaign
Confidential
Responsibilities:
- Primary task was to create charts using the R programming language after obtaining data via Excel spreadsheets
Confidential
Responsibilities:
- The company had slow running code with files containing about 500,000 rows and more than 200 columns and taking over 24 hours to run
- Worked with the consulting firm’s senior data scientist, and used understanding of the gbm library and the boosting method to reduce the total run time to about an hour
- Performed testing of existing R code and then developed several code improvements
- Used SQL and Excel and other software to improve performance
- The algorithm was written in R and used SQL to retrieve data from an Oracle database
Confidential
Responsibilities:
- This start up hedge fund’s portfolio managers developed an investment strategy that produced large returns in a simulated investing environment. However the risk (variability) of the strategy was not known
- Used R, SQL, and Excel to first clean up and summarize the data and then wrote R code to solve the quadratic optimization problem that measured the risk
Confidential
Volunteer
Responsibilities:
- Programmed in R.
- Cleaned and manipulated data. Extracted, transformed, and loaded data from dbf files into R.
- Used SQL to transform and summarize data
- Did a regression statistical analysis.
- Wrote a report on the performance of Albany High School on the California API using the California Department of Education Database
Confidential
Responsibilities:
- This small start up was trying to develop a software product using C++ but needed to do some prototyping using R
- Started with financial data in Excel spreadsheet. Then wrote R code to price financial derivatives and to evaluate portfolio manager performance using Value at Risk (VaR) and other performance measures.
- Output included R objects, results written to Excel spreadsheets, and charts written to PDF files.
- Also wrote a users’ manual.
Confidential
Responsibilities:
- Helped to write an MRA concentration risk credit report that the FDIC requested. Analyzed expected and unexpected losses, probability of default (PD), loss given default (LGD) in a portfolio of loans.
- Used Excel (including Solver and Analysis ToolPak) to create a dashboard that will be used to analyze and optimize a portfolio of loans
Confidential
Responsibilities:
- Excel VBA macro project
- Wrote macros which organized data, created pivot tables, and created charts, saving the user hours of work each week
Confidential
Responsibilities:
- Excel project. Just a four day position. I finished it in three days and was told I was the fastest to ever do the job. I was also completely accurate.
- Organized Excel spreadsheets for government reporting.
- Wrote a VBA macro that reduced a five hour task to a five second task.
Confidential
Responsibilities:
- Excel VBA macro project
- The non-technical manager needed to have some Excel tasks automated via VBA. Then she needed technical support. Wrote the required code and then clearly explain how to use the macros.
Confidential
Responsibilities:
- A project manager had customer data for some major hardware products in an Excel spreadsheet. However, the data was very messy and there were no IDs available for the customers. The objective was to group the data by customer and then produce a variety of summary statistics
- The problem was resolved by first writing Excel VBA macros to clean the data. Then wrote additional macros to summarize the data by customer. The code ran in under two minutes
- Developed a users’ manual
Confidential
Responsibilities:
- Worked for the vice president of medical informatics who needed to be able to view summary data at the click of a button such as being able to select parameters to segment market data by region, type of medical facility, type of treatment and then see the requested output on a spreadsheet or summarized via chart.
- Problem was resolved by writing Excel VBA macros to create
- Used SQL to verify the correctness of the VBA macros
Confidential
Responsibilities:
- Worked closely with director of technical support who had an Excel file filled with movie theater data from around the world that came from multiple sources. The objective was to combine data for the rows which represented the same theaters.
- Originally task was expected to be solved by hand
- Proposed an alternative solution and then implemented it by writing Excel VBA macros which took less than 5 minutes to run and which reduced the workload on the data analysts by over 200 hours
- Wrote a users’ manual. Gave presentation to staff on how to use macros.
Confidential
Responsibilities:
- The company’s controller had financial data on their products in Excel spreadsheets.
- Reduced the size of the file by over 90%
Confidential
Responsibilities:
- Wrote Excel VBA macros that cleaned, organized and formatted spreadsheets of data that came from different marketing data sources
- Used VBA to create workbooks and worksheets and then populated them
- Used VLOOKUP tables, pivot tables, and complex functions.
- Wrote a users’ manual. And gave a presentation to staff on how to use macros.
Confidential
Responsibilities:
- The manager of sales needed to fairly reorganize the territories of staff
- Programmed mostly with VBA macros with some SQL and R
- Used pivot tables, VLOOKUP, VBA macros and complex solve the problem
Confidential
Responsibilities:
- The CFO had many Excel spreadsheets, some of which tracked the performance of the sales team and other spreadsheets were used for additional purposes.
- Created new, better organized spreadsheets and wrote Excel VBA macros that produced reports
- Wrote macros to generate reports
Confidential
Responsibilities:
- Worked under the marketing manager who had an Excel spreadsheet with data for every employee.
- Traditionally each month the manager created separate spreadsheets for each employee. There were several dozen.
- Solved the problem by writing Excel VBA macros that created reports for each employee and put each report into a separate spreadsheet.
- The macros reduced eight hours of work to the click of a button.
- Also used SQL to organize the data and check accuracy of work
Confidential
Responsibilities:
- Worked with a risk manager. Each month the manager received reports on Excel spreadsheets. The old reports had to in corporate the new information. This was time consuming and error prone.
- Resolved the problem by writing Excel VBA macros to compare and modify spreadsheets.
Confidential
Risk Advisor
Responsibilities:
- In this small start up, the CEO wished to analyze the performance of various portfolio managers and produce summary reports.
- Resolved the problem by writing Excel VBA macros that summarized risk performance.