Data Scientist Resume

WORK EXPERIENCE:

Data Scientist

Confidential

Responsibilities:

  • Provide insights to customers of Confidential re: shipment details, customs clearance, and percentage of clearance.
  • Develop KPIs/SQIs for process improvement for entry-level Confidential employees using mathematical techniques such as Mahalanobis distance.
  • Create predictive models that provide insights on shipments/pieces that reach pre-clearance based on origin, destination, and seasonal effects.
  • Provide dashboards and data visualization using SAS Visual Analytics, and graphics via R Shiny/ggplot2.
  • Languages and tools: R, SAS, SAS VA, machine learning, decision trees, cluster analysis, predictive analytics
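
As an illustration of the Mahalanobis distance mentioned above, here is a minimal Python sketch, not the code used in this role. It assumes two hypothetical KPI columns per employee (the sample values are made up) and scores how far one observation lies from the group mean, taking the spread and correlation of the KPIs into account.

```python
import math

def mean_vector(rows):
    """Column-wise mean of a list of equal-length numeric tuples."""
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(len(rows[0]))]

def covariance_2d(rows, mu):
    """Sample covariance terms (sxx, sxy, syy) for two-column data."""
    n = len(rows)
    sxx = sum((r[0] - mu[0]) ** 2 for r in rows) / (n - 1)
    syy = sum((r[1] - mu[1]) ** 2 for r in rows) / (n - 1)
    sxy = sum((r[0] - mu[0]) * (r[1] - mu[1]) for r in rows) / (n - 1)
    return sxx, sxy, syy

def mahalanobis_2d(point, mu, cov):
    """Mahalanobis distance of a 2-D point from mu under covariance cov."""
    sxx, sxy, syy = cov
    det = sxx * syy - sxy * sxy
    if det == 0:
        raise ValueError("covariance matrix is singular")
    dx, dy = point[0] - mu[0], point[1] - mu[1]
    # Quadratic form d' * inverse(cov) * d, with the 2x2 inverse written out.
    q = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
    return math.sqrt(q)

# Hypothetical KPI pairs (made-up values, for illustration only).
kpis = [(0.0, 0.0), (2.0, 0.0), (0.0, 2.0), (2.0, 2.0)]
mu = mean_vector(kpis)
cov = covariance_2d(kpis, mu)
print(mahalanobis_2d((3.0, 1.0), mu, cov))
```

A large distance flags an observation as unusual relative to the group, which is one way such a score could feed a KPI.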

Data Science Consultant

Confidential

Responsibilities:

  • Led an initiative across the project life cycle to analyze 100 GB of proprietary data from a Fortune 100 company.
  • Purpose was to identify significant features associated with maximization of revenue.
  • Utilized machine learning techniques such as cross validation, Lasso regression, and linear regression.
  • Created a GUI for easy feature selection that uses the machine learning model above.
  • Deliverables are expected to be completed by end of the year.
  • Provide updates, reports, and dashboards for company.
  • Languages and tools: R, statistical programming, Lasso regression, bootstrap, cross validation
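
The combination of Lasso regression and cross validation listed above can be sketched as follows. This is a simplified pure-Python illustration, not the project's R code: it assumes standardized features, omits the intercept, and uses hypothetical function names and toy data. The Lasso is fitted by coordinate descent, and the penalty is chosen by k-fold cross validation.

```python
def soft_threshold(z, lam):
    """Shrink z toward zero by lam; values within [-lam, lam] become 0."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0

def lasso_fit(X, y, lam, n_iter=200):
    """Minimize (1/2n)*||y - Xb||^2 + lam*||b||_1 by coordinate descent."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            rho, zj = 0.0, 0.0
            for i in range(n):
                # Prediction from every feature except j (partial residual).
                pred = sum(X[i][k] * beta[k] for k in range(p) if k != j)
                rho += X[i][j] * (y[i] - pred)
                zj += X[i][j] ** 2
            beta[j] = soft_threshold(rho / n, lam) / (zj / n) if zj > 0 else 0.0
    return beta

def mse(X, y, beta):
    """Mean squared prediction error of coefficients beta on (X, y)."""
    preds = [sum(a * b for a, b in zip(row, beta)) for row in X]
    return sum((yi - pi) ** 2 for yi, pi in zip(y, preds)) / len(y)

def cv_choose_lambda(X, y, lambdas, k=3):
    """Return the penalty with the lowest mean k-fold validation error."""
    n = len(X)
    best_lam, best_err = None, None
    for lam in lambdas:
        total = 0.0
        for fold in range(k):
            train = [i for i in range(n) if i % k != fold]
            held = [i for i in range(n) if i % k == fold]
            beta = lasso_fit([X[i] for i in train], [y[i] for i in train], lam)
            total += mse([X[i] for i in held], [y[i] for i in held], beta)
        if best_err is None or total / k < best_err:
            best_lam, best_err = lam, total / k
    return best_lam

# Toy data (made up): the outcome depends only on the first feature.
X = [[1.0, 1.0], [1.0, -1.0], [-1.0, 1.0], [-1.0, -1.0]]
y = [2.0, 2.0, -2.0, -2.0]
print(lasso_fit(X, y, 0.5))
print(cv_choose_lambda(X, y, [0.0, 0.5, 3.0], k=2))
```

Features whose coefficients are shrunk exactly to zero drop out of the model, which is what makes the Lasso usable for feature selection.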

Research Data Specialist

Confidential

Responsibilities:

  • Created a database that stores data from cancer treatment protocols using customized VBA.
  • Served as primary database administrator, using SQL statements to extract data prior to analysis in SAS and R.
  • Provide reports and dashboards for protocol updates and patient progress through interventions.
  • Implemented imputation methods along with non-parametric techniques for protocol objectives.
  • Analyzed patient text responses using Multidimensional Scaling.

Data Scientist Consultant

Confidential

Responsibilities:

  • Analyzed large volumes of genomic data to locate associations with diseases.
  • 850,000 DNA sites sequenced from 2,000 persons, with 326 variables collected per person.
  • Used R to fit logistic regression models iteratively, testing 548 CpG sites for significant associations with asthma and covariates.
  • Used Lasso regression for significant variable selection out of 326 variables.
  • 28 out of 548 CpG sites were shown to be significantly associated with asthma outcome.
  • Prepared a comprehensive statistical report which included R generated figures.
  • Languages and tools: R, glmnet, Lasso regression, logistic regression, SAS, Python, statistical programming

Research Statistical Consultant

Confidential

Responsibilities:

  • Analyzed 14 GB of electrophysiological brain data via MATLAB and Neuroscan Curry.
  • Created customized MATLAB scripts for matrix manipulation of a 21 x 68 x 1,000,000 data array.
  • Utilized PCA dimensionality reduction prior to data analysis.
  • Analyzed data using SAS mixed modeling with Tukey post-hoc test adjusting for multiple comparisons.
  • Prepared comprehensive statistical report for supervisor which included R figures and plots.
  • Languages and tools: MATLAB, R, SAS, PCA, mixed modeling, glm, Neuroscan Curry, statistical programming

Data Scientist Consultant

Confidential

Responsibilities:

  • Provided statistical support for experimental designs such as nested RBD.
  • Analyzed significant features associated with performance outcomes using GBC.
  • Analyzed changes in proportions of reported outcomes using Cochran-Armitage Trend Test.
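
For reference, the Cochran-Armitage Trend Test named above can be sketched in a few lines of Python. This is an illustrative version only, with made-up counts and a hypothetical function name. It assumes ordered groups with equally spaced scores by default and uses the normal approximation for a two-sided p-value.

```python
import math

def cochran_armitage(events, totals, scores=None):
    """Return (z, two-sided p) for a linear trend in proportions.

    events[i] is the number of outcomes in ordered group i,
    totals[i] is the group size, scores[i] is the group score.
    """
    if scores is None:
        scores = list(range(len(events)))  # assumed equally spaced groups
    N = sum(totals)
    pbar = sum(events) / N  # pooled proportion under the null hypothesis
    # Score-weighted sum of observed minus expected event counts.
    t = sum(s * (r - n * pbar) for s, r, n in zip(scores, events, totals))
    sum_ns = sum(n * s for n, s in zip(totals, scores))
    sum_ns2 = sum(n * s * s for n, s in zip(totals, scores))
    var = pbar * (1 - pbar) * (sum_ns2 - sum_ns ** 2 / N)
    if var <= 0:
        raise ValueError("variance is zero; the trend is undefined")
    z = t / math.sqrt(var)
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail area
    return z, p

# Made-up counts: reported outcomes in three ordered groups of 10.
z, p = cochran_armitage([2, 5, 8], [10, 10, 10])
print(z, p)
```

A small p-value suggests that the proportion of reported outcomes changes steadily across the ordered groups rather than by chance.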
