Chief Data Scientist Resume
Cupertino, CA
SUMMARY:
- I am the data lover.
- My expertise is to transform the data into insight to facilitate the decision - making processes.
- I find the hidden truth in data and turn into a good story or a valuable asset.
- At multilevel, I communicate with team members, collaborators, senior executives, manager and analyst in multiple projects.
- I am actively involved in a process of data generation, collection, reorganization and transformation.
- I am a self-starter.
- I ensure the execution and a successful completion projects as well.
- I am responsible for advanced statistical modeling using informatics tools such as machine learning framework, deep learning (caffe, computer vision) programing (R, Python, Scala) and business analytics tools.
- I have a proven track record of being successful and productive in multiple organization as demonstrated by 9 awards, 2 media coverage, and outstanding publications.
- I remotely help a startup company to establish its health care and life science division. In free time, I work on side projects as freelancer.
- Recently I am working on two outside projects with group of people.
- Total 8+ Years US working experience of data analysis of multiple projects.
- Proficient in statistical modeling, programming, prototype development using machine learning languages Python, R, Scala, Spark
- Hands on expertise of data preprocessing, data wrangling, and visualization tools (such as tableau/spotfire, SPSS modeler),
- Solid experience of neural network/deep learning framework and libraries - Tensorflow, Theano, Torch, Keras, Numpy, Metplotlib, Caffe, Scikit-learn
- Experience of solving machine learning problems such as predictive analysis, forecasting, text analytics, recommender system etc using SVM, Naïve Bayes, regression, clustering analysis
- Experience of computer vision, text mining using natural language processing (NLP), artificial neural network, recurrent neural network, convolution neural network analysis, graph analysis, stream analysis, deployment and maintenance.
- Ability to transform the data into a visual to facilitate decision-making processes.
- Communicate efficiently with all level professionals including analyst, architect, manager etc.
- Enthusiastic to accept new challenges in unknown area, ability of learn fast and gel with new culture.
- Published 3 article in machine learning (received IBRO award). Published 22 healthcare articles.
- Proven track record of handling multiple projects, prioritizing need and delivering results (9 awards).
- Strong oral and written communication skills for effective teamwork.
PROFESSIONAL SKILLS:
Artificial Intelligence and Machine Learning: Deep learning (supervised - artificial neural network, convolution neural network-caffe, computer vision, recurrent neural network; unsupervised-self-organizing map learning, Boltzmann machine, autoencoders), Regression (simple, multi and polynomial), cluster (logistic, KNN, Kernel SVM, Naïve Bayer, Decision tree, Random forest), Association rule (aprion, éclat), Reinforcement (UCB, Thompson sampling), Natural language processing, Dimension reduction (LDA, Kernel PCA), Density-based spatial clustering of applications with noise (DBSCAN), Naïve Bayes (NB), Amazon web service machine learning, Microsoft azure machine learning.
Machine learning framework, library and package: numpy, scipy, pandas, scikit-learn, statsmodels, mtaplotlib; tensorflow, torch, keras, theano; catools, ggplot, elemEstatLearn, rmarkdown, rsconnect, calibrate, biomaRt, libxml2
Big data analysis: Hadoop, Hive, Pig, Spark, HDFS, MadReduce, MySQL, NoSql, Pentaho, SPSS Modeler
Business Analysis Tools: Tableau, QIIME, Graph pad, SAS, fun rich, Reactome, Panther, Epic, InfoEd
Data visualization: 2D/3D plot, PCA, NMDA plot, α and β diversity, heatmap, decision trees, random forest, bubble plot
Data monetization: Deidentification of person information, data collection, data cleaning, data reorganization, maintain the privacy of a sensitive data while sharing the data, translating the raw clinical and preclinical data into exciting story to increase benefits or revenue using hadoop, NoSQL, Python, Pentaho etc.
Programming: Python, R, Scala, HTMB, JAVA, C++, PhP, Management studio server 2016, HTML5
Advanced statistics: ANOVA, logistic & linear regression, post hoc
GitHub and Bit Bucket: Manage files, create branch, and share projects/codes.
Communication, leadership and project management: Strong oral and written communication skills for effective teamwork and for leading projects productively and time-bound, alone and as a team. Experience of managing multiple projects, coordinating with teams and collaborators, designing hypothesis driven protocol, collecting and cleaning the data, communicating the results, writing reports/ grants, meeting deadline.
PROFESSIONAL EXPERIENCE:
Chief Data Scientist
Confidential, Cupertino, CA
Responsibilities:
- Designing protocol, data collection, preprocessing, cleaning, alogrithm, analysis and visualization
- Ability to make a story from analyzed data, wrote reports,
- Hands on experience of R studio, Python, SQL management studio, markdown, shiny, QIIME, base-space, ggplot2, graph pad, machine learning, big database tools;
- Statistical modeling and Data visualization - heat map, NMDA plot, PCA plot, box plot, bubble plot, 2D or 3D graphs, pi-chart, map etc
- Built my own code bank using Python and R script for many machine learning problems
Post-Doc/ Research Data Scientist
Confidential, Galveston, Texas
Responsibilities:
- Data production, collection, integration, management, data analysis, visualization, report drafting, and editing.
- Alogrithm, Regression, Correlation, Cluster analysis, Classification, Heatmap generation, Prediction analysis and several others.
- Statistical modeling and Data visualization
- I got experience IRB and FDA rules and regulation and patient consenting.
- Made story from analyzed data, wrote reports.
- I used Environment: R, graph pad, SPSS, ms-excess, Epic, InfoEd, SPSS, Hadoop, Spark, Pig, Hive, bioinformatics tools.
- Clinical and preclinical studies experience.
Senior Research fellow
Confidential
Responsibilities:
- Molecular mechanism of antiamnesic effect of bacopa monniera for attenuation of memory deficit
- Neuroprotetive effect of bacopa monniera in reveral of brain stroke
- Hypercoagulation state of acute brain stroke in Indian poluation
- Basic understanding of statistical analysis (including regression, correlation, 2D XY plots) and
- Hands on experience of statiscal package such as sigma stat, SPSS, MATLAB, systematic review
- Gain experience of imaging, MS office drafting reports and protocol,
- Collection, preprocessing, reorganization, integration, visualization of patients data
- Building communication and leardership skill with medical science community
Confidential
Responsibilities:
- Gain experience of memory deficit models, animal behavioral and drug evaluation
- Drafting reports, presentation, visualization and protocol
- Basic understanding of statistical analysis
- Hands on experience of statiscal package
- Collection, reorganization, integration of animal data
- Building communication and leardership skill with pharmaceutical science community