Principal Data Scientist Resume
Houston, TX
SUMMARY:
Seeking a Data Scientist position that will utilize my skills in building predictive models in Oil & Gas or a related energy industry. I am a Computer Scientist with 7+ years’ experience building machine-learning models using techniques such as ANN, Random Forest, SVM, Logistic Regression, and Bayesian methods (GMM with EM). Skilled in Predictive Analytics and application software, with analytical and system design skills. I hold a Master's degree in Computer Science - Artificial Intelligence and have well-honed abilities in statistical data mining. Programming in R, Python, and Java. Working skills in data management with Oracle, SQL Server, and MySQL. Good knowledge of NoSQL databases. Extensive experience in Oil & Gas Reservoir/Production Engineering Workflows.
SKILLS:
Data Science Modeling, Predictive Analytics, and Statistical acumen
ETL: Oracle 11g, MS SQL Server, SSRS, MySQL; Excel & Access; NoSQL: Cassandra; JSON Documents
Data Management: Structured & Unstructured Data Management
Web crawling, scraping, and Sentiment Analysis
Programming in R (R Notebook), Python (Python Notebook), and Java (NetBeans); regular expressions (RegEx)
Windows & Linux Ubuntu maintenance, software installation, and upgrades
Linear & Logistic Regression, SVR
Decision Trees, Bagging, and Boosting, SVM
SOM - Segmentation and Neural Networks
Web Services: Azure ML Studio NN Modeling
PMI Project Management approach
Experience working with APIs and integrating external data sources
Programming using R with tidyverse, plotly, readxl, imputeTS, stringr, rvest, and many others.
Oil & Gas Subject Matter Expert - Reservoir & Production Engineering Workflows – Technical Sales
EXPERIENCE:
Principal Data Scientist
Confidential, Houston, TX
Responsibilities:
- Gas Turbine Compressor Decay State Coefficient Prediction: an R project to diagnose and predict equipment faults across Oil & Gas (upstream, midstream, and downstream), Mining, Chemicals, Power, and Utilities.
- The project uses a dataset of 530K+ records and 30 predictors, applying PCA to deal with Near-Zero Values, calculating Multi-Collinearity among predictors, and building Neural Networks with TensorFlow in R.
- Unsupervised Cluster Analysis study of 13K records simulating a Chart of Accounts, to determine similarity among accounts for potential restructuring.
- Used Gaussian Mixture Modeling with Expectation Maximization, ran sensitivities across different covariance structures to select the best-fitting model, and applied Silhouette analysis to study the separation distance between the resulting clusters.
- Pilot Project in preparation for Chevron Project. Pilot coded in Python.
Technical Data Scientist Consultant
Confidential, Houston, TX
Responsibilities:
- Served as a Technical Consultant in Data Science, mainly for the Oil & Gas Industry, and produced various papers and documents on field performance optimization using Data Mining, Predictive Analytics, and Machine Learning algorithms for clustering, regression, and classification.
- Leveraged Unsupervised Learning algorithms (Neural Networks) and Well Segmentation to identify the best production performance signature among producing wells.
- Used R with ANN SOM and the ggplot, plotly, and stringr libraries.
- Developed a Sentiment Analysis study of the political situation in Venezuela (South America), covering Emotion and Polarity, using a ‘cloud’ distribution of emotions (anger, disgust, fear, joy, sadness, and surprise) based on a Bayesian algorithm included in the R library sentiment. Data was acquired through the Twitter API using R.
- Performance Prediction and Forecasting in the Permian Grayburg/San Andres formation using a Supervised ML Neural Network Model.
- Produced a model for performance prediction and forecasting on 1447+ wells with 30+ years of history.
Production Engineering Instructor
Confidential, Houston, TX
Responsibilities:
- Served on an as-needed contract basis as a Reservoir Engineering and Artificial Lift Instructor, teaching Production Optimization and Petroleum Economics in Houston, USA; Quito, Ecuador; and Kuala Lumpur, Malaysia.
Production Optimization Engineer
Confidential, Houston, TX
Responsibilities:
- Performed in-depth Data Mining of completion data, production performance profiles, and forecasting for conventional and unconventional fields.
- Used R, Excel, and MS SQL Server.
- Developed efficient database designs and created SQL Scripts, Triggers, Schemas, and Models in Oracle 11g.
- Maintained a SQL Server database for a giant unconventional oil field in Texas.
- Identified 138 potential candidates for re-fracking, re-connecting, and re-charging stimulated reservoir volume.
- Confidential needed an evaluation of possible candidates for further stimulation.
- Used a parametric model, pivot tables, and cluster analysis (Unsupervised Learning - SOM), built with Excel and VBA.
- Well Segmentation through Self-Organizing Maps (Kohonen Maps) in an Unsupervised ML model: a study to create segments of oil-producing wells with similar production performance signatures.
- Used an ANN SOM wrapped in Avocet Workflow Manager.
Production Engineering Business Development
Confidential, Houston, TX
Responsibilities:
- Performed Business Intelligence/Market Evaluation to develop new opportunities using Confidential solutions, leading to 30-40% revenue growth. Solved customers’ business problems using Confidential’s software solutions.
- Built databases through SQL Scripts, Triggers, Schemas, and Models in Oracle and MS SQL Server.
- Maintained and supported SQL Server databases for Confidential clients.
- Successfully completed Confidential Certification in Technical Sales for Production Software Solutions and Workflow Automation in Abingdon, England.
Production Eng. Team Lead
Confidential
Responsibilities:
- Built a Business-Critical Solution for ZADCO (JV ADNOC & XOM) for Downtime Monitoring and Operational Efficiency using Oracle 9i, MS SSRS, and Oilfield Manager (OFM), a Confidential commercial solution for Reservoir & Production Surveillance.