Data Science Developer Resume
2.00/5 (Submit Your Rating)
Houston, TX
SUMMARY:
- Data scientist interested in machine learning and predictive modeling with background in Math and Business. Willing to take on new challenges. Using critical thinking and models to solve business issues.
TECHNICAL SKILLS:
Machine Learning: Deep Learning, Linear & Logistic Regression, Classification, Clustering, Natural language processing, Sentiment Analysis, Recommender System
Database: SQL, MySQL, Mongodb, Prostgres MapReduce Hive, SAP HANA
Languages and Libraries: R, Python, Pandas, Numpy, Tensorflow, Scikit - learn, dplyr, tidyr
Reporting & Visualization: Tableau, SAP BO, Crystal Reports, SSRS.
Other: Computer Vision, AWS, Spark
PROFESSIONAL EXPERIENCE:
Confidential, Houston, TX
Data Science Developer
Responsibilities:
- COMPLETION OPTIMIZATION SPECIALIST - BUILD DIFFERENT MODELS TO OPTIMIZE PRODUCTION Using different Machine Learning tools and libraries: Rapidminer, H2O, Scikit-learn…etc
- Create analytics/machine learning driven outcomes for Oil and Gas industry
- Use data science techniques to identify potential efficiency/improvement opportunities in the areas of Design, Construction, Recovery and Production of Oil and Gas
- Use Tableau, Alteryx, UBL…etc. for data cleansing and exploration.
- Use Python Pandas for Data wrangling and enrichment
- Run Different set of algorithms to maximize well production (KNN, KMN, SVM, Gradient Boosting, Random Forest…etc.) using different tools and libraries: RapidMiner, Scikit-learn, Genix…etc.
- Working in an agile environment.
- Data wrangling and cleansing using Python scripts and Pandas library.
- Refactoring R code to Python for Data scalability
- Using NLP techniques on drilling logs to predict trips out of the holes.
- Using Sparklyr to scale application in AWS.
Confidential, Austin, TX
Data Science Fellow
Responsibilities:
- Developed a model to predict classes for tweets, using natural language processing, Random Forest, Naïve Bayes, KNeighbor and gradient boosted tree methods using scikit-learn Python Library.
- Text mining using NLP techniques to cluster based on topics thousands of articles from NY times.
- Using NLP techniques to predict sentiment analysis on tweets github.com/Redwa/Capstone
- Build simple TensorFlow graphs for everyday computations
- Apply logistic regression for classification with TensorFlow
- Design and train a multilayer neural network with TensorFlow
- Understand intuitively convolutional neural networks for image recognition
- Bootstrap a neural network from simple to more accurate models
- Access public datasets and use TensorFlow to load, process, clean, and transform data
- Use TensorFlow on real-world data sets including images and text
- Applying TensorFlow in various hands-on exercises using the command line
- Developed case studies predicting user churn and responses to ride sharing;
- Detecting event ticketing fraud
- Scraping websites to derive insights from data
- Applied distributed computing to approaching data problems—AWS, Spark, MapReduce
- A/B testing to compare website versions to maximize profit
- Data Analysis using Pandas and Graphing.
- Data Visualization Using Matplotlib.
- Pig for Wrangling Big Data.
Confidential, Dallas, TX
Data Science Fellow
Responsibilities:
- Worked on various datasets using Machine learning in R. Used Tableau to create Dashboards
- Using Machine Learning to Predict: US Elections, Supreme Court voting, Crimes in Chicago, Stock market, Census Bureau data, Flights delay, Grapes Harvest, Credit Card Fraud, Patients Insurance…etc.
- R Predictive analysis using linear regression, Logistic Regression, Classification, Random Forest, Time Series
- Using data.table, tidyr and dplyr libraries to clean and munge data
- Using ggplot2 and ggvis for visualization.