Sr. Database Engineer/data Scientist Resume
Westborough, MassachusettS
PROFESSIONAL SUMMARY:
- Data Scientist with 5+ years of professional experience using Data Analysis, Predictive modeling, Data Visualization, Statistical Analysis
- Involved in python supervised deep learning programs and passionate about learning unsupervised deep learning
- Proficient in SQL(SQL server, MySQL)
- Experienced in developing different Statistical Machine Learning, Forecasting, Text Analysis and Data Mining Solutions to the various business problems using Python, Tableau, SQL
- Proficient in Machine Learning and Deep Learning Models (Linear, Logistic Regression, Multivariate Regression, Random Forest, K - NN, SVM, Natural Language Processing, ANN, CNN, RNN, Xgboost, LightGBM)
- Familiar with Hadoop Ecosystem and Big Data Tools such as HDFS, HiveQL, Pig
- Expertise in Python programming using various frameworks including Numpy, Pandas, SKlearn, Scikitlearn, Tensorflow, Keras, Pytorch, CV2
SKILL:
Languages: SQL, T-SQL, R Programming Language, Python
Tools: Microsoft SQL Server Management Studio, SQLyog, Shiny, R Studio, Spyder, Git/Github
Big Data Technologies: Hadoop, MapReduce, NOSQL (Hbase), Pig, Hive
Machine Learning Algorithms: Decision Trees, Random Forests, Linear Regression, Logistic Regression, Naive Bayes, k-Nearest Neighbors (k-NN), Neural Networks, Support Vector Machines (SVM), Gradient Descent, Artificial Neural Networks (ANN), Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN)
Libraries: Pandas, Numpy, Seaborn, Scikit Learn, NLTK, Keras
WORK EXPERIENCE:
Sr. Database Engineer/Data Scientist
Confidential, Westborough, Massachusetts
Responsibilities:
- Implement analytical and statistical tools to collect analyze and interpret large data sets. Utilize this information to develop data driven solutions to solve business problem
- Perform statistical and predictive analysis for client business segmentation in python and use business intelligence tool (tableau) for final analysis
- Collaborate with business unit and team to determine the long-term strategies.
- Maintain database of electronic Health Records software for several healthcare practices on data storage systems like MySQL and MS SQL.
- Identify, analyze, and interpret trends or patterns in complex data sets using R Programming Language.
- Evaluate performance of employees and provide insight into making appropriate business decisions.
Software Developer
Confidential
Responsibilities:
- Designed model to Predict future churn rate of customers using Artificial neural networks with accuracy of 85% on validation set; adopted drop out regularization to achieve 86.67% accuracy
- Hunted 15% of credit card applications which were incorrectly flagged as non-fraudulent using Self Organizing Maps; Model was created to identify patterns in high dimensionality relationship
- Tackled structured, semi-structured and unstructured data sets, determined key takeaways and visualized using box plots, histograms to abridge via Seaborn, matplotlib library of Python and ggplot of R
- Created SVM classifier model to detect tone of customers on survey datasets of 15000 rows as positive or negative using NLTK and NLP techniques