Data Analytics Practicum Intern Resume
Dallas, TexaS
SUMMARY:
Highly analytical and process - oriented student with 2+ years of experience interpreting and analyzing data in order to drive successful business solutions. Excellent understanding of business operations and analytics tools for effective analysis of data.
TECHNICAL SKILLS:
Programming Languages Databases Tools & IDE: Python, R, SAS, Java, C, Perl MySQL, Microsoft SQL Server, Oracle, Mongo DB Tableau, Power BI, Qlikview, STATA, MS Excel
Bi g Data Technologies: Statistical methods & Machine Learning
Web/Application Development: HDFS, Flume, Sqoop, Hive Regression, Classification, Clustering jQuery, JavaScript, HTML5, CSS3, JDBC Pig, Spark. Survival Analysis, ANN, Keras, Tensorflow, NLP JSON, Ajax
PROFESSIONAL EXPERIENCE:
Confidential, Dallas, Texas
Data Analytics Practicum Intern
Responsibilities:
- Analyzed various factors contributing to poverty and applied Classification models using Python
- Segmented the cities and applied clustering algorithms to analyze the high opportunity index for better targeting
- Developed and maintained ETL scripts to scrape data from external web sites and load cleansed data into a MySQL DB
Confidential
Senior Systems Analyst
Responsibilities:
- Used Predictive analytics and data mining techniques to forecast company sales of new products with a 95% accuracy rate
- Developed Python based API (RESTful Web Service) to track sales and performed sales analysis using Flask, SQLAlchemy and PostgreSQL
- Increased speed and memory efficiency by implementing code migration to convert python code to C/C++ using Cython
- Involved in ETL code using PL/SQL in order to meet requirements for Extract, transformation, cleansing and loading data from source to target data structures
- Used Bulk Collections for better performance and easy retrieval of data, by reducing context switching between SQL and PL/SQL engines
- Generated weekly and quarterly Metrics using Tableau to track trends, changes in business climate, and statistical variances
- Improved data mining processes, resulting in a 20% decrease in time needed to infer insights from customer data used to develop marketing strategies
- Tracked the site traffic and extracted monthly traffic reports from Google analytics to evaluate page performance of 4000+ products and to simplify the process of website redesign project
Telecom customer churn
Responsibilities:
- Determined the best classification model and predicted customer churn for the telecom dataset
- Achieved 80% accuracy by feature engineering and determined the most important features to predict customer churn
Fashion MNIST
Responsibilities:
- Performed data slicing and trained a CNN using Tensorflow and Keras for classifying clothes in the Fashion MNIST dataset and got a validation accuracy of 0.91 after training for 50 epochs.
Confidential
Data Analysis
Responsibilities:- Performed various Regression techniques using Python.
- Data imputation was done using Pandas.
- Performed data slicing and built random forest model using machine learning library scikit-learn.
- Created visually impactful dashboards in Tableau for data reporting by using pivot tables and VLOOKUP.
- Extracted, interpreted and analyzed data to identify key metrics and drawn conclusions.
Trending YouTube Video Statistics
Responsibilities:
- Performed extensive data analysis using R, by applying data mining techniques on factors considered for making the video trending.
- Used various regression and classification techniques like Linear Regression, Logistic Regression, LDA, KNN, CART to predict the categories of the videos.
- Implemented sentiment analysis and created word cloud on words that appeared frequently on comments