Data Scientist Resume
CaliforniA
SUMMARY
- 2 years of hands on experience and seeking a position as a Data scientist that provides a platform to utilize my skills and experience gained from educational background and hands on experience.
- Strong computational background (complimented by Statistics/Math/Algorithmic expertise), a healthy portfolio of projects dealing with Big Data, solid understanding of machine learning algorithms, and with a love for finding meaning in multiple imperfect, mixed, varied, and inconsistent data sets.
- Experience in Data Science with expertise in Descriptive, Inquisitive, Predictive and Prescriptive analytics.
- Well versed in Linear/non - linear, regression and classification modeling predictive algorithms.
- Extracted, cleansed and analyzed complex data from multiple sources to find useful insights via data analytics using Big data Hadoop, Hive, SQL.
- Experience in writing hive scripts and shell scripting.
- Worked on tuning, optimizing the performance of SQL queries.
- Migrated data and scripts from SQL Server/Teradata to Hadoop.
- Created dashboards as part of Data Visualization using Tableau.
- Team player with logical reasoning ability, coordination and interpersonal skills.
- Proficient in SQL and programming languages like C, C++, Java, Python.
- Excellent written and communication skills.
TECHNICAL SKILLS
Statistics/Machine Learning: Univariate/Multivariate regression, Lasso, Ridge, Decision trees, Ensemble methods - Random forests, Gradient Boosting, ANOVA, Supervised learning, Unsupervised learning, Principal component analysis, K-Means, Hierarchical clustering, Bayesian learning, Support Vector Machine(SVM),Neural Networks, Time series Analysis, Feature selection and Linear programming, Recommender systems - collaborative filtering (user based, item based), Low rank matrix factorization, Natural Language Processing
Statistics/ML Programming/Programming: Python, R, Excel, Java
Databases/ETL/Query: Teradata, SQL Server, Oracle
Visualization: Tableau, ggplot2
Others: Hadoop, Hive, Spark, Sqoop, Pig, Hbase, Github, Jira, Agile
PROFESSIONAL EXPERIENCE
Confidential, California
Data Scientist
Responsibilities:
- Understanding the business requirements and collecting the required data.
- Performing cleaning of the data, treating missing values, outliers in Python.
Confidential, California
Jr Data Scientist
Responsibilities:
- Developed a personalized recommender engine for the client Confidential The products are recommended to the user based on the user preferences (questionnaire taken by the user when entered the website).
- The responsibilities involved are exploratory data analysis, modelling of the data. Used Excel for some part of the Data manipulation. Used python libraries such as Numpy, Pandas, Matplotlib, Scikit-learn.
- Involved in the preparation of the technical documentation.
- Built an affinity score matrix for the products and their corresponding questionnaire.
- Extracted data from Teradata/Oracle to Hadoop using sqoop and migrated SQL/Stored procedures to Hadoop.
- Developed hive scripts and spark applications using python.
- Used classification algorithms such as SVM, KNeighborsClassifier and Random Forests algorithms. Build these models in python. This enables the user to engage better.
Confidential
Data Analytics Engineer
Responsibilities:
- Extracted datausing SQL queries, cleaned, imputed missing values, and made the datasets ready for analysis.
- Used pandas, Numpy, Seaborn, Scipy, Matplotlib, Scikit-learn in Python for developing various machinelearning algorithms.
- Assisted in the development of customer segments using unsupervised learning techniques like KMeans clustering. The clusters helped business simplify complex patterns to a manageable set of 11 patterns that helped set strategic and tactical objectives pertaining to customer retention, acquisition, spend and loyalty.
- Created visualizations summarizing key insights using Tableau.
- Effectively communicated project status, risks, issues, and constraints to project management.
- Investigated data from varied sources, cleaned, transformed and performed Exploratory Data Analysis(EDA) with visualizations.
- Worked with other Business Analysts to ensure that business design is cohesive across all business areas impacted.
- Developed and implemented Predictive analysis using R for Management and Business users for decisions making process.
Confidential
Software Intern
Responsibilities:
- Assisted in maintaining records in the database satisfying the requirements of the user using Toad Application.
- Worked for the mapping of data into database objects and ER diagrams, database designs and tables.
- Involved in the development of the SQL scripts to Insert, Update and Delete data in MYSQL database tables.
- Ensured the stable state of the database by providing the optimized solution to the issues raised by the users.
- Enhancements are made on daily basis by expanding the database to meet the business requirements.
