Jr Data Scientist Resume
Plano, TX
SUMMARY:
- A career-minded professional with 2.5 years of IT experience in Data Science (Machine Learning, Text Mining), Data/Business Analytics, Data Visualization, Data Warehousing, and Data Governance & Operations.
- Experience in analytics, developing statistical machine learning and data mining solutions to various business problems, and generating data visualizations using R, Python and Tableau.
- Expertise in transforming business requirements into analytical models, designing algorithms, building models, and developing data mining and reporting solutions that scale across structured and unstructured data.
- Experience in applying statistical techniques including correlation, hypothesis modelling and inferential statistics, as well as data mining and modelling techniques using regression, classification, clustering and decision trees.
- Documented new data to support source-to-target mapping, and updated the documentation for existing data to assist with data profiling and maintain data validation.
- Implemented scalable statistical and predictive decision science models using machine learning platforms such as R and Python data science packages (Pandas, NumPy).
- Proficient in researching current processes and emerging technologies that need analytic models, data inputs and outputs, analytic metrics and user interfaces.
- Understanding of the Hadoop MapReduce and Amazon EMR big data frameworks.
- Mitigated risk factors through careful analysis of financial and statistical data. Transformed and processed raw data for further analysis, visualization, and modelling.
- Team builder with excellent communication, time and resource management, and continuous client relationship development skills.
TECHNICAL SKILLS:
Programming: Python, R, SQL, Command line
Development Tools: Amazon Web Services, Google Cloud Platform, Tableau, PowerBI, Jupyter Notebooks, Databases
Machine Learning: Azure Machine Learning, Regression, Classification, Clustering, Decision Trees
Techniques: Data Analysis, Data Mining & Cleaning, Business Analysis & Monitoring, Statistical Methods, Correlations, Association Test
PROFESSIONAL EXPERIENCE:
Jr Data Scientist
Confidential, Plano, TX
Responsibilities:
- Designed applications of machine learning, statistical analysis and data visualization for challenging large-scale data processing problems.
- Worked with databases such as Oracle and SQL, and performed computations, log transformations, feature engineering and data exploration in RStudio to draw insights and conclusions from complex data.
- Implemented predictive models using regression and classification machine learning algorithms, performed in-depth analysis of the structure of the models, compared the performance of all models, and found that the boosted decision tree algorithm gave the best predictions.
- Applied R-squared, RMSE and p-value in the evaluation stage to extract interesting findings through comparisons.
- Proficient in the entire data science life cycle and actively involved in all phases of the project life cycle, including data acquisition, data cleaning and data engineering.
- Used Azure Machine Learning to set up experiments and create web services for predictive analytics.
- Wrote complex SQL queries for data analysis using window functions and joins, and improved performance by creating partitioned tables.
- Prepared multiple dashboards using Tableau to reflect data behavior over a period of time.
- Analyzed and worked with all aspects of regression models (OLS etc.).
- Worked with stakeholders to troubleshoot issues, and communicated findings to team members, leadership and stakeholders to ensure models are well understood and optimized.
Data Scientist
Confidential, NJ
Responsibilities:
- Experience working on clickstream activity, customer journey activity, fraud detection, sales, and store item management.
- Used pandas, NumPy, Matplotlib and scikit-learn in Python to develop various machine learning algorithms.
- Experience with NoSQL databases such as MongoDB and Cassandra; utilized SQL and NoSQL databases, Python programming and API interaction.
- Experience using ETL and data visualization tools like PowerBI.
- Implemented classification using supervised algorithms such as logistic regression and decision trees.
- Performed data transformation from various sources, data organization, and feature extraction from raw and stored data.
- Involved in defining the source to target data mappings, business rules, and data definitions.
- Performed automation engineering tasks and implemented the ELK stack (Elasticsearch, Logstash, Kibana) for AWS EC2 hosts.
- Extracted source data from Oracle tables, MS SQL Server, sequential files and other databases.
Jr Data Analyst
Confidential
Responsibilities:
- Used Python programs to prepare, transform and harmonize data sets for modeling.
- Developed large data sets from structured and unstructured data.
- Performed ad hoc reporting, customer profiling and segmentation using Python.
- Tracked various campaigns, generating customer profiling analyses and performing data manipulation.
- Provided SQL programming, with detailed direction, in the execution of data analysis that contributed to the final project deliverables.
- Analyzed large datasets to answer business questions by generating reports and outcomes.
- Worked in a team of programmers and data analysts to develop insightful deliverables that support data-driven marketing strategies.
- Supported the testing team with system testing, integration testing and UAT.
- Involved in loading data from RDBMS and web logs into HDFS.
- Launched Amazon EC2 cloud instances using Amazon Machine Images (Linux/Ubuntu) and configured the launched instances for specific applications.
- Improved the performance of existing data warehouse applications to increase the efficiency of the existing system.