We provide IT Staff Augmentation Services!

Jr Data Scientist Resume

4.00/5 (Submit Your Rating)

Plano, TX

SUMMARY:

  • A career minded professional with 2.5 years of IT experience includes in Data Science (Machine Learning, Text Mining), Data/Business Analytics, Data Visualization, Data Warehousing, Data Governance & Operations.
  • Experience in Analytics, developing different Statistical Machine Learning, Data Mining solutions to various business problems and generating data visualizations using R, Python and Tableau.
  • Expertise in transforming business requirements into analytical models, designing algorithms, building models, developing data mining and reporting solutions that scales across the structured and unstructured data.
  • Experience in utilizing statistical techniques which include Correlation, Hypotheses modelling, Inferential Statistics as well as data mining and modelling techniques using Regression, Classification, Clustering, Decision trees.
  • Documenting new data to help source to target mapping. Also updating the documentation for existing data assisting with data profiling to maintain data validation.
  • Implementing scalable Statistical & Predictive Decision Science Models using Machine Learning platforms like R & Python Data Science Packages (Pandas, NumPy).
  • Proficient in research of current process and emerging technologies which need analytic models, data inputs and output, analytic metrics and user interface needs.
  • Understanding on Hadoop MapReduce & Amazon EMR big data frameworks.
  • Mitigated risk factors through careful analysis of financial and statistical data. Transformed and processed raw data for further analysis, visualization, and modelling.
  • Team builder with excellent communications, time & resource management & continuous client relationship development skills.

TECHNICAL SKILLS:

Programming: Python, R SQL Command line

Development Tools: Amazon Web services Google Cloud Platform Tableau, PowerBI Jupyter Notebooks Databases

Machine Learning: Azure Machine Learning Regression Classification Clustering Decision Trees

Techniques: Data Analysis Data Mining & Cleaning Business Analysis & Monitoring Statistical Methods Correlations, Association Test

PROFESSIONAL EXPERIENCE:

Jr Data Scientist

Confidential, Plano, TX

Responsibilities:

  • Designed applications of Machine learning, Statistical Analysis and Data visualizations with challenging large data processing problems.
  • Worked with various databases like Oracle, SQL and performed the computations, log transformations, feature engineering, and Data exploration to identify the insights and conclusions from complex data using R - studio.
  • Implemented predictive models using machine learning algorithms Regression and Classification algorithms and performed in- depth analysis on the structure of models, compared the performance of all the models and found boosted decision tree algorithm gives best for the prediction.
  • Applied concepts of R-squared, R.M.S.E, P-value in the evaluation stage to extract interesting findings through comparisons.
  • Proficient in the entire Data Science life cycle and actively involved in all the phases of project life cycle including data acquisition, data cleaning, data engineering.
  • Used Azure Machine Learning to set up the experiments and creating Web services for the predictive analytics.
  • Worked on writing complex SQL queries in performing Data analysis using window functions, joins, improving performance by creating partitioned tables.
  • Prepared multiple dashboards using Tableau to reflect the data behavior over period of time Analyzed and worked with all aspects of regression models (OLS etc.)
  • Responsible for working with stakeholders to troubleshoot issues, communicate to team members, leadership and stakeholders on findings to ensure models are well understood and optimized.

Data Scientist

Confidential, NJ

Responsibilities:

  • Experience with working on clickstream activities, Customer Journey activities, Fraud Detection, Sales and managing Store items.
  • Used pandas, numpy, matplotlib, sci-kit-learn in Python for developing various machine learning algorithms.
  • Experience with NoSQL databases such as MongoDB, Cassandra and Utilized SQL, NoSQL databases, Python programing and API interaction.
  • Experience using ETL and data visualization tools like PowerBI.
  • Implemented Classification using supervised algorithms like Logistic Regression, Decision trees.
  • Data transformation from various resources, data organization, features extraction from raw and stored.
  • Involved in defining the source to target data mappings, business rules, and data definitions.
  • Performed automation engineer tasks and implemented the ELK stack (Elasticsearch, Kibana) for AWS EC2 hosts.
  • Extracting the source data from Oracle tables, MS SQL Server, sequential files and other databases.

Jr Data Analyst

Confidential

Responsibilities:

  • Experience with Python programs to prepare transform and harmonize data sets in preparation for modeling.
  • Developed large data sets from structured and unstructured data.
  • Performed Ad-hoc reporting/customer profiling, segmentation using Python.
  • Tracked various campaigns, generating customer profiling analysis and data manipulation.
  • Provided SQL programming, with detailed direction, in the execution of data analysis that contributed to the final project deliverables.
  • Analyzed large datasets to answer business questions by generating reports and outcome.
  • Worked in a team of programmers and data analysts to develop insightful deliverables that support data-driven marketing strategies.
  • Maintenance in the testing team for System testing/Integration/UAT.
  • Involved in loading data from RDBMS and web logs into HDFS.
  • Launching Amazon EC2 Cloud Instances using Amazon Images (Linux/ Ubuntu) and Configuring launched instances with respect to specific applications.
  • Performed performance improvement of the existing Data warehouse applications to increase efficiency of the existing system.

We'd love your feedback!