Data Scientist Resume
Richardson, TX
PROFESSIONAL SUMMARY:
- Over 7+ years of experience inIT industry on analytical programming using Python, R programming, Java, Django, Flask,database design and agile methodologies
- Over 4+ years of experience with Statistics, Data Analysis, Machine Learning using Python andRlanguage.
- Experienced in SQL programming and creation of relational database models.
- Experienced in creating cutting edge data processing algorithms to meet project demands.
- Worked with complex applications usingR, SPSS and Python to develop neural network algorithms, cluster analysis.
- Proficient in Tableau data visualization tool to analyze and obtain insights into large datasets, create visually powerful and actionable interactive reports and dashboards.
- Worked on complex KPI scorecards, heat maps, tree views, circle views, histogram visualizations and interactive dashboards to find the trend analysis of data.
- Worked with packages like matplotlib, Seaborn, Pandas in Python and ggplot2 and shiny in R to understand data and developed applications.
- Designed text classification tool using Scikit - Learn and Natural Language Processing (Spacy) in python to automatically scan documents and check the meets of policy
- Experience in analyzing Format data using Machine Learning algorithm by Python Scikit-Learn.
- Using Spacy processed the text code for deep learning and connected to statistical model
- Developed predictive models using Python to predict customers churn and classification of customers.
- Developed predictive models using Decision Tree, Random Forest and Naive Bayes.
- Automated recurring reports using SQL and Python and visualized them on BI platform like Tableau or QuickView.
- Worked with python libraries like matPlotLib, numPY, sciPY and pandas for data analysis
- Worked on Statistical models to create new theories and products.
- Experience working with statistical and regression analysis, multi-objective optimization.
- Designed and implemented supervised and unsupervised machine learning.
- Identify problems and provide solutions to business problems using data processing, data visualization and graphical data analysis.
- Solid knowledge of mathematics and experience in applying it to technical and research fields.
- Identifying areas where optimization can be efficient.
- Worked with clients to identify analytical needs and documented them for further use.
- Worked on python code and Scikit Learn showcasing machine learning for improving the forecast of business.
- Worked on Designing and configuration of the database and back end applications and programs.
- Connected python with Hadoop Hive and Spark and performed data analytics
- Worked with Amazon Web Services environment
- Strong experience working with databases like SQL Server 2008, Oracle, MS Access, No SQL databases like MongoDB and Cassandra
- Query optimization, execution plan and Performance tuning of queries for better performance in SQL.
- Extensive experience working with Django, Flask frameworks.
- Experienced in developing web-based applications using Python, Django, XML, CSS3, HTML5, JavaScript and JQuery.
- Experience in creating multiple Django apps and extensively used Django Session and management.
- Mastered in understanding of Python requests module and HTTP requests, HTTPResponseRedirect, response, methods, headers
- Excellent analytical & troubleshooting skills.
- Experienced the full software life cycle in Agile and Scrum methodologies
TECHNICAL SKILLS:
Languages: Python, R, Java, C++
Frameworks: Django, Flask, AngularJS, CSS Bootstrap
Databases: Oracle, MySQL, Postgres, SQLite3, NoSQL(MangoDB, Cassandra)
Web Technologies: JavaScript, HTML5, CSS, XML, AJAX
Application Servers: WebSphere Application Server, Apache Tomcat
Operating Systems: Linux, Unix, Windows, MAC
Web Servers: Apache, Nginx
SDLC: Agile, Scrum, Waterfall
PROFESSIONAL EXPERIENCE:
Confidential, Richardson, TX
Data Scientist
Responsibilities:
- Conducted research on development and designing of sample methodologies, analyzed data regarding the claim status, coverages and estimate cost of medical procedures
- Worked on Business forecasting, segmentation analysis and Data mining.
- Developed Machine Learning algorithms to find out the number of claims that are properly being claimed and suggest customers by providing insights for the smarter healthcare.
- Created reports to show the customers trends, insights, helping them to make decisions that advance patient care, reduce costs and improve health.
- Generated graphs and reports using matplotlib,Seaborn and pandas packages in python for analytical models.
- Developed and implemented R and Shiny application which showcases machine learning for business forecasting.
- Developed predictive models using Decision Tree, Random Forest and Naive Bayes.
- Performed time series analysis using Tableau.
- Using Tableau Desktop, created detail level summary reports and dashboards using KPI's and visualized trend analysis.
- Designed Tableau scorecard dashboards, stack bars, bar graphs, scattered plots, geographical maps, Gantt Charts, Lollipop Charts using tableau desktop 9.2.
- Used scikit- learn for machine learning and cross data validations.
- Implemented standardization of datasets one of the common requirement of machine learning using scikit-learn.
- Worked with spaCy library for deep learning. Using spaCy prepared text for deep learning and connected to statistical models and rest of application.
- Performed data analysis using python libraries like numPY, sciPY, pandas and matplotlib.
- Performed analysis using JMP
- Collaborating with dev-ops teams for production deployment.
- Worked in Amazon Web Services cloud computing environment.
- Connected to Bigdata and Spark using python. Used blaze library to connect to distributed environments
- Written connectors to extract data from databases.
Environment: Python, Tableau Desktop, scikit-learn, spacy, R Studio, Shiny, Amazon Web Services, Machine Learning, Tableau, Hadoop, Spark, JMP, Segmentation analysis
Confidential, San Francisco, CA
Data Science Analyst
Responsibilities:
- Experience working in project with machine learning, big data, data visualization, R and Python development, Unix, SQL
- Performed exploratory data analysis using numPY, matplotlib and pandas
- Expertise in quantitative analysis, data mining, and the presentation of data to see beyond the numbers and understand trends and insights
- Python numPY supports NumPY Arrays which has huge library for performing statistical calculations element-wise in lists.
- Python with pandas allows for fast analysis, data cleaning and preparation
- Expertise in working with multi-index and index hierarchy using pandas DataFrame.
- Visually plotted the data using matplotlib and Seaborn after performing analysis with pandas
- Using pandas DataFrame performed Groupby, merging and joining operations like in SQL
- Read date from different sources like CSV file, Excel, HTML page and SQL and performed data analysis and written to any data source like CSV file, Excel or database.
- Experience in using the Lambda functions like filter (), map () and reduce () with pandas DataFrame and perform various operations.
- Used Pandas API for analyzing time series.
- Creating regression test framework for new code.
- Designed and text classification tool using Scikit-Learn and Natural Language Processing (Spacy) in python to automatically scan documents and check the meets of policy
- Using Spacy processed the text code for deep learning and connected to statistical model
- Using R packages performed data analysis and connected to Tableau Desktop to visualize the same.
- Proficient in handling huge data and performing creating, reading, updating and deleting (CRUD) operations on MongoDB using PyMongo module
- Developed and handled business logic through backend Python code
- Created templates for page rendering and Django views for the business logic.
- Used DjangoREST framework and integrated new and existing API's endpoints.
- Created forms and loaded data into the Oracle database.
- Utilized PyUnit for unit testing of the application.
- Performed data analysis using google API's and created visualizations such as pie charts, waterfall charts and displayed in the web application
- Extensive knowledge in loading data into charts using python code.
- Using High charts, passed data and created interactive JavaScript charts for the web application
- Extensive knowledge in using python libraries like OS, Pickle, numPY and sciPY.
- Used Bitbucket for version control and coordinating with the team.
Environment: Python,Django, R, Jupyter,Machine Learning, HTML, CSS JavaScript, Ajax, JSON, CSS Bootstrap, JQuery, MongoDB, Postgres, Oracle, GIT, Amazon EC2, Tableau Desktop, Tableau Server
Confidential, Cleveland, OH
Tableau Developer
Responsibilities:
- Experience in working at various phases of project such as analysis, design, development, and testing.
- Development of Tableau visualizations and stories using tableau desktop. Documenting Business requirements and plans for creating dashboards.
- Developing customized and interactive reports, dashboards also scheduling extracts on Tableau server.
- Used action filters, parameters and calculated fields for preparing dashboards in Tableau
- Experience in migrating client’s reports from excel (static) based solution to an interactive service.
- Designed Donut Charts, Panel Charts, Waterfall, Pareto Charts along with regular charts used in Reporting.
- Used Advanced tableau functions like Regular expressions, LOD Calculations to create calculated fields.
- Forecasted trends, patterns using tableau advanced analytic functions based on the client’s data.
- Administered user, user groups, and scheduled instances for reports in Tableau.
- Hands-on development assisting users in creating and modifying worksheets and data visualization dashboards.
- Expert in creating complex dashboards, Adhoc business views along with user acceptance test and data accuracy test.
- Designed Dashboards that are compatible across any device like laptop, desktop, tablet, mobile.
- Created incremental refreshes for data sources on Tableau server.
- Developed customized Tableau visualizations and stories using tableau desktop
- Written R scripts and performed data analysis using the packages
Environment: Tableau Desktop 9.0/8.0, R language, Tableau Server, Crystal Reports, SQL developer, Oracle 10g/11g, JavaScript API, Microsoft Excel, Hadoop, MS Access.
Confidential, Rochester, MN
Python Developer
Responsibilities:
- Involved in interaction with client for requirement specifications and analyzed the system requirements and specifications.
- Worked on designing the front end of web application using HTML, CSS, JavaScript, JQuery, Bootstrap and created interactive webpages.
- Experience in updating only a portion of webpage using JavaScript and XML
- Worked on creating RESTful http services to interact with user interface
- Developed web application using MVC architecture with the help of Django framework.
- Experience on using Django admin and created superusers, updated tables in the database.
- Experience using many Python modules and packages to build an application rapidly.
- Worked on registering URLs in app URLs and linking them to views.
- Extensive knowledge in using models to create tables and synchronizing it with database.
- Automated the code using the puppet tool.
- Experience in writing complex SQL queries and performed various operations like Create, Update, Read and Delete.
- Worked on databases like MS SQL server, Postgres and SQLite.
- Experience in using the regular expressions to match the pattern with the existing code.
- Extensive use of python data structures lists, tuples and dictionaries.
- Experience on working with complex list comprehensions and python inbuilt functions such as map, filter and lambda.
- Experience in debugging and worked on resolving the bugs
- Used GIT for version control
Environment: Python, Django, HTML, XML, CSS, Bootstrap, SQL, RESTful, JavaScript, JSON, JQuery, Oracle, MySQL, SQLite, GIT.
Confidential
Python Developer
Responsibilities:
- Involved in Design, Development and Support phases of Software Development Life Cycle (SDLC)
- Used OOPs concepts in overall design and development of web/system applications
- Experienced working with a team of developers on Python applications for prioritizing tasks and for RISK management
- Designed and developed the UI of the website using HTML, XHTML, AJAX, CSS and JavaScript.
- Experience in developing entire frontend and backend modules using Python on Djangoand Flask Web Frameworks.
- Most of the client side validation is done using JavaScript.
- Designed and developed data management system using MySQL. Built application logic using Python 2.7.
- Used Django APIs for database access and worked on databases like MySQL, Postgres
- Used Python to extract weekly hotel availability information from XML files.
- Participated in requirement gathering and worked closely with the team in designing and modelling.
- Worked on development of SQL and stored procedures on MYSQL.
- Developed shopping cart for Library and integrated SOAP web services to access the payment.
- Experience in writing application level code to interact with APIs, Web Services using JSON.
- Designed and developed a horizontally scalable APIs using Python Flask.
- Involved in Agile Methodologies.
Environment: Python 2.6/2.7, JavaScript, Django Framework 1.3, Flask, HTML, CSS, SQL, MySQL, SOAP, LAMP, JQuery, Apache web server
