We provide IT Staff Augmentation Services!

Data Scientist Resume

0/5 (Submit Your Rating)

Atlanta, GA

SUMMARY

  • 7 years of strong proactive experience in Data Analytics with ability to build/create new opportunities for organizations.
  • Strong ability to analyze sets of data for signals, patterns, ways to group data to answer questions and solvecomplex data puzzles
  • Proficient in advising on the use of data for compiling personnel and statistical reports and preparing personnel action documents.
  • Work experience in analytics, working with data to convert large volumes of structured and unstructured data into actionable insights and business values.
  • Ability to analyse raw data, drawing conclusions and developing recommendations.
  • Proven experience in hypotheses development, identification of patterns within data, analyzing data and interpreting results.
  • Skilled in Advanced Regression Modeling, Time Series Analysis, Statistical Testing, Correlation, Multivariate Analysis, Forecasting, Model Building, Business Intelligence tools and application of Statistical Concepts.
  • Proficient in: Data Acquisition, Storage, Analysis, Integration, Predictive Modeling, Logistic Regression, Decision Trees, Data Mining Methods, Forecasting, Factor Analysis, Cluster Analysis, ANOVA, Neural Networks and other advanced statistical and econometric techniques.
  • Expert in Excel Macros, Pivot Tables, vlookups, PowerPivot and other advanced functions.
  • Adept in writing code in R and T - SQL scripts to manipulate data for data loads and extracts.
  • Proficient in data entry, data auditing, creating data reports & monitoring data for accuracy.
  • Ability to extract Web search and data collection, Web data mining, Extract database from website, Extract Data entry and Data processing.
  • Strong experience with R Visualization, Qlikview and Tableau to use in data analytics and graphic visualization.
  • Extensively worked on using major statistical analysis tools such as R, SAS, MATLAB.
  • Strong knowledge in all phases of the SDLC (Software Development Life Cycle) from analysis, design, development, testing, implementation and maintenance with timely delivery against deadlines.
  • Extensive experience with creating MapReduce jobs, SQL on Hadoop using Hive and ETL using PIG scripts, and Flume for transferring unstructured data to HDFS.
  • Strong ability to work with Mahout, for applying machine learning techniques in Hadoop Ecosystem.
  • Strong Oracle/SQL Server programming skills, with experience in working with functions, packages and triggers.

PROFESSIONAL SKILLS

Tools: SAS, R Studio, SPSS, Qlikview, Tableau, SQL*PLUS, Eclipse, Business Objects, Microsoft Word and Excel

Statistical Techniques: Advanced Regression Models, Logistic Regression, Time Series, Predictive Models

Big Data Ecosystems: Hadoop, HDFS, MapReduce, Mahout, Hive, Pig, Sqoop, Flume

Database: Oracle, MySQL, Teradata SQL Studio

Languages: SQL, PL/SQL, T-SQL, R, Java, C/C++, Python

PROFESSIONAL EXPERIENCE

Confidential, Atlanta, GA

Data Scientist

Responsibilities:

  • Used SAP Business Objects and Teradata SQL Assistant to read data from IHG’s Teradata database.
  • Extensively used Microsoft Excel Macros, Pivot Tables, vlookups, Match/Index, Pivot Tables and other advanced functions to leverage raw data.
  • Used SAS to retrieve data from Teradata Database, perform ETL and conducted different p-tests, t-tests, regression and non-regression tests.
  • Analyzed main Revenue Management Indicators (ARI, MPI and RGI) of hotels to determine their performance against competitors at a high level and advised on opportunities to drive higher occupancy and revenue.
  • Used Tableau to create both ad hoc reports requested and automatic weekly scheduled Forecasts Outlook report including several bar charts, heat maps, etc.
  • Created Report slides using PowerPoint and Thinkcell.
  • Developed new KPIs to measure performance of suggested promotion initiatives for IHG’s specific hotel brands.

Environment: SAP Business Objects, Teradata SQL Studio, SAS, Excel, PowerPoint, Tableau

Confidential, Middletown, NJ

Data Scientist

Responsibilities:

  • Evaluated feature importance related to AT&T’s mobile platform user churn rate, via Random Forest and “L-1” Regularized Logistic Regression and presented insights based on feature importance in interactive user-friendly Web-based platforms.
  • Implemented user segmentation using Decision Trees and K-means Clustering, prototyped dynamic visualization of clustering results in R shiny and Plotly.
  • Predicted user churn rate using General Additive Models, combined with feature clustering, to understand non-linear patterns between user churn rate and related monthly platform usage features.
  • Prepared large volumes of user history data and performed ETL with Hadoop and applied above-mentioned machine learning techniques using Mahout.
  • Used Mahout and collaborative filtering to build predictive models, which were used to optimize ad campaign performance.
  • Developed statistical tools for comparing the performance of predictive models.

Environment: R/R Studio, Hadoop, MapReduce, Mahout, Qlikview

Confidential, New York, NY

Data Scientist

Responsibilities:

  • Used various approaches to collect the business requirements and worked with the business users for ETL application enhancements by conducting various JRD sessions to meet the job requirements.
  • Designed data profiles for processing, including running PL/SQL queries and using R for Data Acquisition and Data Integrity which consists of Datasets Comparing and Dataset schema checks.
  • Performed exploratory data analysis like calculation of descriptive statistics, detection of outliers, assumptions testing, factor analysis, etc., in R.
  • Conducted data/statistical analysis, generated Transaction Performance Report on monthly and quarterly basis for all the transactional data from U.S., Canada, and Latin America Markets using SQL server and BI tools such as Report services and Integrate services (SSRS and SSIS).
  • Used R to generate regression models to provide statistical forecasting.
  • Applied Clustering Algorithms such as K-Means to categorize customers into certain groups.
  • Implemented Key Performance Indicator (KPI) Objects, Actions, Hierarchies and Attribute Relationships for added functionality and better performance of SSAS Warehouse.
  • Used Tableau and designed various charts and tables for data analysis and creating various analytical Dashboards to showcase the data to managers.
  • Performed data management, including creating SQL Server Report Services to develop reusable code and an automatic reporting system and designed user acceptance test to provide end users (my manager and the member of other team) with an opportunity to give constructive feedback.

Environment: R/R Studio, SAS, Oracle Database 11g, Oracle BI tools, Tableau, MS-Excel, Windows 7

Confidential, West Point, PA

Data Scientist

Responsibilities:

  • Supported project by data governance, cleaning data, creating exploratory tables and listings, reviewing output, and validating analyses.
  • Performed Sentimental analysis with public health surveillance by Machine learning.
  • Identified suitable predictive model and executed in BI.
  • Assisted to implement ad-hoc project and analysis.
  • Executed A/B and multivariate tests to optimize web analyst performance.
  • Support preparations for interactions with regulatory agencies.
  • Assisted with the data governance: Data policies, Data Standards, Data Management, Strategic planning, ongoing control, Key matrix, etc.
  • Developed and assisted in maintenance of department tools, templates, guidelines, SOPs, etc.
  • Practiced to estimate the value of data and manage data related issues and suggested to implement new data policy in the organization.

Environment: SQL, R, MS-Excel, Qlikview, MATLAB R2010a, Minitab, Pentium PC, Windows 7

Confidential, Wilmington, DE

Junior Java Developer

Responsibilities:

  • Enhanced the Portal UI using HTML,JavaScript, XML, JSP,Java, CSS as per the requirements and providing the client sideJavascript validations.
  • Used client sideJavascripting: JQUERY for designing TABS and DIALOGBOX.
  • Used Hibernate for mapping the ORM objects to table using Hibernate annotations.
  • Developed Web services component using XML, WSDL, and SOAP with DOM parser to transfer and transform data between applications.
  • Effectively interacted with team members and business users from different regions for requirements capture and analysis.

Confidential, PA

BI Analyst/Credit Analyst

Responsibilities:

  • Installed, customized and integrated Financial Software.
  • Developed integrated CRM system using Salesforce.
  • Developed integrated loan management IT systems.
  • Optimized IT system infrastructures.

We'd love your feedback!