Senior Data Scientist Resume

PROFESSIONAL SUMMARY:

  • 10+ years of experience in Big Data analytics, research, development and management across industry, academia and government.
  • Using Big Data tools (Hadoop, Spark, Hive, TensorFlow) to develop AI (ML/DL/NLP) applications on billions of customer transactions and records.
  • Building machine learning models from development through testing and validation to production, serving 66+ million customers.
  • Invited speaker at major machine learning conferences (Spark Summit East and West, AGU, Confidential).
  • Developed an AI engine that processes unstructured natural language data (news, reports, social media) to glean insights for a small business index.
  • Internships: Stanford University Press & IIT Bombay.

TECHNICAL SKILLS:

Analysis: Python, R, SQL, C, Shell, NLTK/NumPy/SciPy/Pandas.

Big Data: Apache Spark (Spark, SparkSQL, MLlib), Hadoop, Hive, TensorFlow, Cassandra, Databricks, AWS, Azure.

Machine Learning: Clustering, Classification, Regression, Decision Trees, Anomaly Detection, Recommender Systems, Pattern Discovery, Text Mining, and Deep Learning (CNN, RNN, Word2Vec).

Text Analytics: regex, NLP, LDA, Cosine Similarity, Feature Engineering, Tagging, Voice-to-text conversion.

PROFESSIONAL EXPERIENCE:

Senior Data Scientist

PetSmart

Responsibilities:

  • Develop real-time data processing and machine learning pipelines on Microsoft Cloud to process transactions and extract insights. Design, develop and manage Data Science Products (Personalization, Customer Loyalty, Product Recommendation, and Segmentation (targeted marketing) Engine).

Principal Data Scientist

Confidential

Responsibilities:

  • Developed scalable data processing pipelines (ETL), analytics and segmentations for small businesses using Python, Hadoop, Hive and Spark. Used open-source data (Census, etc.) for targeting.
  • Applied deep learning algorithms for language processing on terabytes of data.
  • Design and development of: industry verticalization (Confidential Patent) using Latent Dirichlet Allocation; personalization; domain recommendation using NLP; Customer Success Dashboard/Go Initiative; Customer360 (Binary Feature Creation, Customer Segmentation, Churn, Next Product Buy); Confidential Small Business Success Index; Competitor Pricing Intelligence.
  • Develop and enhance Machine Learning products and launch them on a regular basis as type I (final products) and type II (PoC). As product owner, define goals, set priorities and manage the backlog of innovative features and AI (ML/DL/NLP) efforts.
  • Apply data mining algorithms and statistical modeling techniques such as clustering, classification, regression, decision trees, neural nets, support vector machines, anomaly detection, recommender systems, sequential pattern discovery, and text mining.

Senior Data Scientist & Senior Data Architect

Confidential

Responsibilities:

  • As product owner, led the in-memory processing PoC implementation using Apache Ignite/GridGain and Apache Spark for credit card risk analytics for Amex merchants in payment processing.
  • Built machine learning models and algorithms for banking, credit cards and capital markets data.
  • Real-time customer personalization (customer 360). A/B testing. Predictive analytics using Apache Spark on credit card fraud data, covering volume and dollar amount over 1-, 3-, 6-, 9- and 12-month periods, with data assimilation capability.
  • Recruited, mentored and managed a data science and analytics team (CoE) across multiple time zones.
  • Leadership and technical guidance on development of analytics toolkit for consumer trends & market analytics on merchant payment data, root cause analytics of payment failure, portfolio analytics, and customer 360 platform using Apache Spark and RevoR.

Data Architect

Confidential

Responsibilities:

  • Led the Machine Learning and analytics API development efforts for Confidential Earth Analytics. Automated data ingestion and QA/QC pipelines between federal and international agencies (NASA, USGS, UN, ESA) and Confidential.
  • Technical consulting for federal, state and vendor organizations across multiple time zones and teams, including Confidential's Product Management, Engineering, Design & Research, and other Data and Analytics Professionals, to identify, build, and launch insightful tests.

Assistant Research Professor

Confidential

Responsibilities:

  • Trained, tested and deployed suites of models (OLS, Logistic, SVM, KNN, etc.) for earth engine analytics on Confidential's data analytics platform (Python/Java). As data architect, devised strategies for optimal data ingestion.
  • Led a team of 5 analysts working on a Confidential-funded project to develop Data Analytics APIs.

Research Data Analyst

Confidential

Responsibilities:

  • Developed an information system for analyzing and visualizing large datasets (XML, JSON, HDF, TXT, TIFF) from IoT devices.
  • Developed Machine Learning algorithms (Regression & Classification) on large datasets (5 TB) for predictive analytics. Developed an optimization scheme using a genetic algorithm for the SWAP model.

Senior Data Assimilation Researcher

Confidential

Responsibilities:

  • Worked with the regional mission of JAXA (the Japanese space agency) in Thailand. Spatio-temporal data modeling, time series analysis, quality analysis, design and analysis of QA/QC experiments.

Graduate Research Assistant

Confidential

Responsibilities:

  • Geospatial and remote sensing image processing using C on UNIX platform. Supervised and unsupervised classification.
