Principal Data Scientist Resume
Parsippany, NJ
SUMMARY:
Creative Problem Solver. Working as Head of Data Science with 14+ yr. of relevant exp. Accountable for building vision and strategy to realize full potential of data contained in company, making product design data driven, innovating cutting edge research and development of predictive models & optimization algorithms; while serving as technical & scientific leader for machine learning & advancing industry leadership.
LEADERSHIP SKILLS:
- Leadership & management skills of growing a startup team in various stages & leading product development
- Expertise applying advanced statistics and machine learning to large, messy, unstructured, real - world datasets and communicating insights to executives & clients. Seasoned thought leadership in Data Science & Big Data.
- Spearheaded 10+ Greenfield projects, guided to steady state, led team of ~40 while influencing
- Innovation charter to build next generation products with embedded machine learning that improves efficacy
- Conceived product ideas; envisaging long term impacts at germination and articulating actionable roadmap
- Led projects that drive product personalization, marketing effectiveness, channel optimization, better customer experience. Designed & developed large-scale recommendation system propelled by social psychology research
- Hand holding of F100 to build Big Data platform.
- Built Data driven frameworks for ~260 million large datasets
- Spoken at Technical Conferences & events as Business Analytics expert.
- SME speaker in IBM to evangelize its signature analytics brands viz.
- Cognos & SPSS at events like IMTC, Software Universe, IOD, MIL etc.
TECHNICAL SKILLS:
- Machine Learning-Segmentation, Classification, Regression, Optimization (Stochastic Gradient Descent)
- Algorithms- k-means, k-NN, Linear and Generalized Linear Models, naïve Bayes, SVM, Random Forest
- Stats - Probability Distribution, Hypothesis Testing, Confidence Intervals, Dimensionality Reduction (PCA)
- Statistical Packages- SPSS, Python, Spark ML-lib, Scikit learn, Stata, R, Tensor flow
- Big Data Toolkit-Hadoop, MapReduce, Spark, Hive, Hortonworks, NoSQL, NiFi, Ranger on Azure & AWS
- Data Wrangling- Deep technical background of Data Warehousing, BI, Ralph Kimball dimensional modeling
- Exposure to Deep Learning- Neural Net, CNN, Semi-supervised learning (Char-RNN), Text Analytics (NLTK)
- Social profiling cluster with k-means & hierarchical cluster movement models using Gaussian mixture models
- Trained Bayesian beta-binomial conjugate for payments p, C5 pessimistic pruning derived decision tree algorithm for timecard exception, Deep learning char-RNN LSTM model to generate job description
- Published in DMA Journal Dynamically Evolve Right Offer for Right Customer using Genetic algorithm
EXPERIENCE:
Confidential, Parsippany, NJ
Principal Data Scientist
Responsibilities:
- Chartered to drive differentiation in HCM market by cohesively infusing AI into products. Inceptor of ML team
- Product Innovation- Prescriptive payroll, Payments recommendation engine, Timecard curator, Talent reflow
- Submitted 2 patent disclosures. Data Lake in AWS. Payroll recommendation engine productionalized
Confidential, Boston, MA
Data Scientist / Lead Architect- Big Data
Responsibilities:
- Strategize Big Data Shared Service as data fabric to support group divisions viz. Insurance, Investment, RPS
- Action clustering for demographic micro-targeting using data of whole USA (260 Million with 1800+ attr)
- Architectured Data Lake, global end-end data pipelines from metadata driven data ingestion framework to enlightenment dashboards.
- Designed fully automatic process for cluster creation, restarting Hadoop, storage etc.
Confidential, Princeton, NJ /Armonk, NY
Principal Data Scientist / Lead Architect- Big Data
Responsibilities:
- Built Confidential (Next Best Action) with big data and data science team. Responsible for data pipelines & predictive modelling algorithms that underpins marketing strategy and direct customer targeting to maximize profitability
- Led team ~31 for Prescriptive Analytics to orchestrate Physician Segmentation using golden ratio based centroid clustering model. Improved Silhouette coefficient of social profiling model from 0.15 to 0.55
- Led team from inception to deployment which includes hands-on crafting of dimensional models for variable depth hierarchy. Created data Architecture, data quality framework with audit, balance & accuracy modules
- Pipelined Social Media Analytics project to derive employee sentiment using internal & ext. sites like Twitter et.al. Identified key nodes using network influencer & homophile approach. Experimented with NLP
Confidential
Predictive Analytics Solution Architect / Product Development
Responsibilities:
- Envisaged & Designed 7+ out of box predictive models, growing product footprint with $25M+ waterholes
- Designed SPSS Decision Management (DM6) apps combining Predictive Modeling output with BRMS (Business Rules Engine) with Optimization logic to deliver real time recommendations
- As Data Scientist designed DM6 Telecom Campaign Analytics. Optimized to maximize profitability by matching right offer to right customer over right channel by taking in NPS, Lifetime value, churn propensity
- Built hybrid recommender system. Large scale sentiment keyword categorization with LDA (Latent Dirichlet)
- Envisaged SPSS-DM6 Intelligent Call Routing as Prescriptive Analytics app which assesses behavior & overall importance level of customers and recommend specific action to be taken before responding to customers’ call
- Developed SPSS-DM6 Retail Analytics for loyalty clustering by a deep understanding of the customers behavior
- Cognos Product Labs -As Data Modeler designed developed Cognos Adaptive Analytics Framework (AAF). Led team of product engineers to build out of box analytics for ERP based systems for as many as ~12 ERPs
- Designed, developed & released products as Cognos 8 Workforce Performance Talent Analytics & Cognos 8 Financial Performance Analytics for read-to-use & configurable Adaptive Warehouse & Adaptive Reporting
- Worked with CTO Team at IBM Almaden Labs to build multi tenancy framework to put Cognos on Cloud (‘09)
Confidential, Chicago, IL
Data Architect/ Data Engineering
Responsibilities:
- Led Accelerated Data Growth architecture for one of largest DW in Europe at 36TB, 2600 tables, 112 sources
- Account breaking story for diffident client. Leading team of ~10 modelers, engineers to a benchmark project
- Identified barriers to develop gracefully scalable solution for a full-fledged HR Analytics portfolio of F100 clients; spanning 7+ functions, 50+ modules designed to be re-usable & plug-n-play model across the industry
- Re-designed HRO system for clients with millions of rows to improve data pipeline efficiency by 93%
- Modeling for first Decision Support System for Indian Army’s “Big Data” of ~1.1M servicemen along with millions of ammunition & vehicles. Created variable depth hierarchy models. Replaced 350+ division by ~35