We provide IT Staff Augmentation Services!

Principal Data Scientist /research Leader Resume

2.00/5 (Submit Your Rating)

SUMMARY:

  • Innovator, Designer, Strategist & Storyteller in real time adaptive systems algorithms in artificial intelligence/machine learning specializing in: Analytics Practice/Product based on strategically focused data engineering, data science & software delivery
  • Project manager & consummate consultant who has managed large scale teams (20+) across different functions, levels including executives & off - shore management experience with a focus on delivery using agile development
  • Has an exceptional aptitude for verbalizing mathematics to business audiences & translating their business issues into solvable, actionable, & systematize-able mathematical solutions & systems
  • Big Data Enterprise Data Scientist & Engineer/Intelligence Architect/Research Leader with a strategic & passionate focus in building an analytics business
  • Consistent Status Quo Challenger and Benchmark Buster with a proven track record in mathematical & computational algorithm design & implementation with a unique background in integrated text {Natural Language Processing, Concepts/Semantic Search, LSA(Latent Semantic Analysis)} & numerics in decision engines & BI focusing on: integrated enterprise-wide analytic solutions through machine learning/Artificial Intelligence(automated decision engine) design, build, diagnostics, & implementation
  • Roadmap designer: strategic & tactical: data science (predictive analytics that include semantic/text analytics), LSA (Latent Semantic Analysis), data engineering/design/modeling, machine learning/AI, process engineering, data/model governance, software design using object oriented programming (OOP)
  • Intelligence designer, creator & extractor of integrated mathematical and algorithmic solutions
  • Integration of disparate data sources; Marketing data mart & data strategy for structured (numeric) & unstructured (text/NLP) data, Operational, & analytical data design, Integration of structured & unstructured data in analytics for business process re-engineering & process mining. Fluent in the nuances of sparse (data imputation) & big data issues: Design of data governance & integration for business use, Statistically driven data governance/ cleansing algorithms(fuzzy logic & match); Statistical data tracking design leading to process mining; Ontological database design
  • Risk & Marketing: Suite of marketing/CRM analytics/forecasting; Risk analytics/forecasting; Social media/network analytics; NLP; Process engineering; Benchmarking: predictive & virtual; CRM suite of analytics (acquisition, retention, up & cross-sell, divestiture); Credit Risk (Probability of fraud, overdraft, default, foreclosure, early payment; sensitivity & scenario analysis); BI, Machine Learning/AI: Model control systems: model efficacy tracking & alert systems design & build, quality control; NLP(Natural Language Processing)/Naïve Bayes algorithm design; Latent Semantic/ Classification Analysis; Random Forest

TECHNICAL EXPERTISE:

Data (numeric & textNLP) Mining & Data (numeric & text) Science & Engineering : Design of integrated automated business intelligence (BI) systems: artificial intelligence; Forecasting, Predictive Modeling; Data mining; Data Engineering: Data Governance. Database Design, Data Modeling, Process Engineering; Machine Learning/Artificial Intelligence; Support Vector Machines; Benchmarking. Network Analysis. CRM & Risk Analytics: Multi-D Segmentation: B-to-C & B-to-B: decision tree, cart, chaid, factor & cluster analysis, principal components, random forest. Econometric Modeling, Predictive modeling/recommendation engine: logistics regression, principal component, latent classification modeling, regression. multivariate regression & analysis (multi dimensional scaling, MANOVA), experimental design (ANOVA, factorial design, fractional factorial design), Time series. Survival analysis. Quality control. Regression diagnostics (heteroskedasticity, influential diagnostics). Categorical analysis: experimental design. Dynamic decision engine design/build/implementation. Simulation. Structured & unstructured learning Artificial intelligence & operational data build: CRM/Marketing optimization models, risk prevention/assessment/measurement models, charge-off, fraud, foreclosure, credit, operational, economic risk models, Panel models. Automated business intelligence (BI)/artificial intelligence. Social Media Analysis. Analytical search engine optimization & design. Semi-structured machine learning; Naïve Bayes; Semantics Web: Taxonomy, folksonomy, ontology & semantics development & design; Simulation, Sampling, Bootstrapping, Resampling, Jack-Knifing

Data (Numerics & NLP) Design & Modeling/Programming/Networks: Data modeling, Data warehouse, Data engineering, Data forensics, SAS, JMP, SPSS, SQL, MS SQL/Analyses Server, Knowledge of: ETL, SQL, Tableau, JAVA, C#, mingle

Research Areas : Continuation of discrete variables, form & curve clustering, flattening out of cluster analysis, fractal mathematics in transactional data, mathematical transformations of categorical & time series data; building intelligence through using & integrating structured & unstructured data/information, NLP Data warehouse & design, spatial statistics, n-D factorial analysis, artificial intelligence & machine learning; missing value imputations

PROFESSIONAL EXPERIENCE:

Principal Data Scientist /Research Leader

Confidential

Responsibilities:

  • Point person: spearheading innovation in engineering under the global directive of innovation within Confidential Kluwer
  • Leadership advisor/SME & strategist on the subjects of Artificial Intelligence/Machine Learning & the business of analytics & data
  • Architect mathematical & computational algorithms for machine learning/artificial intelligence systems
  • Write product offerings & go-to-market strategies
  • Product design & implementation; Design & research analytical products for outside sales & an analytical foundation for a consulting practice
  • Manage all technical documentations: Business/technical requirements, project plans/discussions, system codes (data cleansing, models derivation, quality control, BI) & all presentations to various owners & decision makers
  • Collaborate across business units to design product offerings from concept to market; deliver the products & productionalize the products off-shore; this included a strong analytical self - governing machine learning component of the global model efficacy system
  • Manage a cross functional team of in-house, outside consultants & off-shore resources
  • As a data engineer, devise the backend delivery data & collaborate with a team of software engineers to deliver the on-line data view & analytics
  • Interview various participants & write the business, technical, analytical, & functional requirements
  • Designer/Initiator of several patent potential mathematical intelligence driven products
  • Spearhead product development & marketing strategies based on data & research to be developed in US & abroad
  • Go-to person for all analytical products, issues & verbiage for outside analytical product introductions & marketing
  • Design AI/Machine Learning systems using structure & unstructured dat
  • Managed outside vendor team; developed series of integrated analytical solutions that beat the current solution by 11X; created 12 models when they had “0” models
  • Designed first ever semantic/ontological database structure design/foundational text mining machine design & layout for the group. Semantics & concept search design & implementation/LSA (Latent Semantic/Categorical Analysis): Taxonomy/ Ontology development: develop mathematical & computational algorithm for opportunity identification & machine learning
  • Expert in missing value estimation: missing value and data enhancement algorithm design
  • Produced a predictive model that has 95% accuracy rate with 35% coverage using only ONE VARIABLE with minimal correlation with the predicted variable. Given the industry average accuracy of 8%, at best, with many predictive variables, this beat the average by 12X.
  • Flattened out sophisticated clustering algorithm into a flat spreadsheet based algorithm that can be implemented simply
  • Designed a dynamic, unstructured machine learning model that is live & implemented real-time; Designed the first ever statistically driven data cleansing algorithms for data governance & machine learning systems
  • Within1 week of starting, identified major flaws in software design & overhauled the product & processes
  • Within a year, worked with cross functional teams including marketing to create a market buzz around recently created products; created a department from one person to 8; went from inculcating the value of the central limit theorem to using time series, & generalized linear models; went from 0 analytical product offerings stream of analytical product offerings, & data offerings; designed & wrote a comprehensive data governance plan that include administration & automated data quality assessment (this will be launched into other products). Spoke at company’s executive board meetings
  • Requested by the company to go full time 5 times during consulting; finally accepted on the 5th request

Confidential, Bridgeport, CT

VP/Senior Finance Director

Responsibilities:

  • Designed, implemented & presented risk strategy & first line of defense mathematical models against charge-off & overdraft for the company
  • Served as an analytics/BI/SAS Capacity Planner during analytics & data infrastructure design & selection
  • Managed, led, taught & worked with vendors & RBS’s cross functional teams
  • Documented & validated all models for the Risk Committees before & after operationalization. Worked with IT to assess & implement new software & technology
  • Managed 10 TB of data while serving as an analytical data design architect
  • Provided thought leadership around dynamic model design & execution, semantics intelligence, automated business intelligence, database design for analysis & decisioning, & dynamic decision engine design
  • Designed & implemented all live market experiments & their corresponding tracking systems & documented the final results
  • Designed & test marketed the first ever behavioral transformation models
  • Within 1 month, presented the departmental analytical strategy moving forward to the C & Executive Levels
  • Within 3.5 months, created an integrated SAS analytical data environment for the analysis of 3 TB of data
  • Within 6 months, developed the bi-level analytical approach for the delivery of the first ever sensitivity model & attrition model
  • Within 6 months, developed 4 charge-off models, 2 sensitivity models, 5 attrition models
  • Developed a charge off model that is 10X better than their best existing model
  • Within 7 months, created a cross-sectionalization of time series data for binary outcome modeling
  • Within a year, developed an automated schema where one time series variable is transformed into 125 variables, which were later used for modeling
  • Within a year, developed a program of a ranking mechanism where it beat the current best model by 500%. This is done without any underlying distributional assumption of the data
  • Worked with over 15000 independent variables per one model
  • Pinpointed a flaw in an ongoing experimental design, which later resulted in righting the experiment
  • Developed the most comprehensive live marketing experiment portfolio profitability tracking system ever
  • Served as an SME for analytical platform & data design

Confidential, New York, NY

Director of Heuristics & Algorithm

Responsibilities:

  • Member of an internet start-up that focused on local search, social networking & semantics web/web3
  • Thought leader in the areas of: semantics intelligence (taxonomy, folksonomy design & strategy); web analytics for operational efficiency, dynamic decisioning, data & database design; analytical search engine optimization; integration of structured & unstructured data for artificial intelligence in BI (business intelligence) & in dynamic decision engine design
  • Inculcate strategic initiative around semantics intelligence, web analytics, & the integration of text & numbers (unstructured & structured data) for data mining & decision engines design. Most frequently used techniques are: Bayesian filtering/poisoning, pivotal normalization, naïve Bayes, Markov chain, r&om walk/stochastics processes, cluster analysis, factor analysis, principal components, regression, logistics regression, time series, quality control, experimental design, categorical analysis
  • Serve as an SME integrated enterprise-wide analytical strategy, analytical decision engines build & business intelligence systems. Focus is on marketing reporting & forecasting by using various statistical techniques
  • Serve as an SME to the businesses on technical issues & problem solving by using data & analytics. Manage vendor relationships. Research focus: clustering of forms & curves

Confidential, East Hartford, CT

SVP of Analytics

Responsibilities:

  • Member & owner of a start-up company that focus on mining the text through use of Natural Language Processing methods (NLP) of insurance claims within the verticals (auto, home owners/rentals, casualty, worker’s compensation) of the insurance industry
  • Designed & built real-time decision engines for the insurance industry using structured & unstructured data. Data mined through text (unstructured ) & structured data to determine subrogation response scores, which were later used to determine claims subrogation actions
  • Built claims dictionary, taxonomy & ontology using categorical & naïve Bayes statistical algorithms
  • Develop new products for the company & provide on-going insight to search engine algorithms & scoring engine development. Manage & support all aspects of product delivery & client service
  • Within 2 months, introduced a different modeling technique from what they were using before. This technique beat their going average accuracy rate by 400%
  • Focus has been in integration of structured & unstructured data; centroid definition of dummy variables; variance analysis of binary variables

Confidential, Norwalk, CT

SVP of Risk, Systems & Analytics

Responsibilities:

  • Design & derive statistics & models for credit risk, operational risk & economic risk. Design database & performance metrics for credit, operational, & market risks to be incorporated into capital allocation analysis & Basel II preparation: Built PD, LGD, EAD models for capital allocation & performed sensitivity & scenario analysis
  • Manage a group of business analysts & up to VP Level
  • Managed a group budget of $2.5 MM: capacity planning, software selection, vendor management
  • Design, update & report portfolio analysis for internal, external & for the board of directors of Confidential
  • Provide CFS leverage results through statistical analysis during yearly capital allocation process
  • Design analytical decision engines for account & portfolio management within CFS. Design processes & controls for Sarbanes-Oxley compliance
  • Work with cross-functional teams to drive, derive & deliver a systems & analytics solution for the business. Work with outside systems vendors to design a systems solution for the group

Confidential, Washington, DC

Senior Manager

Responsibilities:

  • Client facing role which involved management & development of CRM solutions & CRM solution strategies for multiple industries from requirements gathering, SDLC, project management through design & roll out of all analytical projects
  • Served as a project manager/planner & capacity planner for projects
  • Designed & built: 1) analytical architecture (brains) of marketing engines that include: segmentation, acquisition, retention, cross-sell & risk 2) performance metrics, predictive & descriptive models for real-time integrated marketing systems that encompass all customer touch-points such as call centers & internet contacts 3) the sampling schemes, campaigns & campaign test strategies. Led the modeling of individual pieces within the analytical architecture
  • Led, taught & guided analytical teams internationally
  • Identified necessary hardware, software, data infrastructure & analytics for solutions implementation
  • Operationalized the models with the technology teams. Interacted with the clients to assess their capacities & processes to design the final deliverables. Presented the deliverables
  • Worked with other Confidential practices to generate analytical business. Introduced & created new analytical techniques & approaches to the Customer Insight practice
  • Led several projects at once. Became an analytical/industry SME within 1 year with the firm. Acknowledged as a segmentation expert within 1 year
  • Within 1 year, redefined & heightened the definition of a “good segmentation” for rest of CRM. This segmentation got much publicity within Confidential & acceptance by all of the Confidential team on the client site
  • Taught segmentation courses internationally within Confidential
  • Toppled Confidential ’s leading acquisition modeling (predictive modeling) results for direct mail by 16x while using the same data. The project was commended by the client partner & led to follow-up work
  • This project was expected to have an impact of 300-500% lift over the client’s acquisition rate
  • This project set a precedent in the CRM/CI practice for predictive modeling
  • Consistently outperformed Confidential ’s existing analytical results & met tough deadlines
  • Consistently out-ranked peers during annual reviews

We'd love your feedback!