Sr. Director And Chief Data Scientist Resume
Cupertino, CA
PROFESSIONAL SUMMARY:
- Have unique combination of data science and technology background with an experience of over 30 years as a research scientist, data scientist, product/engineering manager, and architect.
- Extensively participated in various R&D and developmental projects as Data Scientist and Application Architect that involve experiment design, developing and evaluating advanced analytical and numerical models, such as those from statistical inferencing and machine learning.
- Expertise and experience in specific problem areas are Confidential bid response pricing, B2C pricing and revenue optimization, customer value perception analytics, competition - tactics planning, and supply chain - demand forecasting, capacity planning and inventory optimization.
FUNCTIONAL AND ALGORITHMS EXPERTISE:
- Statistical Machine Learning, Hypothesis Testing, Classification and Regression. Feature Engineering, NLP, Pattern recognition (used in Forecasting) and Clustering - Market Segmentation, Anomaly Detection for Intrusion Detection and IOT, User/Item Based Recommenders: Collaborative Filtering.
- NLP: Worked on Document Discovery, and Sentiment Analysis.
- Time Series Analysis and Modeling, Forecasting, Moving Averages Exponential Smoothing, and ARIMA
- Optimization - Cyclic and Safety Inventory Optimization, Bidding/Pricing and Revenue Optimization, Advertisement Budget Optimization
TECHNICAL SUMMARY:
Software Tools, Architecture and Design:
ML: Python Pandas, NumPy, Scipy, Scikit-learn, NLTK, Open NLP, R, Matlab, Tkinter, Jupyter Notebook
ML/DL: Spark ML/MLLib, Mahout, Weka, DL4j, Mallet, MOA, ELKI
Big Data: Apache Spark MLLib, Spark ML with Hadoop/YARN; AWS S3, EC2
Languages: Java, Python, Scala, JavaScript, D3.js, XML, JSON and SQL
Software Modeling and Design: UML; OOAD/ API/Framework Design, Cloud/SAAS Computing Data Modeling
PROFESSIONAL EXPERIENCE:
Confidential, Cupertino, CA
Sr. Director and Chief Data Scientist
Responsibilities:
- As Director, built the business insights (data science) and relevant technology team to develop predictive models using structured, semi-structured and unstructured data.
- Mentored data analysts to come up to speed with the project.
- Clarified project objectives, delegated responsibilities, and delivered results to corporate management.
- As Data Scientist, developed predictive models for classification, regression and clustering problems. Developed standards and created patents and copy rights.
- As Product Manager interacted with clients, captured and documented business requirements; advocated clients on potential concepts, system functionality and use cases.
- Interfaced with development teams to ensure the schedules and accuracy in the delivery of use cases.
- Develop a clustering model based fraud detection system and compared with probability density based models. Responsible for data extraction, preprocessing, model evaluation, and post-processing.
- Used K Means and Normal Distribution Function based models. Developed optimal number of clusters.
- Generated the mean entropy of clusters and F1 score to evaluate model relevance.
- Architect and design the application to deploy in a multi-node cluster environment.
Tools: Scala, Java, MLLib, Spark ML, Hadoop, Spark RDDs, Spark SQL, and R/RStudio/SparkR Visualizations.
Confidential
Sr. Director and Chief Data ScientistResponsibilities:
- Review use cases and interface with pilot customers to gain insight into the complexities of cases.
- Receive datasets from customer and make initial assessment on data from users.
- Responsible for data extraction, preprocessing, model evaluation, and post-processing.
- Conduct comparison studies on different types of recommenders: Item based and User based.
- Design APIs for the recommender system components and subcomponents.
- Architect and design the application to deploy in a multi-node cluster environment.
Tools: Python, Scala, Java, MLLib, Spark ML, Hadoop, YARN, SQL, J2EE and MYSQL DB, Oracle, and Spread Sheets, and Jupyter
Confidential
Product Manager and Scientist
Responsibilities:
- Pricing models/algorithms development - Price analytics, market sensitivity analysis, market segmentation, tactical promotion planning.
- Designed APIs for clustering/segmentation: Kmeans for customer segmentation.
- Developed best response functions to deal with pricing from competitors based on game theory.
- Developed optimization models to generate margin maximization.
- Created a proprietary multi-recursive market segmentation and optimization algorithm.
Environment: Python: Scikit-Learn, Hadoop HDFS, Spark, MLLib, Java API, MySQL, SQL and Eclipse
Confidential
Data scientist
Responsibilities:
- Responsible for price segmentation/clustering and models development and code delivery.
- Conducted comparison studies on Classification algorithms to decide the best response (LR) model.
- Designed APIs for bidding LR algorithms and Machine Learning based clustering/segmentation: Kmeans.
- Preprocessed data and compressed sets through dimensionality reduction through LDA.
- Tuned and tested algorithms for performance - accuracy and speed.
Environment: Python (x, y): Scikit-Learn, SQL, Java, J2EE, HDFS, MapReduce, MySQL DB
Confidential
Data scientistResponsibilities:
- Preprocessed data for completeness and accuracy including visualization and exploration of data.
- Performed feature engineering; conducted model evaluation and optimization - iterative cross validation, performance measurements, parameter tuning.
Environment: Python(x, y): NumPy, Scipy, Scikit-Learn, pandas
Confidential, Plano, TX
Data scientist
Responsibilities:
- Created models - used extrapolation based smoothing, and pattern recognition based algorithms to minimize statistical errors - MAPE, MAD, and MSE.
- Models operated well with level, trend and seasonal data and also for data with variance in trend.
- Developed causal forecasting models using multi-variate regression - pricing, ad budget, team size.
- Developed API interfaces to work with inventory optimization models and related data.
- Supported the team with analytical modeling and algorithms through the life cycle of product.
Environment: Java/J2EE, Numerical Recipes, Struts, SQL, Oracle DB, Tomcat 6.0, Eclipse IDE
Confidential, La Palma, CA
Data scientistResponsibilities:
- Acted as an architect and designer to develop the software system for Confidential project.
- Mentored developers and guided them to design and develop a multi-tier system.
- Project recognized as one of the most successful and on time delivered system.
Environment: J2EE stack, Spring MVC, Big Data HDFS, Oracle DB, Web sphere, Eclipse
Confidential, Dayton, OH
Data scientistResponsibilities:
- Built a documents classification system with predefined characteristic attributes.
- Used ML algorithms to broadly identify different types and subtypes of documents.
- Developed a document classification model in Java to classify documents based on certain keywords.
- Used SVD based approach to recommend documents to readers based on their selections of interest.
- Implemented as a three tier system with Struts framework.
Environment: J2EE stack, Struts MVC, IBM DB2, Oracle DB, Sybase DB and data warehouses, Web Logic, Eclipse
Confidential
Data scientistResponsibilities:
- Developed the pricing models using Oracle BRM (previously Portal Infranet) developed by us at Confidential .
- Catastrophes and Perils Data System to manage the data over Web.
Environment: Swing Applet, Servlets (for Authentication), Session, Entity (BMP) and Message-driven Beans holding service calls, Model-View-Controller Design, Hibernate, XML, Log4j, Oracle, WebSphere/WSAD/Eclipse Development Environment, CVS
Confidential, San Diego, CA
Consulting Scientist
Responsibilities:
- Crystal Brain is a web based tool to view and manipulate the research data in real time.
- Responsible for data preparation and model development.
- Also carried out the software design and development as a Web based 3-tier system.
Environment: R, Python, and J2EE: JSP, Servlets, JDBC; J2SE: Swing applets; Struts framework, WebLogic
Confidential, Cupertino, CA
Senior Staff Engineer - Software Architect and Lead
Responsibilities:
- Developed a billing and customer management software for ISPs and telecom industry.
