Sr. Director And Chief Data Scientist Resume

Cupertino, CA

PROFESSIONAL SUMMARY:

  • Unique combination of data science and technology background, with over 30 years of experience as a research scientist, data scientist, product/engineering manager, and architect.
  • Participated extensively in R&D and development projects as Data Scientist and Application Architect, covering experiment design and the development and evaluation of advanced analytical and numerical models, such as those from statistical inference and machine learning.
  • Expertise and experience in specific problem areas include Confidential bid response pricing, B2C pricing and revenue optimization, customer value perception analytics, competition tactics planning, and supply chain work (demand forecasting, capacity planning, and inventory optimization).

FUNCTIONAL AND ALGORITHMS EXPERTISE:

  • Statistical Machine Learning, Hypothesis Testing, Classification and Regression, Feature Engineering, NLP, Pattern Recognition (used in Forecasting), Clustering (Market Segmentation), Anomaly Detection for Intrusion Detection and IoT, User/Item-Based Recommenders (Collaborative Filtering).
  • NLP: Worked on Document Discovery and Sentiment Analysis.
  • Time Series Analysis and Modeling, Forecasting, Moving Averages, Exponential Smoothing, and ARIMA.
  • Optimization: Cyclic and Safety Inventory Optimization, Bidding/Pricing and Revenue Optimization, Advertisement Budget Optimization.

TECHNICAL SUMMARY:

Software Tools, Architecture and Design:

ML: Python (pandas, NumPy, SciPy, scikit-learn, NLTK), OpenNLP, R, MATLAB, Tkinter, Jupyter Notebook

ML/DL: Spark ML/MLlib, Mahout, Weka, DL4J, MALLET, MOA, ELKI

Big Data: Apache Spark MLlib, Spark ML with Hadoop/YARN; AWS S3, EC2

Languages: Java, Python, Scala, JavaScript, D3.js, XML, JSON and SQL

Software Modeling and Design: UML, OOAD, API/Framework Design, Cloud/SaaS Computing, Data Modeling

PROFESSIONAL EXPERIENCE:

Confidential, Cupertino, CA

Sr. Director and Chief Data Scientist

Responsibilities:

  • As Director, built the business insights (data science) team and the supporting technology team to develop predictive models using structured, semi-structured, and unstructured data.
  • Mentored data analysts to bring them up to speed on the project.
  • Clarified project objectives, delegated responsibilities, and delivered results to corporate management.
  • As Data Scientist, developed predictive models for classification, regression, and clustering problems. Developed standards and created patents and copyrights.
  • As Product Manager, interacted with clients, captured and documented business requirements, and advised clients on potential concepts, system functionality, and use cases.
  • Interfaced with development teams to ensure on-schedule and accurate delivery of use cases.
  • Developed a fraud detection system based on a clustering model and compared it with probability-density-based models. Responsible for data extraction, preprocessing, model evaluation, and post-processing.
  • Used K-means and normal-distribution-function-based models. Determined the optimal number of clusters.
  • Computed the mean entropy of clusters and the F1 score to evaluate model relevance.
  • Architected and designed the application for deployment in a multi-node cluster environment.
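
The two evaluation measures named above (mean entropy of clusters and F1 score) can be illustrated with a minimal Python sketch. It is not the project code: the function names, the size-weighted averaging, and the toy data are assumptions made for illustration, and plain Python is used for brevity rather than the Spark stack listed under Tools.

```python
import math
from collections import Counter, defaultdict


def mean_cluster_entropy(cluster_ids, labels):
    """Size-weighted mean entropy (in bits) of the label mix inside each cluster.

    0.0 means every cluster contains a single label; higher means more mixing.
    """
    members = defaultdict(list)
    for cluster, label in zip(cluster_ids, labels):
        members[cluster].append(label)

    total = len(labels)
    weighted = 0.0
    for cluster_labels in members.values():
        size = len(cluster_labels)
        counts = Counter(cluster_labels)
        entropy = -sum((n / size) * math.log2(n / size) for n in counts.values())
        weighted += (size / total) * entropy
    return weighted


def f1_score(predicted, actual, positive=1):
    """Harmonic mean of precision and recall for the `positive` class."""
    pairs = list(zip(predicted, actual))
    tp = sum(p == positive and a == positive for p, a in pairs)
    fp = sum(p == positive and a != positive for p, a in pairs)
    fn = sum(p != positive and a == positive for p, a in pairs)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data: cluster 1 is treated as the "fraud" cluster.
clusters = [0, 0, 0, 0, 1, 1, 1, 1]
truth = [0, 0, 0, 0, 1, 1, 0, 1]
print(mean_cluster_entropy(clusters, truth))
print(f1_score(clusters, truth))
```

In this sketch a lower mean entropy indicates purer clusters, while the F1 score treats membership in the flagged cluster as the fraud prediction.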

Tools: Scala, Java, MLlib, Spark ML, Hadoop, Spark RDDs, Spark SQL, and R/RStudio/SparkR Visualizations.

Confidential

Sr. Director and Chief Data Scientist

Responsibilities:

  • Review use cases and interface with pilot customers to gain insight into the complexities of cases.
  • Receive datasets from customers and make an initial assessment of the user data.
  • Responsible for data extraction, preprocessing, model evaluation, and post-processing.
  • Conduct comparison studies on different types of recommenders: item-based and user-based.
  • Design APIs for the recommender system components and subcomponents.
  • Architect and design the application to deploy in a multi-node cluster environment.
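
The item-based versus user-based comparison mentioned above can be sketched in plain Python. This is an illustrative example only, not the recommender built on the project: the cosine similarity measure, the convention that 0 means "not rated", the function names, and the ratings matrix are all assumptions.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length rating vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def predict_user_based(ratings, user, item):
    """Similarity-weighted average of other users' ratings for `item`."""
    num = den = 0.0
    for other, row in enumerate(ratings):
        if other == user or row[item] == 0:
            continue  # skip the target user and users who have not rated the item
        sim = cosine(ratings[user], row)
        num += sim * row[item]
        den += abs(sim)
    return num / den if den else 0.0


def predict_item_based(ratings, user, item):
    """Similarity-weighted average of the user's own ratings for other items."""
    columns = [list(col) for col in zip(*ratings)]  # one rating vector per item
    num = den = 0.0
    for other, col in enumerate(columns):
        if other == item or ratings[user][other] == 0:
            continue  # skip the target item and items the user has not rated
        sim = cosine(columns[item], col)
        num += sim * ratings[user][other]
        den += abs(sim)
    return num / den if den else 0.0


# Hypothetical ratings: rows are users, columns are items, 0 = not rated.
ratings = [
    [5, 4, 0],
    [5, 4, 5],
    [1, 2, 1],
]
print(predict_user_based(ratings, user=0, item=2))
print(predict_item_based(ratings, user=0, item=2))
```

The user-based variant compares rows of the matrix and the item-based variant compares columns; a comparison study would score both on held-out ratings.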

Tools: Python, Scala, Java, MLlib, Spark ML, Hadoop, YARN, SQL, J2EE, MySQL DB, Oracle, Spreadsheets, and Jupyter

Confidential

Product Manager and Scientist

Responsibilities:

  • Pricing models/algorithms development - Price analytics, market sensitivity analysis, market segmentation, tactical promotion planning.
  • Designed APIs for clustering/segmentation: K-means for customer segmentation.
  • Developed game-theoretic best-response functions to respond to competitor pricing.
  • Developed optimization models to maximize margin.
  • Created a proprietary multi-recursive market segmentation and optimization algorithm.

Environment: Python (scikit-learn), Hadoop HDFS, Spark, MLlib, Java API, MySQL, SQL, and Eclipse

Confidential

Data Scientist

Responsibilities:

  • Responsible for price segmentation/clustering, model development, and code delivery.
  • Conducted comparison studies on classification algorithms to select the best response (LR) model.
  • Designed APIs for bidding LR algorithms and machine-learning-based clustering/segmentation: K-means.
  • Preprocessed data and compressed data sets via dimensionality reduction using LDA.
  • Tuned and tested algorithms for performance (accuracy and speed).

Environment: Python(x,y): scikit-learn, SQL, Java, J2EE, HDFS, MapReduce, MySQL DB

Confidential

Data Scientist

Responsibilities:

  • Preprocessed data for completeness and accuracy, including data visualization and exploration.
  • Performed feature engineering; conducted model evaluation and optimization (iterative cross-validation, performance measurement, and parameter tuning).

Environment: Python(x,y): NumPy, SciPy, scikit-learn, pandas

Confidential, Plano, TX

Data Scientist

Responsibilities:

  • Created models using extrapolation-based smoothing and pattern-recognition-based algorithms to minimize statistical error measures (MAPE, MAD, and MSE).
  • The models performed well on level, trend, and seasonal data, as well as on data with varying trend.
  • Developed causal forecasting models using multivariate regression on pricing, ad budget, and team size.
  • Developed API interfaces to work with inventory optimization models and related data.
  • Supported the team with analytical modeling and algorithms throughout the product life cycle.
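
The error measures named above (MAPE, MAD, and MSE) can be shown against a simple exponential smoothing forecast. This is a minimal Python sketch rather than the original Java implementation; the function names, the smoothing constant, and the demand series are assumptions chosen for illustration.

```python
def exponential_smoothing(series, alpha):
    """One-step-ahead forecasts: forecasts[t] is the prediction for series[t].

    The first forecast is seeded with the first observation.
    """
    forecasts = [series[0]]
    for actual in series[:-1]:
        forecasts.append(alpha * actual + (1 - alpha) * forecasts[-1])
    return forecasts


def forecast_errors(actuals, forecasts):
    """Return MAD, MSE, and MAPE (in percent); assumes no actual value is zero."""
    errors = [a - f for a, f in zip(actuals, forecasts)]
    n = len(errors)
    return {
        "MAD": sum(abs(e) for e in errors) / n,
        "MSE": sum(e * e for e in errors) / n,
        "MAPE": 100 * sum(abs(e) / abs(a) for e, a in zip(errors, actuals)) / n,
    }


# Hypothetical demand history with an upward trend.
demand = [100, 110, 120, 130]
forecasts = exponential_smoothing(demand, alpha=0.5)
print(forecasts)
print(forecast_errors(demand, forecasts))
```

Simple smoothing lags a trending series, which is why the errors grow in this toy example; trend and seasonal components call for extended smoothing models.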

Environment: Java/J2EE, Numerical Recipes, Struts, SQL, Oracle DB, Tomcat 6.0, Eclipse IDE

Confidential, La Palma, CA

Data Scientist

Responsibilities:

  • Acted as architect and designer of the software system for the Confidential project.
  • Mentored developers and guided them in designing and developing a multi-tier system.
  • The project was recognized as one of the most successful systems and was delivered on time.

Environment: J2EE stack, Spring MVC, Big Data HDFS, Oracle DB, WebSphere, Eclipse

Confidential, Dayton, OH

Data Scientist

Responsibilities:

  • Built a document classification system with predefined characteristic attributes.
  • Used ML algorithms to broadly identify different types and subtypes of documents.
  • Developed a document classification model in Java to classify documents based on certain keywords.
  • Used an SVD-based approach to recommend documents to readers based on their selected interests.
  • Implemented the system as a three-tier application using the Struts framework.

Environment: J2EE stack, Struts MVC, IBM DB2, Oracle DB, Sybase DB and data warehouses, WebLogic, Eclipse

Confidential

Data Scientist

Responsibilities:

  • Developed pricing models using Oracle BRM (previously Portal Infranet), which we developed at Confidential.
  • Worked on the Catastrophes and Perils Data System, which manages data over the Web.

Environment: Swing Applet, Servlets (for Authentication), Session, Entity (BMP) and Message-driven Beans holding service calls, Model-View-Controller Design, Hibernate, XML, Log4j, Oracle, WebSphere/WSAD/Eclipse Development Environment, CVS

Confidential, San Diego, CA

Consulting Scientist

Responsibilities:

  • Crystal Brain is a web-based tool for viewing and manipulating research data in real time.
  • Responsible for data preparation and model development.
  • Also designed and developed the software as a web-based three-tier system.

Environment: R, Python, and J2EE: JSP, Servlets, JDBC; J2SE: Swing applets; Struts framework, WebLogic

Confidential, Cupertino, CA

Senior Staff Engineer - Software Architect and Lead

Responsibilities:

  • Developed billing and customer management software for ISPs and the telecom industry.