Data Scientist Resume
Wilmington, DE
PROFILE:
- Seasoned Data Science professional with around 15 years of professional experience and a consistent track record of delivering strong, quantifiable results in a dynamic, fast - paced environment - in the context of fortune 500 companies.
- Extremely passionate about applying Machine Learning to problems that impact the business bottom line.
- Comfortable partnering with those directly involved with big data infrastructure, software, and data warehousing, as well as product management across different business units, in a matrix, global organizations.
AREAS OF EXPERTISE:
- Executive Communication
- Python, Java, SQL, Tableau, Hadoop, Hive, Spark, AWS, Tensor Flow and major visualization tools
- Ensemble Methods, Deep Learning, plus a wide range of topics in the ML and AI spectrum
- Strong theoretical foundation on statistical methods for Machine Learning, Linear Algebra
PROFESSIONAL EXPERIENCE:
Confidential, Wilmington, DE
Data Scientist
Responsibilities:
- Gathered, analyzed, documented and translated application requirements into data models
- Extracted sample data from Amazon Redshift, identified patterns, data quality issues, and leveraged insights from BI team
- Utilized Pandas for data pre-processing, imputation, and feature engineering to perform exploratory analysis with statistics
- Performed correlation analysis and utilized descriptive and graphical statistics to describe and summarize model variables
- Tested major algorithms such as Logistic Regression, Bayesian Probabilistic Classifiers, and Gradient Boosting, available on Scikit-learn. Implemented, tuned and deployed models on AWS
- Judged model performance according to agreed accuracy baseline via cross validation. Tuned hyper-parameters, featured selection, optimized, strategized and presented findings to the board
- Tackled highly imbalanced Fraud dataset using oversampling with SMOTE (Synthetic Minority Over-Sampling Technique) and cost sensitive algorithms with Python Scikit-learn to fulfill the dataset requirement
Technology Stack : Numpy, Pandas, Scikit-Learn, Spark, AWS, Hadoop, Advanced Methods of Machine Learning
Confidential, Raleigh, NC
Data Scientist & Technology Outlook Analytics Manager
Responsibilities:
- Liaised with several Israeli high-tech firms as well as internal business units to deliver the first working prototype of Lenovo mixed reality headset: Lenovo Glasses
- Implemented several augmented reality engines based on Hierarchical Bayesian Models, Discriminative (SVM, Neural Nets, Kernels and Conditional Random Fields) and Generative Methods
- Performed Ensemble Methods to improve the accuracy of a Human Action Classification Engine
- Interfaced with Big Data teams to perform ETL on massive amounts of training data
- Worked on a pilot project to cloud-enable a mixed reality engine to store/retrieve live feeds of extra-reality layers of geo-tagged information
Technology Stack : Python 3.x, AWS, Hive, Pig Latin, HDFS, PySpark, Jupyter, Numpy, Pandas, Scikit
Confidential
System Analyst
Responsibilities:
- Collected and analyzed predictors from social networks data streaming and stored in a Hadoop System of Clusters with Hive. Utilized advanced Web Scraping techniques to that end.
- Conducted Sentiment Analysis (emotion AI), and tested the results against “human consensus” from previous year’s LTO reports.
- Performed data visualization and designed dashboards with Tableau; provided complex reports, including charts, summaries, and graphs to interpret the findings to R&T as well as to Server, PC, Mobile and Cloud Business Units.
Technology Stack : Tableau 10.x/9.x, (SSIS/SSRS), Power BI, AWS, MS SQL Server 2012, Python (Scikit-Learn/Scipy/Numpy/Pandas), Machine Learning Algorithms (Naive Bayes, Neuro Nets, Random Forest, Maximum Entropy), AWS S3, EC2, Hadoop
Confidential
System Analyst, Developer
Responsibilities:
- Charged with providing technology-oriented consulting, system analysis, business cases identification, development, and assessment of key technology solutions.
- Responsible for development and maintenance of core functionalities as well as e-commerce and internet based customer care programming assistance
- Make sure the following workflow runs smoothly, end-to-end: enterprise clients generate order with their own ERPs Orders process by “red-zone” intermediary system and sent to a “transaction hub” “transaction hub” spread order to multiple systems in XML, including MQSeries B2B consume orders and convert to XML bind schema order sent to Web order management store request in DB2 production database.
Technology Stack : Windows XP, J2EE/Java, JSP, Struts/Hibernate, Web Services, RMI, CVS, WebSphere IDE, DB2, MQ Series (for asynchronous binding), JDBC, JDNI, SOAP, and JMS, JDEdwards.
Confidential, Florida
System Analyst
Responsibilities:
- Software Developer. Development and maintenance of the most critical modules of the system to schedule classes, exams and tutors resources using Genetic Algorithms and A.I Concepts.
Environment: Java, JSP, C++, Mr. Persister, XSLT, Resin, HTML, CSS, and XML, CVS.
Confidential
System Analyst/Researcher
Responsibilities:
- Technical Lead of a semi-real-time s imulation system contracted to the Confidential .
- Generate flood simulation and Digital Elevation Models; integration of data created by the numerous Brazilian agencies responsible for conserving the Amazon Creek.
- Development, analysis and design of Object Oriented semi-real time engines on top of a GIS - Geographic Information System data layer.
Environment: UNIX, Windows XP, Arc Info, C/C++, OO Database ArcGIS/Arc Info, XML.
