Data Science Solution Architect Resume
Kansas City, MO
SUMMARY:
- Confidential has a graduate degree in Advanced Analytics from the Statistics Department Confidential Texas A&M. He is certified in SAS and SPSS Modeler. He is a certified IBM Hadoop (BigInsights) developer. He has Spark experience. He successfully completed the databricks / edX / BerkeleyX class Scalable Machine Learning, which focused on Spark and Python. He has earned a in Programming for Data Science from the Institute for Statistics (statistics.com). He has also completed 7 of 12 required courses for a in Analytics for Data Science from statistics.com. Confidential has a PhD in Linguistics.
- He has a very strong background in data integration and ETL with 15 years experience designing and building complex end - to-end pipelines to extract data from multiple sources, cleanse and integrate it, and feed it into analytic systems 85% of any data mining / advanced analytics project is ETL. Confidential combines the skillsets of data engineering and data science.
TECHNICAL SKILLS:
Skills: Data Science / Data Engineering / Statistics / Machine Learning SAS Certified, SPSS Modeler Certified R, RStudio, ggplot2, plyr, markdown Python, NumPy, SciPy, Sci - KitLearn, Pandas, NLTK, Anaconda, Jupyter, Spark, PySpark, MLLib, SystemML, Predictive analytics, text analytics, Natural Language Processing, sentiment analysis, word2vec, LDA, time series analysis, machine learning, neural networks, data mining, multiple regression, logistic regression, decision trees, cluster analysis, k-means, random forests, gradient boosting, bootstrap, Support Vector Machines, etc. Big Data IBM BigInsights (Apache Hadoop) Certified Developer Hadoop, Hive, Spark, Solr, Python, PySpark, R, Scala AWS Certified Technical Professional InfoSphere Information Server Architect and develop DataStage and QualityStage solutions, versions 11.3, 9.2, 8.x, etc. Experience with entire range of IIS tools Blueprint Director, Business Glossary, Data Governance, FastTrack, Metadata Asset Manager, Data Studio, Data Architect Certified InfoSphere Data Governance
PROFESSIONAL EXPERIENCE:
Confidential
Data Science Solution Architect
Responsibilities:
- Build Scala pipelines for processing credit bureau XML data with Scala, Spark SQL, Spark DataFrames and Hive.Spark Scala ETL and EDA
- Projects done while employed by Mainline Information Systems: Sr. Data Scientist, Sr. Data Warehouse Architect / ETL ( )
Confidential
Responsibilities:
- Led team using R text analytics for topic discovery and document classification applied to internal incident reports. Data extraction, cleansing, text analytics, clustering, LDA.
Confidential
ETL Architect, Sr. Developer
Responsibilities:
- Process telephone directory listings for new EDW on Netezza. Data cleansing, standardization, and integration.
Confidential, Kansas City, MO
ETL Architect, Sr. Designer / Developer
Responsibilities:
- Architected and designed ETL for payment records from more than 50 US government agencies for the Payment Information Repository (PIR) Project for US Treasury. Data cleansing and standardization. Master record generation.
Confidential
ETL Developer / Designer
Responsibilities:
- Design, develop, ETL jobs to move all of Confidential &T’s mobile telephone records from Oracle to Teradata. Process 1.4M files per day.
Confidential
ETL Architect / Developer
Responsibilities:
- Banking. Convert legacy data feeds for credit card processing to the new enterprise data warehouse using IBM’s UDM - Banking.
Confidential
ETL Architect / Mentor
Responsibilities:
- Reviewed existing data integration jobs and made recommendations for improvement. Mentored new developers
Confidential
Grid Architect / Data Integration Architect
Responsibilities:
- Advise and develop prototype jobs for IBM GBS on Wal-marts InfoSphere MPP grid. Project pulled all sales records Confidential SKU level from all stores worldwide every night and fed data into JDA prediction engine which generated estimates of store sales Confidential the SKU level for all sales worldwide, for a 28 day moving window. Output predictions were then fed back into Confidential systems for POs, invoices, scheduling of receiving goods Confidential distribution centers, truck load optimization for shipments to individual stores
Confidential
Solution Architect
Responsibilities:
- Evaluated existing architecture and systems which move data from SAP BW to an EDW and from there to Hyperion Essbase for Canadian and US operations.
Confidential
Data Integration Architect / Designer / Team Lead / Developer
Responsibilities:
- Designed and led development of optimized ETL jobs to efficiently load data into world’s largest commercial database. .
Confidential
Data Integration Design and Development
Responsibilities:
- Integrate operational data from international sales stores into a unified view for management.
Confidential
Data Integration Architect / Designer / Team Lead / Developer
Responsibilities:
- Extract store operations financial data from Hyperion Essbase. Push into Confidential EDW.
Confidential
Data Integration Architect / Designer / Team Lead / Developer
Responsibilities:
- Designed and led the development of a ETL system to replace mainframe B2B system
Confidential
Data Integration Architect / Designer / Team Lead / Developer
Responsibilities:
- Architect ETL solution for new legal-inquiry data mart for prescription records from all pharmacies. Chain of handling tracking. Led sixteen member development team.
Confidential
Data Integration Architect / Designer / Team Lead / Developer
Responsibilities:
- ETL for POS records from retail stores into new EDW. Complex processing of pricing options and sales records.
Confidential
Data Integration Architect / Designer / Developer / Administrator
Responsibilities:
- Initiated development of ETL for new EDW using DataStage. Full life cycle of the EDW. Data marts for GL from Lawson and Peoplesoft. Feeds for Business Objects reporting and Hyperion Essbase. System admin. Mentor new users. 24 hour production support.
Confidential
Project Technical Lead / Architect / Designer / Developer
Responsibilities:
- Architected solution and led team to develop a secured property tax system for the San Diego County Assessor and Tax Collector.