We provide IT Staff Augmentation Services!

Senior Researcher Data Mining Resume Profile

4.00/5 (Submit Your Rating)

Professional Summary

Computer scientist with over 25 years of experience in informatics data - mining and analytics and software engineering and more than 10 years of experience in directing the research development and application of new computational and data technologies across a variety of computer and biomedical areas.

Past Positions

2012-2013 Executive Director Interim Confidential

Led the formation of a Consortium aimed at supporting the adoption and sustainability of the iRODS data middleware technology. During my period as Director I was responsible for establishing the business and staffing structure of the Consortium developing Consortium bylaws and charters and recruitment of initial members. Of significance we were able to form the Consortium with RENCI the DICE Group at UNC/UCSD and the Max Planck Society as founding members.

Director of Biomedical Informatics Confidential

Formed and led the BioMedical Informatics Program at RENCI one of the two domain focuses for RENCI. As Director held responsibility for setting the program directions building collaborations securing funding managing projects and aiding in the development of bioinformatics applications systems and tools.

Senior Researcher Data Mining Confidential

Researcher responsible for developing collaborations with University researchers and working on and leading projects in the areas of bioinformatics medical informatics and health informatics.

Senior Computer Scientist BD Technologies Confidential

Lead computer scientist and software architect in the Bioinformatics Group. Responsible for directing product planning and development managing software developers designing software architectures developing algorithms and leading software implementation and testing efforts.

Senior Architect/Director of Software Confidential

Directed development of software and e-commerce offerings centered around data-mining and pattern recognition provided technical management of junior staff and conducted statistical and data-mining consulting for multiple Fortune 500 corporate clients.

Senior Scientist Dendrite International Confidential

Led the development of software systems for optimization of marketing and sales efforts. Consulted with clients seeking statistical analysis and data-mining to improve marketing and sales efforts. Prominent clients included Abbott Labs Pfizer Merck and GlaxoSmithKline.

Independent Consultant Confidential

Provided data-mining and software development services for clients. Specific projects included neural networks for fraud detection client YellowBrick click-stream analysis for web-sites client GrayStone LLC forecasting models for financial investing client DB S and data-entry software for laboratory research client UNC-CH .

Research Assistant Confidential

Investigated and developed computational models of human visual processing based on neural networks and statistical models. Conducted research on image analysis and pattern recognition algorithms for the detection and recognition of moving objects. Developed simulation and mathematical software for neural networks image analysis computer vision and nonlinear dynamical systems.

Software Engineer Confidential

Software engineer on four versions of the IBM Communications Manager software system. Participated in all stages of the software development cycle. Involved in ISO-9000 compliance processes. Specific projects included ISDN X.25 LAN SDLC Twinaxial communications protocols and user-interface software.

Software Technician Confidential

Developed software to aid in the design and development of telecommunications hardware.

Technical Reports

Owen P Shoffner M Wang X Schmitt CP Lamm B Mostafa J. TR-11-01 Secure Medical Research Workspace 2011 RENCI Tech. Reports 2011.

Wilhelmsen K Schmitt CP Fecho K. TR-13-03 Factors Influencing Data Archival of Large-scale Genomic Data Sets.2013 RENCI Tech. Reports 2013.

Owen P Ahalt S Berg J Coyle J Evans J Fecho K Gillis D Schmitt CP Young D Wilhelmsen K. TR-14-02 The GMW A Genetic Medical Workflow Engine. 2013 RENCI Tech. Reports 2013.

Reilly J Ahalt S Fecho K Jones C McGee J Roach J Schmitt CP Wilhelmsen K. TR-14-03 MaPSeq A Computational and Analytical Workflow Manager for Downstream Genomic Sequencing. 2013 RENCI Tech. Reports 2013.

Bizon C Ahalt S Fecho K Nassar N Schmitt C Scott E Wilhelmsen K. TR-14-04 CANVAS and AnnoBot Solutions for Genomic Variant Annotation. 2013 RENCI Tech. Reports 2013.

Schmitt C Maher S. The Evaluation and Optimization of Face Recognition Performance Using Experimental Design and Synthetic Images. 2011 June . Prepared by RTI International Institute for Homeland Security Solutions under contract HSHQDC-08-C-00100 .

White Papers

Schmitt C Shoffner M Owens P Wang X Lamm B Mostafa J Barker M Krishnamurthy A Wilhelmsen K Ahalt S Fecho K. Security and Privacy in the Era of Big Data - The SMW a Technological Solution to the Challenge of Data Leakage. RENCI White Paper Series. 2013.

Schmitt C Wilhelmsen K Krishnaumurthy A Ahalt S Fecho K. Security and Privacy in the Era of Big Data - iRODS a Technological Solution to the Challenge of Implementing Security and Privacy Policies and Procedures. RENCI White Paper Series. 2013

Project Highlights

Visual Decision Link Development of visualization and data mining techniques to provide medical decision support capabilities for clinicians at the point of care. Developed approaches allow clinicians to compare presenting patients to comparative populations and clinical guidelines to better determine treatment options. Focus on major depressive disorders and epilepsy. System under evaluation to determine effectiveness as part of a NIH R21 award and NSF award.

Role Investigator Mentor

Date 2010-current

Technologies Processing Java MS SQL Server SAS JMP custom developed data-mining routines in Java

Confidential

Role Lead Developer and Researcher

Computing over Software Defined Networks Research and development into new ways to run computational workflows and data grids on software defined networks through cloud-bursting approaches. This project won the Corporation for Education Network Initiatives in California 2013 award for Innovations in Networking.

Role Co-Principal Investigator

Date 2012-current

Technologies C/C Java ORCA Geni iRODS Virtualization OpenFlow OpenStack Pegasus Condor

Confidential

Role Facilities and Operations Director

Confidential

Role Principal Investigator

Technologies C/C Java Python JavaScript PostgreSQL Unix tool sets Linux Windows GForge Hudson Eucalyptus node.js

Informatics for High Throughput Genomic Sequencing Collaboration with UNC Genetics Department to develop and operate an IT system for the processing management and analysis of next generation genomic sequence data for clinical research basic research and clinical care. Deployed in late 2010 the system has been used to assemble and detect mutations in 1000 human genomes manages the storage and analysis of close to 3000 human genomes and manages close to 500Tb of genomic data as of 06/2012 .

Role Technical Director Investigator

Confidential

Technologies High Performance Computing Linux-based compute clusters Open Science Grid Pegasus Condor Globus SeqWare LSF PBS Storage Hadoop HDFS Pig Hive PostgreSQL iRODS Software Development Java Python Perl Bioinformatics BWA Picard GATK SAMtools Blast/Blat PolyPhen plus various bioinformatics tools and databases

Integration of Genomics into Clinical Care NCGenes Collaboration with UNC Hospital and School of Medicine to develop and deploy a system for integrating high throughput genomic sequencing into clinical care at UNC. The system manages an entire IT workflow that includes patient entry into a clinical trial blood draw and sample processing genomic data processing and analysis identification of clinically relevant genetic variants independent confirmation of results by a clinically approved lab and reporting of results to clinicians and patients.

Role Initial Technical Director Investigator

Confidential

Technologies High Performance Computing Linux-based compute clusters Open Science Grid Pegasus Condor Globus SeqWare LSF PBS Storage RedCap Hadoop HDFS Pig Hive PostgreSQL MS SQL Server iRODS Software Development PHP Java Python Perl Bioinformatics BWA Picard GATK SAMtools Blast/Blat PolyPhen plus various bioinformatics tools and databases

Secure Medical Workspace Prototype technology developed with the UNC Medical School focused on enabling research on patient data while limiting the risk of data loss and leakage. System deployed into production use at UNC in 2011.

Role Technical Lead

Confidential

Technologies VMWare Virtual Computing Lab Citrix Data Leakage Protect Microsoft File System Filter Driver C

Characterization of Video-based Biometrics Research effort to predict the performance of face recognition from video algorithms under different operating and environmental characteristics by modeling the algorithms in simulated virtual environments.

Role Principal Investigator

Confidential

Technologies C/C Matlab various 3-D modeling packages

NADIA Understanding alcohol effects on adolescents Development of web site database and analysis reports in support of a research consortium focused on studying alcohol impacts using a variety of neurobiology molecular and genetic techniques.

Role Technical Lead

Confidential

Technologies MS SQL Server PHP WordPress SAS JMP

Early Detection of Death in Intensive Care Units Collaborated with the UNC Medical School to develop an early warning system for patients at risk of death in pediatric and neonatal intensive care units. System integrates real-time streaming monitor data with patient electronic medical records and uses predictive non-linear algorithms to calculate risk. System being marketed by Realtromins Inc.

Role Technical Lead Advisor

Confidential

Technologies MS SQL Server SAS Enterprise Miner Matlab Cognos Java Java Message Service Unix tool sets HL7

Denovo Genomic Sequence Assembly Worked with the UNC Department of Biology to enhance the speed of the VCAKE genome sequence assembly software. Through recoding the performance was improved over 40x.

Role Technical Lead Developer

Confidential

Technologies Perl C/C

Locating Farmer s Markets As part of a research grant to understand issues around food sustainability researched and developed a geographical approach based on Huff retail trade models to determine best locations for locating or relocating farmer s markets.

Role Technical Lead Developer

Date 2009-2011

Technologies R SAS JMP GRASS ArcGIS MongoDB

Image-analysis Approaches to Detect Melanoma Collaboration with UNC Medical School to develop image-analysis approaches to detect melanoma from pathology slides.

Role Initial Technical Lead

Confidential

Technologies MatLab

Laboratory Information Management Systems for Life Science Research Led a small software team in developing the initial Multiwell Plate Manager MPM system for BD Technologies. Joined BD to further develop the system and tailor it for supporting an internal R D effort around tissue and cellular engineering. System upgraded over multiple years to include capabilities for microarray capture and analysis federating databases across labs cellular signaling pathways using semantic technologies representing and using ontologies automated statistical analysis meta-analysis and data-mining and data normalization visualization and quality control. System has been in use for over 10 years at BD in multiple labs and manages multiple terabytes of life science data and images.

Role Lead Computer Scientist and Technnical Architect

Confidential

Technologies Java JavaSpaces J2EE Oracle Relational Database Management System R

Commercial Laboratory Information Management Systems Worked with BD Technologies to commercialize the MPM technology. The initial system was sold as BD Gentest MPM/ADMET. A version was developed for measuring chemical solubility and sold as MPM SolubilityTM a software system for the BD GentestTM Solubility Scanner. Aspects of the technology were incorporated into the BD Multiwell AutoSampler system sold with BD Flow Cytometers. A Good Manufacturing Practices validated version meeting 21 CFR Part 11 was developed and used to control BD LyoplateTM manufacturing business a multi-million dollar service business. As part of this work new algorithms for optimizing robotic scheduling to minimize pipetting time and reagent use based on quadratic programming were developed.

Role Lead Computer Scientist and Technnical Architect

Confidential

Technologies Java JavaSpaces J2EE Oracle Relational Database Management System R

Data-Mining for Business Analytics Through work at Yankelovich Analytika and private contracting provided statistical consulting and data-mining work for multiple corporate clients. Work included building predictive models and classifiers using a variety of approaches including linear logistic ridge and MARS regression decision and classification trees support vector machines neural networks and genetic algorithms. Clients included multiple Fortune 500 clients including Best Buy Sam s Club Ericsson Pfizer GSK Merck and Abbott Labs.

Role Consultant

Confidential

Technologies Java C/C R SPSS

Customer Relationship WebSite Technical lead on the development of a web-based customer-relationship management tool for Ericsson. Responsibilities included developing the system design and architecture developing data-mining approaches for analyzing customer web-traffic data web-site implementation and mentoring of junior developers.

Role Technical lead web-analytics mentor for junior developer

Confidential

Technologies Java C/C R SPSS

Alerting system for pharmaceutical saleforeces Proposed and developed the Scripmax SentryTM product for alerting pharmaceutical sales divisions of potential changes in market conditions related to product sales. Developed predictive algorithms based on time-series analysis and pattern recognition. Product marketed by Analytika Inc until bought by Dendrite Int.

Role Technical Lead and Analyst

Confidential

Technologies Java Oracle Relational Database Management System

Optimizing salesforce allocation Led the development of the Scripmax OptimizerTM product for optimizing the re-allocation of pharmaceutical sales forces across product offerings. Developed algorithms based on a hybrid model combining expert rules and techniques for portfolio optimization. Product marketed by Analytika Inc until bought by Dendrite Int.

Role Technical Lead and Analyst

Confidential

Technologies Java Oracle Relational Database Management System

Predicting pharmaceutical market adoption Aided in the development of analysis and software for the ScriptMax HQ product which provided predictions of market penetration for pharmaceuticals. Product marketed by Analytika Inc until bought by Dendrite Int.

Role Analyst

Confidential

Technologies Java Oracle Relational Database Management System

Development of telecommunications software Through work at IBM and Millidyne Inc worked on several commericial telecommunications products including four versions of the IBM OS/2 Communications Manager platform. Work involved all stages of the software development cycle and involvement in ISO-9000 compliance. Specific work involved ISDN X.25 LAN SDLC and Twinaxial communications protocols as well as user interface software.

Role Software Technician Software Developer Software Engineer

Confidential

Technologies C/C Assembly PLS286

We'd love your feedback!