Data Engineer - Java/sql/nlp/javascript/ios/android Resume
Monterey, CA
SUMMARY:
- Data Science and Machine Learning
- Statistical Analysis
- Software Engineering and Database Development
- Data Modeling
TECHNICAL SKILLS:
Languages: Pig, Hive, Sqoop, MATLAB, R, Java, Python, C++, SQL, Spark, Scala
Platforms: Linux, Unix, Windows, Mac
Databases: HBase, MySQL, SQLServer, Vertica, JDBC
Others: Hadoop/MapReduce, HDFS, Github, Tableau, Subversion, CVS, J2EE, GIS
WORK EXPERIENCE:
Data Engineer - Java/SQL/NLP/Javascript/iOS/Android
Confidential, Monterey, CA
Responsibilities:
- Software development of foreign language learning applications with web and mobile interfaces. Used java, javascript, SQL, ionic, for web-browser, iOS, and android.
Solution Architect/Data Engineer - Python/NLP/Hadoop/Vertica/Kafka/Spark/Scala
Confidential, Sunnyvale, CA
Responsibilities:
- Big data projects including Kafka, Spark and Scala with Vertica as datastore. Developed scripts and SQL to automatically load data. Developed algorithms for processing of geospatial data. Wrote proposals and preliminary code for machine learning efforts to analyze text-from-speech and banking data.
Confidential, Campbell, CA
Responsibilities:
- NLP, machine learning, and sentiment analysis using Python and NLTK. Analysis and predic-tive modeling using Spark, MLlib. Analysis and implementation of large software data systems using AWS. Performance analysis of HP Vertica vs. RDBMS. Oozie development for systems using sqoop and hive.
Data Engineering - Hadoop/Java/HBase/Python/Tableau/R/AWS
Confidential, Aptos, CA
Responsibilities:
- Recommended climate data and loaded data into HDFS/Hadoop ecosystem including HBase. Queried Hbase with Pig scripts including UDF. Analyzed climate and aircraft engine data with SQL, R, and Tableau to analyze engine performance over time.
- Used Python SciKit-Learn random-forest classifier on bacterial species data. Proof-Of-Concept work using Hadoop technologies to show stakeholders the capabilities of Hadoop technologies and usage of AWS. Used Pig and Hive to query data and Matlab to visualize the data.
- Tutored college undergraduate and graduate students in Calculus, Physics, Statistics, R, and Matlab.
Data Analyst/Engineer - C++/Matlab/Linear Regression/Multivariate Analysis
Confidential, Sunnyvale, CA
Responsibilities:
- Project Lead - Developed C++ software to control, communicate with, and collect data from oceanographic sensors on an autonomous ocean-going surface vehicle (Wave Glider). This project integrated a magnetometer onto an ocean-going platform working with mechanical engineers, electrical engineers, manufacturing engineers, vendors, and end-user scientists. Project Lead - analyzed Acoustic-Doppler-Current-Profile data with Matlab and created a data report for Marketing/Sales.
Project Scientist - Perl/Matlab
Confidential, Concord, CA
Responsibilities:
- Analyzed ocean current velocity data using Matlab in support of Natural Resource Disaster Assessment (NRDA) of the Gulf of Mexico oil spill. Analysis included calculating summary statistics to characterize the ocean velocity structure and time-series analysis. These results were presented data in written reports and in oral presentations.
Research Associate - Java/Perl/C++/SQL/Matlab/PCA
Confidential, Aptos, CA
Responsibilities:
- Self-employed analyzing data and developing software under contracts with Monterey Bay area research institutions and government agencies as follows:
Confidential, Santa Cruz, CA
Responsibilities:
- Analyzed current meter data and temperature data for a variety of experiments using MATLAB and PCA on ocean density structure. Confidential School, Monterey, CA - Collected and analyzed data using MATLAB. Confidential (Monterey Bay Aquarium Research Institute), Moss Landing, CA - Wrote data analysis programs in Java, perl, and MATLAB for display of the data on the web. Using MATLAB analyzed bacteria data collected along southern California beaches.
Software Engineer - Java/C++/mySQL/Perl
Confidential, Monterey, CA
Responsibilities:
- Brought unusable beta software to operational state by increasing efficiency in code and removing memory leaks. This Java servlet project displayed meteorological model data on the web for Navy officers around the world. This project included parallel computing using a Queen/Hive configuration.
Software Engineer - Java/SQLServer
Confidential, Monterey, CA
Responsibilities:
- Developed software in Java for e-commerce website that included servlets and a SQLServer database.
Research Associate - C++/Perl/Java/NetCDF/Matlab
Confidential, Moss Landing, CA
Responsibilities:
- Developed software to collect and analyze data in C, perl, and Java. Accessed data storage types that included Relational Databases (used SQL) and NetCDF. Developed Matlab applications in support of oceanographic research for data storage, processing, analysis, and visualization. Prepared and deployed oceanographic instruments.
Data Analyst/Engineer - C/Matlab
Confidential, Monterey, CA
Responsibilities:
- Programmed in C to produce new mooring design tools and data analysis. Programmed in Matlab to produce new data analysis tools. Designed, prepared, and deployed oceanographic moorings and instruments. Performed data processing, analysis, and visualization in C and Matlab.
