Data Engineer Resume
SUMMARY
- Nineteen years of experience building high performance data warehouse, MPP and distributed systems. Seventeen years of e - Commerce experience dealing with path analysis, log analysis and behavior analysis.
- Eight years working with Risk and Fraud Prevention teams, combating advertisement fraud, mail fraud, credit card fraud and internet auction fraud.
- Client-focused professional with a passion and persistence to meet clients’ specific business requirements; a commitment to exceed expectations; and a specialization in big data, self-service business intelligence, data discovery and visualization.
TECHNICAL SKILLS
Databases: Oracle EE, Red Brick Warehouse, Teradata, MySQL, Postgres, Vertica
Tools: (Modeling, ETL, BI): Ab Initio, Informatica, Teradata, Hadoop (HDFS Hive, Pig, Spark)
Programming: PERL, Python, Java, C/C++, SQL, bash/ksh, Pig, HQL, JavaScript, PHP, R
OS: Solaris, Linux
PROFESSIONAL EXPERIENCE
Confidential
Data Engineer
Responsibilities:
- Wholesaler Gross Net Sales, refactor. Refactored Vertical ETL code for identifying Quickbooks wholesalers.
- Data Ingestion from Hive to Vertica.
Tool: stack: Vertical, Hive, Jenkins, python, SQL and UNIX.
Confidential
Data Engineer
Responsibilities:
- Task: Proof of Concept. Could Clarity team deliver insightful inferences from Confidential 's power strip data, using HPE Vertica. Confidential concluded Clarity's Data Science and Data Engineering experience showed promise and a contract for was signed.
- Wrote functional design specifications for reporting portal based on MicroStrategy.
- Wrote graph mock-ups using Advanced SQL, JavaScript, Java and D3's visualization library.
- Used HPE Vertica for data integration using data from power strips, Nlyte (Data Center Infrastructure Management), rack-mounted temperature and humidity sensors, and public weather data.
- Analyzed data using exploratory data analysis and regression modeling, with RStudio and Vertica, on 2.8 billion records.
Tool: stack: R, JDBC, Java, JavaScript, Python, Advanced SQL, HPE Vertica, MicroStrategy.
Confidentia
Solution Architect
Responsibilities:
- Made RESTful endpoint for posting data to Confidential Groups, using internal libraries, in Hack (PHP).
- Wrote Python libraries for interfacing with Metric Insights RESTful API to extract content and publish to a Confidential Group.
- Used Metric Insights for data quality checks against Tableau, MicroStrategy, Vertica and Hive.
- Used Metric Insights for alert distribution on KPI metrics.
Tool: stack: Tableau, MicroStrategy, Hive, Vertica, PHP, python and bash.
Confidential
Manager
Responsibilities:
- Managed small team, of global resources, to support Risk's analytical servers and applications listed below.
- Worked with internal business partners to procure hardware, manage systems analytical systems and in general enabled users to succeed.
- Developed and managed small team of remote resources, to measure data quality, model execution, batch jobs and system management, using Hive, Python, PERL, bash, Java, Teradata and Pig.
- Worked directly with SAS to renew support license and conduct customer information seminars.
- Responsible for building free software packages (R libraries and Python packages, from source) to increase productivity for modeling team. Continuous integration using Jenkins.
- Deployed 60 servers running various large-scale applications for training and research, including Hortonworks v2.2, FICO Model Builder . Added non-standard products, such as Spark v1.4 (v1.2 was supported) and H2O v3.1.0.4.
Confidential
ETL Architect
Responsibilities:
- Internet Marketing: ETL Architect for Near Real Time (NRT) Fraud Detection application using Ab Initio, Oracle SQL and Tomcat. Conducted code reviews, design reviews and research. Wrote design documents and oversaw development by offshore resources.
- User Behavior Insights (UBI): Rewrote Ab Initio Continuous flows application in C++. Application used Tibco API to pull messages web event from bus and sessionized them, using gcc. POSIX threads and STL.
Confidential
Senior ETL Engineer
Responsibilities:
- Wrote Ab Initio graph to process steaming logs, produced from Roku player, to identify quality issues and analyze user behavior.
- Web Analytics. Drove design and development for a web analytics application. The application read XML files produced by Confidential 's web application server, organized hits into visits and summarized information for path analysis on sign-up and cancel flows.
- Fraud Detection. Merged routing information from USPS and User accounts, to uncover fraud rings.
Confidential
Senior Ab Initio Developer
Responsibilities:
- Account Setup Application. Designed and developed Ab Initio process to extract customer's billing information.
- Used Ab Initio's C++ API to integrate Confidential 's proprietor encryption algorithm into transformations for Customer Support and Vendor Reports.
- Team Lead, Kaiser Permanente. Senior resource on 12-person team to develop application for Part D Medicare reimbursement.
- Ab Initio Developer, Visa International. Designed and wrote transaction alert monitor in Ab Initio, ksh and DB2.
Confidential
Senior Software Engineer
Responsibilities:
- Data Modeler for core product's schema.
- Responsible for query and load performance, on Red Brick Warehouse and Oracle RDBMS.
- Designed Database Management Console and Application Status page using PERL CGIs and regex, on Apache.
- Maintained Java-based Query Builder application.
- Liaison between Customer Support and Product Development, and interfacing with high-value customers.