Data Architect / Principal Developer Resume
Washington, DC
SUMMARY:
- Highly motivated professional with 25+ years in database, data architecture and software development.
- Strong mathematical, computer science and analytical skills. Awards for surpassing Silicon Graphics, IBM, Tableau and MicroStrategy solutions.
- Extensive experience in database-driven applications and high-performance, web, security and automated systems.
- Innovations in digital filtering, image processing and computer graphics.
- Looking for a challenging position as data architect or software developer.
- 2nd Level DHS Security Clearance, SEC Clearance, IRS MBI clearance
TECHNICAL SKILLS:
Database development: Oracle, Netezza, Greenplum, Postgres, MS SQL, DB2, MySQL, Informix.
Software: C#, C, C++, Java, JavaScript, Perl, Python, Shell
Operating Systems: MS Windows XP, UNIX (AIX 5.3, Linux 2.6, CentOS Linux, SunOS 5.10, IRIX, RHEL)
Development Environment: Microsoft Visual Studio 2005, 2008, 2010, NetBeans 8.0.2, Eclipse 2011, IntelliJ IDEA 10.0.3, Informatica PowerCenter (IPC), IBM InfoSphere DataStage 8.5, SAS 9.2
Web Development: IIS, HTTP, ASP.NET
PROFESSIONAL EXPERIENCE:
Data Architect / Principal Developer
Confidential (Washington, DC)
Responsibilities:
- Created innovative database model for TAQ (trade and quote) analytics operations.
- Developed expert-level Netezza application including 250+ TAQ daily and intraday analytics and data conversion functions and procedures, outperforming existing SEC Perl- and SAS-based tools.
- Implemented fraud detection algorithms to track unsanctioned buy and sell trades, illegal stock exchange activities, and overstated and unlawfully arranged stocks.
- Constructed, compiled and published a Netezza User Defined Function (UDX, C++) library to reorganize operations on trillion-record SEC databases.
- Created unmatched UDX trade and quote validation and filter functions.
- Designed interactive algorithms to detect prevailing quotes and match them to trades with granularity as fine as one nanosecond.
- Developed fast procedures to build complete NBBO (National Best Bid and Offer) quotes.
- Refactored slow Netezza window functions to execute TAQ analytics functions in real time.
- Devised high-performance algorithms to associate billions of trades with future NBBOs under initially undefined time shifts, cutting execution time from hours to seconds.
- Developed automated cron jobs to independently run multithreaded daily database conversion and upload.
- Investigated production data leaks and reported them to SEC security.
Data Architect / Principal Developer
Confidential (Lorton, VA)
Responsibilities:
- Created automated system to compare and improve execution plans for 11g and 12c Oracle databases.
- Developed fast algorithms to reconstruct principal DHS ADIS tables with numerous referential integrity constraints.
- Resolved a critical problem with remodeling the ADIS prime table, which has kept track of the US nationwide passenger list for 15+ years.
- Developed algorithms to upload tables with referential constraints based on efficient bulk collect and “hint” techniques not previously used in the DHS environment. Improved performance from 40+ hours to 10 minutes.
- Completed about 20 development “stories” in a short time and improved ADIS application functional quality.
- Created innovative solution to deliver the “Get Travel History” project using a parallel development technique. Individually completed the majority of database tasks in a few Sprints instead of a lengthy development cycle.
- Simplified dysfunctional DHS/ADIS architecture solutions. Modified convoluted, never-ending pivot-based procedures and views to run in seconds.
- Resolved a number of outstanding performance problems.
Principal Data Architect
Confidential (Ashburn, VA)
Responsibilities:
- Developed Java-based data conversion applications to migrate data between popular databases including Greenplum, Netezza, Oracle, DB2, MS SQL and others.
- Automated DDL, DML and data conversion scripts for data extraction, transformation and upload.
- Developed innovative algorithms that substantially accelerated database conversion through effective MPP architecture, high-performance collocated joins, optimized partitioning and fast multithreaded data upload.
- Developed Data Architecture for multi-platform efficient data transformation, retrieval and maintenance.
Data Architect
Confidential (New Carrollton, MD)
Responsibilities:
- Discovered inherent problems in RRP and CADE2 projects developed by IBM.
- Developed innovative ETL and data injection algorithms for "real time" database upload and updates.
- Completely refactored Rule Execution Analytics code, reducing execution time from 30+ hours for an incomplete run to 12 minutes.
- Delivered code that generates 440 CADE2 DB2 FRR reports automatically each day, compared to a few months of manual labor.
- Developed algorithms and demonstrated a 10+ times speedup in generating Discovery reports.
- Migrated and consolidated IRS EFDS, RRP and CADE2 Oracle and DB2 databases to Greenplum.
- Developed innovative Fraud Detection Analytics Data Mart to detect unlawful tax document filings.
- Provided Fraud Detection Analytics Data Mart access to authorized users.
- Implemented algorithms to track tax violations and check for duplicate tax returns and incorrect addresses, emails and phone numbers; investigated other criminal activities.
- The Fraud Data Mart was used to track illegal tax returns and helped the IRS recover millions of dollars yearly.
- Updated IRS LRA Tableau scripts to run Fraud Detection reports several times faster than the initial model.
- Discovered performance problems and provided recommendations to eliminate excessive table decomposition, data bloat, redundant tables and partitions, and inconsistent and duplicated data across multiple IRS sources.
- Developed applications to generate IRS CADE2 DB2 and Greenplum production size data for performance tests.
- Provided solutions for strategic IRS DB2, Oracle and Greenplum EFDS and RRP mainstream database consolidation.
- Generated automated tools to detect referential integrity violations across multiple IRS platforms for CADE2 DB2 databases. Developed high-performance referential integrity scripts.
- Developed algorithms to securely transfer sensitive data between production servers.
Data Architect
Confidential (New Carrollton, MD)
Responsibilities:
- Developed modern architecture and applications for multi-phase parallel data extraction and upload significantly speeding up legacy database conversion.
- Fundamentally improved the existing IBM/IRS high-complexity 1040 form conversion, cutting conversion time for nationwide US taxpayer data from 24 hours to 5 minutes.
- Migrated one of the largest IRS EFDS production databases.
- Identified and tuned slow-running SQL conversion scripts. Improved performance for numerous IRS forms.
- Moved billions of records (about 40 TB) of IRS legacy data to Greenplum environment.
- Fixed numerous Greenplum segment error bugs existing in dysfunctional IBM and IRS conversion code.
- Generated automated scripts to extract, clean up and convert 2000+ EFDS-RRP Oracle tables to Greenplum.
- Implemented algorithms to compare and match tax returns and track potential fraud.
- Created high-complexity ETL transformation functions for uploading FDC tables to Greenplum.
- Developed algorithms to prevent disclosure of sensitive information including personal income, SSN, document location numbers and other private identification data.
- Developed algorithms to predict IRS RRP data capacity trends.
Data Architect
Confidential (Kansas City, KS)
Responsibilities:
- Developed Confidential Netezza C++ UDF (User Defined Function) library.
- Implemented innovative high-complexity algorithms for non-linear mathematical equations, significantly improving UDF precision and performance.
- Provided solutions for ErlangB and ErlangC UDF functions used for detecting exact Earth locations of mobile and other electronic devices. Substantially surpassed existing Tableau and MicroStrategy solutions on the market.
- Managed and developed Confidential MS SQL and Oracle databases to Netezza data migration and consolidation.
- Implemented data migration from TF12 and TF72 to TF120 in a scope of Confidential modernization program.
- Delivered solutions for data migration from MS SQL, Oracle and Netezza TF72 to Netezza TF120 including 10 top volume Confidential databases.
- Completed IBM Streams (SPL) for Big Data multiprocessing execution.
- Investigated solutions for accelerating data upload to Confidential - FLUX and Reveal set of databases.
- Developed algorithms for extrapolating data capacity to calculate required database storage.
- Developed parallel algorithms for loading Netezza databases using IBM Streams.
Principal Software Developer, Data Architect
Confidential (Scranton, PA, Princeton, NJ, Boston, MA)
Responsibilities:
- Worked on multiple projects as Data Architect
- Developed efficient architecture solution to migrate multiple databases including MSSQL, DB2 and Oracle to Netezza and Greenplum.
- Consolidated and loaded Prudential, Bracebridge and Novo Nordisk data warehouses.
- Substantially increased development speed by constructing an automated process for the complete line of operations, including creating the target database and downloading, transferring and uploading data, taking advantage of parallel processing.
- Developed efficient algorithms for converting partitions, primary, unique and distribution keys.
- Converted Medicare and Medicaid Population tables from MS SQL and DB2 to Greenplum.
- Developed efficient backup and restore algorithms for Greenplum on different levels including various databases and schemas
Senior Software Engineer
Confidential (Rockville, MD)
Responsibilities:
- Developed algorithms for massively parallel processing databases - Netezza and Greenplum/PostgreSQL to convert and migrate production data warehouses received from NASDAQ, Boston and Philadelphia stock exchanges (volumes about 1 TB a day).
- Created automated algorithms for refactoring a legacy application, reducing existing code by up to 10 times and significantly increasing the reliability of ETL transformations.
- Developed efficient ETL algorithms by eliminating multi-million record temporary tables and using aliases with minimized record number data sets.
- Constructed automated algorithms for assembling and converting Data warehouse DDLs.
- Provided data modeling and improvements for NASDAQ data warehouses.
- Converted Oracle cursor-based functions to Netezza, taking advantage of massively parallel processing in Netezza User Defined Function (UDF) development.
- Implemented high level SQL transformations in UDFs to detect NASDAQ anonymous quotes.
Senior Software Engineer / Data Architect
Confidential (Vienna, VA)
Responsibilities:
- Developed algorithms for generating massive scale EDI messages used by CBP and ATS including all boundary CAMIR, ANSI, BAPLIE and CATAIR types for IBM Message Queue servers.
- Managed EDI automated development and test process.
- Provided M1 Informatica PowerCenter (IPC) development and conversion leadership.
- Substantially improved EDI message development process.
- Developed real-time C# and Java database-driven applications for large EDI message execution.
- Developed automated algorithms to extract and load data from Oracle, SAS and SAP to DB2 and Greenplum for D6, M1 and M2 Data warehouses.
- Migrated Customs Services legacy data center IBM mainframe databases to DB2.
- Detected and improved long-running SQL queries by creating optimal joins, table partitioning and refactoring.
- Developed triggers for optimal performance DB2 database conversion.
- Improved capacity and determined the extrapolated size of the DB2 production environment.
- M1/M21 Ocean/Rail/Air: executed more than 4,000 automated EDI test cases and detected about 250 outstanding defects.
- Discovered critical referential integrity problems for the whole set of ACE applications - D5, M1, M21.
- Developed data encryption and decryption algorithms for entire set of ACE projects including D6, M1, M2.1, D7.
- Devised a strategic approach for generating performance stress tests with production-size Oracle 10i and DB2 databases.
- Developed high-complexity performance stress tests for ATS S2TP, S2TF, D5, M1 and M21 releases.
- Created mathematical algorithms to detect critical defects in S2TP application by discovering the most inefficient queries.
- Developed Java and C# based applications for automated testing to enhance HP Quality Center and HP LoadRunner functionality.
Data Architect
Confidential (Arlington, VA)
Responsibilities:
- Developed consolidated database to migrate multiple Confidential databases (NADDIS) into one source.
- Investigated referential integrity problems in numerous Confidential production databases and created reports for top management.
- Designed data modeling algorithms for converting multiple CIA legacy databases to Oracle.
- Developed algorithms to encrypt sensitive Confidential data.
- Converted production data to prototype database.
Senior Software Engineer
Confidential (Vienna, VA, Washington, DC)
Responsibilities:
- Created a new design for critical parts of the Smart 3 application.
- Created CSOSA (Court Services and Offender Supervision Agency) Auto-Screener database.
- Migrated CSOSA production drug offender data.
- Developed high-complexity database-driven Auto-Screener Web Application.
Senior Software Engineer
Confidential (Reston, VA)
Responsibilities:
- Developed Web Services for new generation Progress eCommerce system.
- Improved functionality of web based trade system.
Senior Software Engineer
Confidential (Herndon, VA)
Responsibilities:
- Invented critical changes and implemented multiple applications including: In Process Control system, database driven Menu System, Object Editor, Menu Editor, Barcode Configuration Tool.
- Reduced convoluted C source code by 90%, from 30 KB to 3 KB, using an innovative object-oriented programming approach.
- Developed Study Manager used for drug investigation.
- Designed and implemented database-driven security applications.
- Created multidimensional high-complexity reports based on COGNOS.
- Architected, developed and maintained a large database including about 300 Oracle tables.
Software Developer
Confidential (Boston, MA)
Responsibilities:
- Created regression algorithm for fast calculation of special integral kernels.
- The algorithm outperformed the RGIVN routine from the Silicon Graphics IMSL library by 2.5 times (available upon request).
Software Developer
Confidential (Plainsboro, NJ)
Responsibilities:
- Developed algorithms for fast data processing of SQL requests for large Informix database in real time.
Software Development Leader
Confidential
Responsibilities:
- As the principal software developer, was responsible for developing and managing a database used for real-time processing and earthquake recognition.
- Created a database able to process about 0.5 GB a day of incoming data at 10 MHz frequency. Used innovative algorithms for data compression and restoration.
Programmer Analyst
Confidential
Responsibilities:
- Created multiple applications in different areas including:
Senior Scientist
Confidential
Responsibilities:
- Developed world-class algorithms for real-time digital filtering and high-quality image processing, outperforming multidimensional Fast Fourier, McClellan and Kaiser transformations.
- Developed digital filter software for multiprocessing image recognition systems.
- Created software library LIDA3 on digital filtering and image processing.
- The library was awarded multiple diplomas at industrial exhibitions and used in various national projects: space image processing, radio signal treatment, echo noise elimination, and high-quality band-pass digital filter design and implementation.
Scientist
Confidential
Responsibilities:
- Developed Library LIDA2 on data approximation and fast digital filtering.
- Completed PhD Degree in Mathematics and Computer Science.
