We provide IT Staff Augmentation Services!

Data Analytics & Machine Learning Solutions Architect Resume

3.00/5 (Submit Your Rating)

New, JerseY

SUMMARY:

  • 30 years' experience in successfully delivering clear and simple solutions to very complex data management challenges as Confidential consultant leading fixed length data management engagements typically staffed by Confidential mix of on - shore and off-shore resources.
  • Focused on delivering executives answers to very complex business questions using Confidential mix of analytics, data science, machine learning and natural language processing (NLP) of huge amounts of unstructured data collected from many sources.
  • Data Science Machine Learning Python Anaconda Computational Statistics Enterprise Information Architecture
  • AWS, Azure & Google Compute Cloud Servers ClearNLP NLTK Ontology and RDF Design (OWL, Turtle, N-Triple, etc.)
  • Data Vault Architecture Master Data Management / CDM R Studio Neo4j Graph Databases Tableau TOGAF
  • Conceived of and authored Chameleon Metadata & Data Quality System

EXPERIENCE:

Confidential, New Jersey

Data Analytics & Machine Learning Solutions Architect

Responsibilities:

  • Educate practice leads on enterprise and data vault ETL modeling, master data management; big data; data governance and metadata management; data science; and semantic data.
  • Designed and deployed data science infrastructure which ingested, parsed, classified and indexed unstructured content from public government records using Python, BeautifulSoup, Natural Language Tool Kit (NLTK), Neo4j, MySQL and ElasticSearch. Master Data Management done using OpenRefine (formerly Google Refine)

Confidential, New York

Data Analytics & Machine Learning Solutions Architect

Responsibilities:

  • Designed and deployed Confidential new Data Science infrastructure and led an India-based development team. The new system successfully delivers Natural Language Processing (NLP) capabilities for ingesting, parsing, classifying and indexing unstructured content from legal documents.
  • Ingestion of the ‘raw’ documents uses Confidential Data Vault Hub, Spoke, Link approach and is stored in Cassandra. k-Nearest Neighbor (kNN) and Support Vector Machine classifications done using Python Anaconda’s Natural Language Toolkit (NLTK). Tableau, with R Studio, were used for computational statistics & visualizations.
  • Relationship visualizations done using Neo4j. Metadata management, data stewardship and automatic generation of Neo4j Cypher statements was done using the Chameleon Metadata approach.

Confidential, Massachusetts

Data Science & Master Data Management Solutions Architect

Responsibilities:

  • This project successfully delivered, in just five months, an open-source alternative to their planned deployment of an $8M Oracle MDM Suite not including maintenance. Total software cost for my solution was $2,350.
  • Google Refine used for data clustering and cleanup. MonkeyLearn API’s used for legal entity extraction, location extraction, content sentiment analysis. Confidential Data Vault ETL staging area ensures complete auditability.
  • Non-technical users are now able to explore enterprise data with Confidential polyglot persistence approach using Neo4j, and MySQL as its graph (DAG) and relational databases. Unstructured data was stored using Cassandra and Hadoop for “small record & rapid arrival” and “large record & slow arrival” data arrival, respectively.

Confidential

Lead Integration Architect

Responsibilities:

  • Design the information integration architecture for their John Hancock subsidiary as Phase-I of their initial Master Data Management effort for Party data for several lines of business on IBM’s MDM Server (MDMS) V11.
  • Managed Confidential Malaysia-based ETL team of ManuLife employees developing Informatica PowerCenter Workflows.
  • Designed and modeled Confidential ‘Heavy Onboarding Footprint’ ETL ecosystem using Confidential Data Vault data model for aligning incoming data to the IBM/MDMS RDF definitions. The graph database nodes, relationships, properties and OWL ontologies were documented using IHMC CMAP Knowledge Modeling Kit.
  • Another ‘post-MDM RDF’ was used to populate Hadoop 1.2.1, graph databases (Neo4j) and NoSQL stores.

Confidential, New Jersey

Analytics Specialist

Responsibilities:

  • Provide design enhancement recommendations for an analytics environment staging via Data Vault stores (i.e. Hubs, Links and Satellites) atop Terradata.

Confidential

Metadata Specialist

Responsibilities:

  • Performed Confidential data quality and analytics assessment focused on: Financial and Compliance Services; Risk Management and Compliance; Product Development; and Global Account Management.
  • Technical landscape included SAP (ECC, SD, CRM, BP, BI BOBJ and BW), MicroStrategy reporting and Microsoft TFS for service design and delivery management

Confidential, New York

Hadoop Big Data Architect

Responsibilities:

  • Captured existing business processes, lineage and metadata as ‘RDF Triples’ (i.e. Subject Predicate Object) for their new Hadoop 0.23.1, HDFS, HIVE and HBASE data stores.
  • Designed Object-Oriented RDF using CmapTools 5.0.03 to segment incoming source data and UDEF-based RDF Schema aligned to W3C XSD 1.1 Part 2 datatypes for organizing captured domains and ranges and linking to source system URI’s.

Confidential, New York & Massachusetts

Enterprise Information Management and Governance Design

Responsibilities:

  • Guided Confidential subject matter experts through definition and documenting of business processes and required execution sequences using Directed Acyclic Graphs as RDF Triples to represent the lifecycles of Confidential information-based products from data vendor to ultimate consumer and its compliance with source data vendor contract agreements.
  • Designed the Exchange-to-Hadoop Business, Information and Technical architectures for any data lifecycle.
  • Standardized the valid Linked Data value pairs and their relationship(s) to URI’s and business processes via Object-Relational Mappings associating them to available Confidential GS1 Global Product Classification ( Confidential ) Business Object Documents ( Confidential ’s). And, where possible, linked ISO 10383 identifiers for source data vendors.
  • Created Confidential Corporate Product Information Ontology using the Florida Institute for Machine & Human Cognition (IMHC) CmapTools Knowledge Modeling Kit.
  • Created Confidential -based workflow management, role-based product entitlements and task-level audit metrics to capture data product information management (PIM) knowledge.

Confidential, Connecticut, United States

Management Consulting / Enterprise Architect

Responsibilities:

  • Assessed the Confidential Research business and information architectures in anticipation the rapid expansion expected as Confidential result of Confidential growth strategy relying on both organic and Confidential & Confidential strategies.
  • Detailed assessment of and suggested improvements to their proposed business & information architectures

Confidential, New York

Management Consultant

Responsibilities:

  • Delivered new multi-channel marketing (MCM) capabilities which allowed Confidential to capture multiple customer interactions from multiple channels and understand when, how and to whom they were related.
  • Designed Confidential framework which aggregated master, analytics and real-time data (via Informatica Confidential ) atop MEGA Database Builder and ITSM Accelerator knowledge-sharing environment.
  • Confidential Business Process and Information Architecture which Confidential ’s subject matter experts agreed was perfect

Confidential, New Jersey

Management Consulting / Enterprise Architect

Responsibilities:

  • Deployed Confidential Proof-of-Concept IBM MDM Server hub for PARTY assuming PATIENT & PHYSICIAN roles.
  • Designed the first strategic roadmap for this newly created MDM center-of-excellence (COE) organization as well as the logical, physical, canonical, information flow and process models.
  • Designed and deployed proof-of-concept data governance and metadata portal allowing the design, deployment and reuse of automated business rules engine integration (PegaRULES, Corticon, etc.).

Confidential, Ohio

Master Data Architect

Responsibilities:

  • Designed reusable-process strategy to increase predictability and efficiency of the Production (Conceive/Design/Produce/Deploy/Service) and Customer-interaction (Campaign/Order/Cash/Care) lifecycles.
  • Designed conceptual and logical canonical product information master (PIM) models.
  • Designed business process architecture using Oracle’s AIA PIM Hub and the Siperian UCM tool and roadmap for migrating canonical PARTY/CUSTOMER from legacy Oracle Trading Community Architecture (TCA) to Oracle AIA.

Confidential, New Jersey

Programme Lead / Chief Solutions Architect

Responsibilities:

  • Deployed proof-of-concept MDM systems (profiling: Trillium, metadata: CA Repository, ETL: IBM DataStage)
  • Led senior business and IT executives through the identification, consensus building and project planning phases of this MDM initiative under standards of the Capability Maturity Model (CMM).

We'd love your feedback!