Data Recognition Functional Layer

Semantically organize the data assets in the enterprise data ecosystem.

The product can improve data quality by evaluating the data the enterprise holds. It organizes data assets on the basis of their meaning (i.e. their semantic type). To do this, it must ‘recognize’ each column and assign it to a business concept that is represented in the data landscape. The software automatically creates a “bucket” for each unique concept category, or semantic domain, and places each column that it recognizes into the appropriate semantic domain.
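The bucketing step described above can be sketched as follows. This is a minimal illustration and not the product's implementation: the `recognize` and `bucket_columns` functions, the two regex rules, the column names, and the 80% match threshold are all assumptions made up for the example.

```python
import re
from collections import defaultdict

# Hypothetical rules: semantic domain name -> pattern a value must fully match.
RULES = {
    "currency_code": re.compile(r"[A-Z]{3}"),
    "us_postal_code": re.compile(r"\d{5}(-\d{4})?"),
}


def recognize(values, threshold=0.8):
    """Return the first domain whose rule matches at least `threshold`
    of the non-empty values, or None if no rule qualifies."""
    values = [v for v in values if v]
    if not values:
        return None
    for domain, pattern in RULES.items():
        hits = sum(1 for v in values if pattern.fullmatch(v))
        if hits / len(values) >= threshold:
            return domain
    return None


def bucket_columns(columns):
    """Group column names into one bucket per recognized semantic domain.
    Unrecognized columns are left out of the result."""
    buckets = defaultdict(list)
    for name, values in columns.items():
        domain = recognize(values)
        if domain is not None:
            buckets[domain].append(name)
    return dict(buckets)


# Made-up sample columns, keyed by a hypothetical table.column name.
columns = {
    "orders.ccy": ["USD", "EUR", "GBP"],
    "customers.zip": ["94105", "10001-0001", "60614"],
    "customers.notes": ["call back", "vip", ""],
}
print(bucket_columns(columns))
```

In this sketch a column joins a bucket only when most of its values satisfy the rule, so a free-text column such as `customers.notes` stays unassigned rather than being forced into a domain.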

The software recognizes a large number of domains using rules that are embedded in the software. These domains fall into four major categories:

  • Global domains like names, postal codes, currency codes, country codes
  • Industry-specific business domains like ISBN, CUSIP, GTIN, CLLI
  • Organization-specific business domains like customer_ids, product_types, vendor_categories
  • Use-case-specific domains like PII and PCI domains for security/privacy use cases, PII domains for HIPAA regulatory compliance, etc.
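To make the four categories concrete, here is a rough sketch of a rule registry with one rule per category. It does not reflect the product's embedded rules: the `DomainRule` class, the `classify` function, the `CUST-` identifier format, the sample country list, and the 16-digit card assumption are all hypothetical. The ISBN-13 check digit and the Luhn checksum are standard, publicly documented algorithms.

```python
import re
from dataclasses import dataclass
from typing import Callable


def is_isbn13(value: str) -> bool:
    """ISBN-13 check: digits weighted 1,3,1,3,... must sum to a multiple of 10."""
    digits = value.replace("-", "")
    if len(digits) != 13 or not (digits.isascii() and digits.isdigit()):
        return False
    total = sum(int(d) * (1 if i % 2 == 0 else 3) for i, d in enumerate(digits))
    return total % 10 == 0


def passes_luhn(value: str) -> bool:
    """Luhn checksum; this sketch assumes 16-digit card numbers for brevity."""
    digits = value.replace(" ", "").replace("-", "")
    if len(digits) != 16 or not (digits.isascii() and digits.isdigit()):
        return False
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit, counting from the right
            d = d * 2 - 9 if d * 2 > 9 else d * 2
        total += d
    return total % 10 == 0


# A small sample only, not the full ISO 3166-1 alpha-2 list.
SAMPLE_COUNTRY_CODES = {"US", "DE", "IN", "JP"}

# Hypothetical organization-specific identifier format.
CUSTOMER_ID = re.compile(r"CUST-\d{6}")


@dataclass(frozen=True)
class DomainRule:
    name: str
    category: str  # "global", "industry", "organization", or "use_case"
    matches: Callable[[str], bool]


RULES = [
    DomainRule("country_code", "global", lambda v: v in SAMPLE_COUNTRY_CODES),
    DomainRule("isbn13", "industry", is_isbn13),
    DomainRule("customer_id", "organization",
               lambda v: CUSTOMER_ID.fullmatch(v) is not None),
    DomainRule("payment_card_number", "use_case", passes_luhn),  # PCI-style domain
]


def classify(value: str):
    """Return (category, domain) pairs for every rule the value satisfies."""
    return [(r.category, r.name) for r in RULES if r.matches(value)]


for sample in ["US", "978-0-306-40615-7", "CUST-004217", "4111 1111 1111 1111"]:
    print(sample, classify(sample))
```

Returning every matching rule, rather than only the first, leaves room for a value to belong to more than one domain, for example a business identifier that is also tagged for a privacy use case.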

 

Modules:

Module                        Description
Taxonomy Manager              Creates domains and sub-domains, classifies them, and finds their lineage
Domain Classifier             Profiles columns by domain and organizes data under domains
Code Table Taxonomy Manager   Identifies and classifies code-based data domains
Subtype Taxonomy Manager      Identifies and classifies subtype domains
Subtable Domain Classifier    Profiles domains from subtables

 
