
Data Profiling and Mapping Suite: FAQ


The following section addresses Frequently Asked Questions regarding the Global IDs Data Profiling and Mapping Suite.

What is Data Profiling?

Can Data Profiling deal with redundant data?

What is Data Classification?

What is Data Mapping?

Does your product support a Vocabulary / Taxonomy / Ontology project? Does it support "semantic reconciliation"?

 

What is Data Profiling?

Data Profiling provides the ability to automatically scan the data associated with each attribute in the database environment.
The profiling process creates a large number of metrics for each attribute. Collectively, these metrics can create a "semantic understanding" of each attribute, allowing the software to relate it to all other attributes in the data landscape.
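
To make the idea of per-attribute metrics concrete, here is a minimal sketch in Python. It is not the product's implementation: the function name profile_column and the choice of metrics are assumptions made for illustration only.

```python
from collections import Counter

def profile_column(values):
    """Compute a few basic profile metrics for one column (hypothetical helper)."""
    # Treat None and blank strings as missing values.
    non_null = [v for v in values if v is not None and str(v).strip() != ""]
    lengths = [len(str(v)) for v in non_null]
    counts = Counter(non_null)
    return {
        "row_count": len(values),
        "null_count": len(values) - len(non_null),
        "distinct_count": len(counts),
        "min_length": min(lengths) if lengths else None,
        "max_length": max(lengths) if lengths else None,
        "most_common": counts.most_common(1)[0][0] if counts else None,
    }

# Illustrative sample data, not taken from any real system.
sample = ["NY", "CA", None, "NY", "", "TX"]
metrics = profile_column(sample)
print(metrics)
```

Metrics like these, collected for every attribute, are the kind of raw material that could then be compared across attributes.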


Can Data Profiling deal with redundant data?

Yes. The software has 10 different types of profiling built in, which perform in-depth statistical, quality-oriented analysis of data and generate quality metrics such as conformity, consistency, accuracy, and timeliness.


What is Data Classification?

Data Classification provides the ability to automatically identify information domains and group them together on the basis of similarity.
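
As a rough illustration of grouping by similarity, the sketch below clusters columns whose value sets overlap, using Jaccard similarity and a greedy pass. The similarity measure, the 0.5 threshold, and the names jaccard and group_columns are all assumptions for this example; the source does not say which technique the product uses.

```python
def jaccard(a, b):
    """Jaccard similarity of two collections of values (0.0 to 1.0)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)

def group_columns(columns, threshold=0.5):
    """Greedily group columns whose values resemble a group's first member."""
    groups = []
    for name, values in columns.items():
        for group in groups:
            representative = group[0]
            if jaccard(columns[representative], values) >= threshold:
                group.append(name)
                break
        else:
            # No existing group was similar enough, so start a new one.
            groups.append([name])
    return groups

# Hypothetical column names and values, for illustration only.
columns = {
    "customer.state": ["NY", "CA", "TX"],
    "vendor.region": ["NY", "CA", "TX", "WA"],
    "orders.status": ["open", "closed"],
}
groups = group_columns(columns)
print(groups)
```

Here the two geography-like columns land in one group while the status column forms its own, which is the sense in which similar information domains are grouped together.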


What is Data Mapping?

Data mapping is the process of creating data element mappings between two distinct data models. Data mapping is used as a first step for a wide variety of data integration tasks, including:

  • Data transformation between a data source and a destination
  • Identification of data relationships as part of data lineage analysis
  • Consolidation of multiple databases into a single database and identifying redundant columns of data for consolidation or elimination
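
A data element mapping of the kind described above can be pictured as a simple lookup from source fields to target fields. The following Python sketch assumes hypothetical field names (cust_nm, cust_st) and a hypothetical helper map_record; it only illustrates the concept and does not reflect how the product stores or applies mappings.

```python
# Hypothetical mapping from a source model's fields to a target model's fields.
FIELD_MAP = {
    "cust_nm": "customer_name",
    "cust_st": "state",
}

def map_record(source_row, field_map):
    """Rename mapped source fields to their target names; drop unmapped fields."""
    return {
        target: source_row[source]
        for source, target in field_map.items()
        if source in source_row
    }

row = {"cust_nm": "Acme", "cust_st": "NY", "legacy_flag": "Y"}
mapped = map_record(row, FIELD_MAP)
print(mapped)
```

The same mapping table could also be read in the other direction to trace which source field a target field came from, which is the basis of lineage analysis.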

Does your product support a Vocabulary / Taxonomy / Ontology project?

Yes and no. We support taxonomic representations of data through our Hierarchy Manager module. We do not support ontological representations of data, and we have very limited support for lexical representations. Semantic mapping and reconciliation is one of the core deliverables of our software and is supported.


Data Discovery Features

Data Discovery provides the ability to automatically scan and identify information assets within an organization. The DPS Product Suite offers the following features to aid the Data Discovery process:

#  | Requirement                                                         | Available | Comments
1  | Create a Metadata Repository without significant manual involvement | Yes       |
2  | Scan all structured databases of interest                           | Yes       | See database types supported
3  | Scan all unstructured data of interest                              | Yes       | See content formats supported
4  | Scan all semi-structured data                                       | No        | In development (beta). See supported formats
5  | Provide web access to metadata repository                           | Yes       |
6  | Profile the structured data                                         | Yes       | 10 types of profiling are supported. See details
7  | Automatically detect data quality problems in structured data       | Yes       |
8  | Create a data dictionary and glossary for critical business data    | Yes       | Requires some degree of manual input
9  | Continuously monitor the metadata environment for changes           | Yes       |
10 | Create metadata reports and profile reports on demand               | Yes       |

Based on customer requests, the Global IDs Product Roadmap also includes the following new features:
  • It can accomplish the above tasks with a minimum of manual involvement
  • It can scale to extremely complex environments (enterprise or global levels)
  • It can meet these requirements in systematic and repeatable ways