Mapping

Overview of mapping in Data Hub.

About Model-to-Model Mapping

A data model defines how data is structured. Each field in a dataset corresponds to a property in the data model, even if the data model is not explicitly defined.

Your source datasets might have data models that are different from each other. For example, one dataset might have a field called family-name and another might call the same field surname.

Model-to-model mapping associates the fields (properties) of your source dataset's data model with the properties of a standardized data model, so that you can access the data the same way regardless of its source.

For example, you can create the property lastname in your standardized data model and map both family-name and surname to it. A request for the value of lastname then returns the correct value whether the source field is called family-name or surname.
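The idea can be sketched in a few lines of Python. This is only an illustration of the concept, not how Data Hub implements mapping; the names MAPPINGS, to_entity, source_a, and source_b are hypothetical, and the sample values are made up.

```python
# Hypothetical per-source mappings: standardized property -> source field name.
# One mapping exists for each source data model; all target the same model.
MAPPINGS = {
    "source_a": {"lastname": "family-name"},
    "source_b": {"lastname": "surname"},
}


def to_entity(record: dict, source: str) -> dict:
    """Reshape a source record into the standardized model."""
    mapping = MAPPINGS[source]
    # Look up each standardized property in the source record by its
    # source-specific field name; missing fields become None.
    return {prop: record.get(field) for prop, field in mapping.items()}


a = to_entity({"family-name": "Nguyen"}, "source_a")
b = to_entity({"surname": "Okafor"}, "source_b")
print(a["lastname"], b["lastname"])
```

Both records end up with a lastname property, so code that reads the standardized model does not need to know which source a record came from.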

Mapping in MarkLogic Data Hub

In Data Hub, the standardized model is an entity model. The entity model is a canonical representation of your entity or business object.

To configure a mapping, you need:

  • An entity model, which you create.
  • At least one record, which you ingest from your source dataset.

Note: If you have multiple source datasets with different data models, you must create one mapping for each source data model against the same entity model.

The easiest way to configure a mapping is through QuickStart. QuickStart chooses an arbitrary ingested record from the STAGING database to determine the source fields that can be mapped against the entity model properties. To generate the list of source fields from a different record, select another ingested record.
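To see why the choice of sample record matters, consider a rough Python sketch of deriving a field list from a single record. It is not QuickStart's implementation; the function source_fields, the "/" path separator, and the sample records are assumptions made for illustration.

```python
from typing import List


def source_fields(record: dict, prefix: str = "") -> List[str]:
    """List the field paths in one sample record, descending into nested objects."""
    fields = []
    for name, value in record.items():
        path = f"{prefix}/{name}" if prefix else name
        if isinstance(value, dict):
            # Nested object: collect its fields under the parent path.
            fields.extend(source_fields(value, path))
        else:
            fields.append(path)
    return fields


# Two hypothetical records from the same source dataset.
sparse = {"id": 7, "family-name": "Nguyen"}
fuller = {"id": 8, "family-name": "Okafor", "address": {"city": "Lagos"}}

print(source_fields(sparse))
print(source_fields(fuller))
```

In this sketch, the second record exposes an address/city field that the first lacks. A field list derived from one record can only contain the fields that record has, so a record that includes every field you intend to map is the more useful sample.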