Modeling with Hub Central
Overview
- Load/Ingest your raw data into MarkLogic Server.
- Create an entity model to standardize your data fields.
- Map the fields in your raw data to the fields of the entity model.
- (Optional) Match and merge duplicates.
The entity model is a core component of data integration in MarkLogic Data Hub. It defines the standard structures (entity types) that would be populated with values from your raw data, so that your data components can be accessed uniformly regardless of the format and structure of the source.
The entity type is comprised of entity properties, which can be of any of the following types:
- A basic data type, including integer, string, dateTime, boolean, and other less common string, number, and date types (under More string types, More number types, and More date types).
- A structured type, which is comprised of its own properties, which can also be of other structured types. Use a structured type if the values of its properties will change for every entity instance. For example, the property
FullName
of the entityEmployee
could be a structured type, because each employee would have a different full name.The depth of nested structured types is not limited.
- A relationship type, which links to an entity of the selected type. Use a relationship type if the properties of the target entity must be the same for all entities that point to it. For example, the entity
Order
could point to the entityProduct
as a relationship type, because the SKU and product description will be the same for all customer orders.
Learn more: Entities
Security
You must be assigned the following security roles:
- To view, create, edit, or delete an entity model: Hub Central Modeler
Or any role that inherits the required role. See Users and Roles.
Modeling Process
To integrate your data,
- To create an entity type, see Create an Entity Type.
- Use the entity type to curate your data.
Managing Entity Types
To edit an entity type, see Edit Entity Type.
To manage the properties of an entity type, see Manage Entity Properties.
To publish changes,
- Go to the Model area of Hub Central.
- To publish all your changes to all entity types, click .
To delete an entity type,
- Go to the Model area of Hub Central.
- In the list, click the Delete icon () for the entity type to delete.
If an entity type has already been used to map or match and merge data, then modifying or deleting it might trigger a reindexing of all your curated data and might affect the results of mapping and mastering processes that are occurring concurrently.
MarkLogic recommends scheduling a time to modify or delete used entity types when the impact would be minimal.