Managing and Running Steps and Flows

Overview

A typical Data Hub data flow involves the following operations:
  1. Load/Ingest your raw data into MarkLogic Server.
  2. Create an entity model to standardize your data fields.
  3. Map the fields in your raw data to the fields of the entity model.
  4. (Optional) Match and merge duplicates.

Security

You must be assigned the following security roles:

  • To add a step to a flow: Hub Central Developer or Hub Central Curator
  • To run a step: Hub Central Operator or Hub Central Curator

Or any role that inherits the required role. See Users and Roles.

Managing Flows and Running Steps

Changes are made to the database when you run steps in a flow. After creating and configuring your steps, you must add them to a flow and run them.

  • You can add steps to a new or existing flow.
  • You can add the steps to a flow in the same area where you created the step (Load or Curate) or in the Run area.

To create a flow and add steps to a flow, see Manage Steps and Flows Using Hub Central.

To run steps in a flow, see Run Steps in a Flow Using Hub Central.