Ingest and Curate Your Data

Overview

You can ingest and curate your data for MarkLogic Data Hub Service (DHS) in one of two ways: on DHS using Hub Central, or on-premises/locally. With Hub Central, you ingest, curate, and explore data directly in the cloud. With the on-premises/local approach, you ingest and curate data on your own servers and then deploy your Data Hub project to the cloud.

Important: Before you ingest and curate your data, complete all tasks in Getting Started.

Ingest and Curate Your Data on DHS Using Hub Central

Hub Central enables you to ingest raw data into your service and then curate and explore that data directly in the cloud. To learn more about Hub Central, see About Hub Central.

To access Hub Central, use the Hub Central endpoint for your service. For details, see Endpoints and Port Numbers.

Note: MarkLogic recommends checking your Hub Central work into a version control system. To learn how to do this, see Download Your Project Files Using Hub Central.

(Optional) Advanced Data Hub Development

To do more advanced development, download your Hub Central project files and work in your on-premises/local installation of Data Hub. You can then deploy your work back to Hub Central in your service.

Follow these instructions to perform more advanced development on-premises/locally:

  1. Download Your Hub Central Files Using Hub Central.
  2. Develop your Data Hub project on-premises/locally.
  3. Deploy to Data Hub Service.

Ingest and Curate Your Data On-Premises/Locally

You can ingest and curate your raw data on-premises/locally and then deploy your Data Hub project to Data Hub Service. To learn more about on-premises development options, see On-Premises Tools.

Tip: After deploying your on-premises/local Data Hub project to DHS, you can then curate and explore your data on DHS using Hub Central.