Deploy to Data Hub Service
Data Hub Service
You can deploy your Data Hub project in the cloud instead of setting up your own. The Data Hub Service (DHS) is a cloud-based solution that provides a preconfigured MarkLogic cluster in which you can run flows and from which you can serve harmonized data.
Use MarkLogic Data Hub to develop and test your project locally (your development environment) then deploy it to a DHS cluster (your production environment).
In a DHS environment, the databases, app servers, and security roles are automatically set up. Admins can create user accounts.
To learn more about Data Hub Service (DHS), see Data Hub Service and the DHS documentation.
The following configurations might be different between Data Hub projects and DHS projects:
- Roles — The DHS roles are automatically created as part of provisioning your DHS environment.
Data Hub DHS data-hub-admin endpointDeveloper endpointUser flow-developer flowDeveloper flow-operator flowOperator See also: Data Hub Service Instance Security Roles
- Database names — If database names are customized in the Data Hub environment, they might be different.
- Gradle settings — The
gradle.properties
file contains some DHS-only settings, includingmlIsHostLoadBalancer
andmlIsProvisionedEnvironment
, which are set totrue
to enable Data Hub to work correctly in DHS.
The following configurations are the same between Data Hub projects and DHS projects:
- Ports and load balancers for app servers
app servers ports DHS load balancers staging 8010 curation final 8011 operations jobs 8013 analytics Important: Use port 8004 to deploy the Data Hub Framework core only. To deploy custom plugins (REST extensions, search options, etc.) against the FINAL database, use port 8011.
- the requests from the outside world to MarkLogic
- the results from MarkLogic to the requester
If your endpoints are publicly available, you can use any machine that is set up as a peer of the MarkLogic VPC. See Create the Peer Role - AWS.
Before you begin
- A Data Hub project that has been set up and tested locally
- A provisioned MarkLogic Data Hub Service environment
- For private endpoints only: A bastion host inside a virtual private cloud (VPC)
- Information from your DHS administrator:
- Your DHS host name (typically, the curation endpoint)
- REST curation endpoint URL (including port number) for testing
- The username and password of the user account associated with each of the following roles. (See Creating a DHS Instance User Account.)
endpointDeveloper
endpointUser
flowDeveloper
flowOperator
Procedure
What to do next
If you update your flows after the initial project upload, you can redeploy your flow updates by running gradle mlLoadModules
again and then running the flows.