MarkLogic 12 EA 1 Product Documentation
xdmp:document-loadxdmp:document-load(
$location as xs:string,
[$options as (element()|map:map)?]
) as empty-sequence()
Summary
Inserts a new document with the specified URI. If a document already exists
at the URI, the function replaces the content in the existing document as
an update operation.
Parameters |
location |
The location of the input document. If the scheme of the location is
HTTP (that is, if the string starts with "http://"), then the document is
requested over HTTP. If the scheme is file (that is, if the string starts
with "file://"), then the document is requested over file protocol from
the local filesystem.
Otherwise, the document is fetched from the local
filesystem. On the filesystem, the path can be fully qualified or relative.
Relative pathnames are resolved from the directory in which
MarkLogic Server is installed.
|
options |
Options with which to customize this operation. You
can specify options as either an options XML element
in the "xdmp:load" namespace, or as a map:map . The
options names below are XML element localnames. When using a map,
replace the hyphens with camel casing. For example, "an-option"
becomes "anOption" when used as a map:map key.
This function supports the following options, plus the options from the
xdmp:http-get
function.
- uri
- The URI of the document to be loaded. If omitted, then the location
is used for the URI.
- permissions
- Security permission corresponding to the permissions for the
document. If not supplied, the current user's default permissions are
applied. The default value used for $permissions can be obtained by
calling
xdmp:default-permissions()
. A document that
is created by a non-admin user (that is, by any user who does not have the
admin role) must have at least one update permission,
otherwise the creation will throw an XDMP-MUSTHAVEUPDATE
exception. When expressing options using a map:map,
use the "object" format for permissions; see the output-kind
parameter of xdmp:permission and
xdmp:default-permissions .
- collections
- The collection URIs for collections to which this document belongs.
If not supplied, the document is added to the current user's default
collections (the collections returned from
xdmp:default-collections()
). For each
collection that is protected, the user must have permissions to update
that collection or have the any-collection privilege. For
each unprotected collection, the user must have the
unprotected-collections privilege. The
<collections> element consists of one or more
<collection> child elements. For example:
<collections>
<collection>myCollection1</collection>
<collection>myCollection2</collection>
</collections>
- quality
- The quality of this document. A positive value increases the
relevance score of the document in text search functions. The converse is
true for a negative value. The default value is 0.
- default-namespace
- (XML only) The namespace to use if there is no namespace at the root
node of the document. The default value is "".
- repair
- A value of
full specifies that malformed XML
content be repaired. A value of none specifies that
malformed XML content is rejected.
If no repair option is explicitly specified, the
default is implicitly specified by the XQuery version of the caller.
In XQuery 1.0 and 1.0-ml the default
is none . In XQuery 0.9-ml the
default is full .
This option has no effect on binary, text or JSON documents.
- format
- A value of
text specifies to get the document as a text
document, regardless of the URI specified. A value of
binary specifies to get the document as a binary
document, regardless of the URI specified. A value of xml
specifies to get the document as an XML document, regardless of the
URI specified. A value of json
specifies to get the document as a JSON document, regardless of the
URI specified.
- default-language
- The language to specify in an
xml:lang attribute on the
root element node if the root element node does not already have an
xml:lang attribute. This option applies only to XML documents.
If this option is not specified, then nothing is added to the root element
node.
- encoding
- Specifies the encoding to use when reading the document into MarkLogic
Server. The value must either be "auto" or match an encoding name
according to the Unicode Charset Alias Matching rules
(http://www.unicode.org/reports/tr22/#Charset_Alias_Matching).
When the value is "auto", MarkLogic guesses the encoding from
the document content. For a list of character set encodings by language, see
Collations and Character Sets By Language in the Search Developer's Guide.
If you do not set this option, MarkLogic uses the encoding
specified in the HTTP headers, if present. If you do not set this option
and no encoding is available from HTTP headers, the encoding
defaults to UTF-8. For more details, see
Character Encoding in the Search Developer's Guide.
- forests
- Specifies the ID of the forest in which this document is inserted.
Each forest ID is in a <forest> child element and
is of type
xs:unsignedLong .
. If the document
already exists in the database, it will remain in its existing forest. If
no such forest exists or if no such forest is attached to the context
database, an error is raised. If multiple forests
are specified, the document is inserted into one of the specified
forests. If the document already exists and the forest in which it is
stored is set to delete-only, then you must specify the forest IDs to
include one or more forests that allow updates, otherwise an exception is
thrown.
If you have local disk failover enabled, specify the ID of the master
forest. In the event of a failover, MarkLogic server will automatically
redirect documents to the replica forest. Specify the ID of the replica
forest will result in a "forest not in database" error.
- metadata
- Specifies key-value pairs representing user-defined metadata associated
with the document. Metadata values are strings. Non-string values are
converted to strings. When you express the options as
an XML element, the value of the
metadata option element is
a serialized map:map. When you express the options as a map:map,
the associated with the "metadata" option key is itself a map:map.
|
Required Privileges
http://marklogic.com/xdmp/privileges/xdmp-document-load
If a new document is inserted, you also need the
unprotected-uri
privilege (only if the URI is not protected),
the any-uri
privilege, or an appropriate URI privilege.
If adding an unprotected collection to a document, the
unprotected-collections
privilege
(http://marklogic.com/xdmp/privileges/unprotected-collections
)
is needed; if adding a protected collection, the user must have either
permissions to update the collection or the
any-collection
privilege
(http://marklogic.com/xdmp/privileges/any-collection
).
Usage Notes
When selecting documents over HTTP (where the $location
parameter begins with http://
), the response from the webserver
is loaded into the database, regardless of what the headers returned
from the webserver indicate. For example, if the webserver returns a
404 (file not found), then the response page that says "file not found"
is loaded into the database. If you want to examine the headers before
loading the document, use xdmp:http-get
(combined with
xdmp:document-insert
) instead, as
xdmp:http-get
allows you to examine the headers returned from the HTTP server.
Example
(: Load a document from the file system, using options expressed
: as an XML element. :)
xdmp:document-load("c:\myFile.xml",
<options xmlns="xdmp:document-load">
<uri>/documents/myFile.xml</uri>
<repair>none</repair>
<permissions>{xdmp:default-permissions()}</permissions>
<metadata>{
map:map() => map:with("h", "hello")
=> map:with("w", "world")
}</metadata>
</options>)
(: Loads the document with a URI "/documents/myFile.xml"
: and does not perform tag repair during the load. :)
Example
(: Load a document from the file system, using options expressed
: as a map:map. :)
xdmp:document-load("c:\myFile.xml",
map:map() => map:with("uri", "/documents/myFile.xml")
=> map:with("repair", "none")
=> map:with("metadata", map:map() => map:with("key", "value"))
)
(: Loads the document with a URI "/documents/myFile.xml"
: and does not perform tag repair during the load. :)
Example
xdmp:document-load("http://myCompany.com/file.xml",
<options xmlns="xdmp:document-load"
xmlns:http="xdmp:http">
<uri>/documents/myFile.xml</uri>
<repair>none</repair>
<permissions>{xdmp:default-permissions()}</permissions>
<format>xml</format>
<http:authentication>
<http:username>user</http:username>
<http:password>pass</http:password>
</http:authentication>
</options>)
(: Loads the document with a URI "/documents/myFile.xml"
: from the server http://myCompany.com, sending the
: credentials user/pass. Tag repair is not performed
: during the load, the document is loaded as xml with
: metadata key-value pairs of 'h:hello' and 'w:world'. :)
Example
(: Using a map to expression options, rather than an XML element. :)
xdmp:document-load("c:\myFile.xml",
map:map() => map:with("uri", "/documents/myFiles.xml")
=> map:with("permissions",
(xdmp:default-permissions("objects"),
xdmp:permission("some-role", "read", "object")))
=> map:with("collections", ("myCollection1", "myCollection2"))
=> map:with("repair", "full")
=> map:with("forests", (xdmp:forest("myForest")))
)
(: Loads the document with a URI "/documents/myFile.xml"
: performing tag repair during the load, adding the
: document to the "myCollection1" and "myCollection2"
: collections, and loading the document into the forest
: named "myForest". :)
Copyright © 2024 MarkLogic Corporation. MARKLOGIC is a
registered trademark of MarkLogic Corporation.