The table below lists all the hadoop built-in functions (in this namespace: http://marklogic.com/hadoop).
This module provides helper functions for creating input and split queries for the MarkLogic Content Pump. For details, see the mlcp User Guide.
The Hadoop function module is installed as the following file:
install_dir/Modules/MarkLogic/hadoop.xqy
where install_dir is the directory in which MarkLogic Server is installed.
To use the hadoop.xqy module in your own XQuery modules, include the following line in your XQuery prolog:
import module namespace hadoop="http://marklogic.com/xdmp/hadoop" at "/MarkLogic/hadoop.xqy";
Function name | Description |
---|---|
hadoop:get-splits | Returns (forest_id, record_count, host_name) tuples, where forest_id and host_name identify the target forest of the input split, and record_count is a rough estimate of the number of input key-value pairs in the split. |
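
The following is a minimal sketch of how the tuples described above might be consumed. It rests on two assumptions that this page does not confirm: that hadoop:get-splits takes three string arguments (a namespace binding string, a document selector expression, and a serialized cts query), and that the tuples come back as one flat sequence with three items per split. The argument values and the element names in the output are illustrative only.

```xquery
xquery version "1.0-ml";

import module namespace hadoop = "http://marklogic.com/xdmp/hadoop"
    at "/MarkLogic/hadoop.xqy";

(: Assumption: the three-argument call form and these argument values are
   illustrative; check the function's own reference entry for the signature. :)
let $splits := hadoop:get-splits("", "fn:doc()", "cts:and-query(())")

(: Assumption: the result is a flat sequence, so every three consecutive
   items form one (forest_id, record_count, host_name) tuple. :)
for $i in 1 to fn:count($splits) idiv 3
let $base := ($i - 1) * 3
return
  <split>
    <forest-id>{ $splits[$base + 1] }</forest-id>
    <record-count>{ $splits[$base + 2] }</record-count>
    <host-name>{ $splits[$base + 3] }</host-name>
  </split>
```

Wrapping each tuple in an element is only one way to make the grouping explicit. Because record_count is a rough estimate, it is better suited to balancing work across splits than to exact counting.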