Loading TOC...

MarkLogic 10 Product Documentation
xdmp functions (Document Conversion)

Includes the PDF convert functions using the Iceni technology.

The conversion functions are built-in to the server and support the ability to convert various document formats to XML. There are functions to convert HTML, PDF, Microsoft Word, Microsoft Excel, and Microsoft Powerpoint documents. The output of each of these functions is standards-compliant XHTML with cascading style sheets (CSS). Additionally, there are functions to zip and unzip documents, which can be used to support document formats that are zip archives (for example, Microsoft Office 2007 docx format).

Includes the Microsoft Office convert functions using the AntennaHouse technology.

Includes the document filtering functions using the ISYS technology.

11 functions
Function name Description
xdmp:document-filter Filters a wide variety of document formats, extracting metadata and text, and returning XHTML.
xdmp:excel-convert Converts a Microsoft Excel document to XHTML.
xdmp:gunzip Get a node from a gzip node.
xdmp:gzip Create a gzip node from a node.
xdmp:pdf-convert Converts a PDF file to XHTML.
xdmp:powerpoint-convert Converts a Microsoft Powerpoint document to XHTML.
xdmp:tidy Run tidy on the specified html document to convert the document to well-formed and clean XHTML.
xdmp:word-convert Converts a Microsoft Word document to XHTML.
xdmp:zip-create Create a zip file from a list of nodes.
xdmp:zip-get Get a named file from a zip document.
xdmp:zip-manifest Return a manifest for this zip file.