Loading TOC...

MarkLogic 12 EA 2 Product Documentation
xdmp functions (Document Conversion)

The conversion functions are built-in to the server and support the ability to convert various document formats to XML. There are functions to convert HTML, PDF, Microsoft Word, Microsoft Excel, and Microsoft Powerpoint documents. The output of each of these functions is standards-compliant XHTML with cascading style sheets (CSS). Additionally, there are functions to zip and unzip documents, which can be used to support document formats that are zip archives (for example, Microsoft Office 2007 docx format).

Includes the document filtering functions using the ISYS technology.

Includes the Microsoft Office convert functions using the AntennaHouse technology.

Includes the PDF convert functions using the Iceni technology.

11 functions
Function name Description
xdmp.documentFilter Filters a wide variety of document formats, extracting metadata and text, and returning XHTML.
xdmp.excelConvert Converts a Microsoft Excel document to XHTML.
xdmp.gunzip Get a node from a gzip node.
xdmp.gzip Create a gzip node from a node.
xdmp.pdfConvert Converts a PDF file to XHTML.
xdmp.powerpointConvert Converts a Microsoft Powerpoint document to XHTML.
xdmp.tidy Run tidy on the specified html document to convert the document to well-formed and clean XHTML.
xdmp.wordConvert Converts a Microsoft Word document to XHTML.
xdmp.zipCreate Create a zip file from a list of nodes.
xdmp.zipGet Get a named file from a zip document.
xdmp.zipManifest Return a manifest for this zip file.