Understanding the Text Index Settings
The following table describes the different types of indexes available. The indexes are not mutually independent. If both the word search and stemmed search indexes are disabled, the configuration of the remaining indexes is irrelevant, as they all depend on the existence of the word and/or stemmed-search index.
Index |
Default Setting |
Description |
---|---|---|
language |
en |
Specifies the default language for content in this database. Any content without an |
stemmed searches |
Off (index is not built) |
Controls whether searches return relevance ranked results by matching word stems. A word stem is the part of a word that is common to all of its inflected variants. For example, in English, "run" is the stem of "run", "runs", "ran", and "running". A stemmed search returns more matching results than the exact words specified in the query. A stemmed search for a word finds the same terms as an unstemmed search, plus terms that derive from the same meaning and part of speech as the search term. For example, a stemmed search for There are three types of stemming: basic (one stem per word), advanced (one or more stems per word), and decompounding (advanced plus smaller component words of large compound words). Without either this index or the word searches index, MarkLogic Server is unable to perform relevance ranking and will refuse to execute any If both the stemmed search and word search indexes are enabled, MarkLogic Server defaults to performing stemmed searches (unless an unstemmed search is explicitly specified). Turn this index off if you want to disable stemmed searches. If word and stemmed search indexes are both off, then full-text searches are effectively disabled. |
|
On (index is built) |
Enables MarkLogic Server to return relevance ranked results which match exact words in text elements. Either this index or the stemmed search index is needed for MarkLogic Server to execute any For many applications, keeping this word search index off and the stemmed search index on is sufficient to return the desired results for queries. Turn this index on if you want to do exact word-only matches. If word and stemmed search indexes are both off, then full-text searches are effectively disabled. |
|
Off (index is not built) |
Speeds up the performance of proximity queries that use the Turn this index off if you are not interested in proximity queries or phrase searches and if you want to conserve disk space and decrease loading time. If you turn this option on, you might find that you no longer need |
|
On (index is built) |
Accelerates phrase searches by building additional indexes that describe sequences of words at load (or reindex) time. Without this index, MarkLogic Server will still perform phrase searches, just more slowly. Turn this index off if only a small percentage of your queries will contain phrase searches, and if conserving disk space and enhancing load speed is more important than the performance of those queries. |
|
On (index is built) |
Accelerates case sensitive searches by building both case sensitive and case insensitive indexes at load time. Without this index, MarkLogic Server will still perform case sensitive searches, just more slowly. Turn this index off if only a small percentage of your text searches will be case sensitive, and if conserving disk space and enhancing load speed is more important than the performance of those queries. |
|
Off (index is not built) |
Speeds up reverse query searches by indexing stored queries. Turn this option on to speed up searches that use |
|
On (index is built) |
Speeds up diacritic-sensitive searches by eliminating some false positive results. Turn this option off if you do not want to do diacritic-sensitive searches. |
|
On (index is built) |
Accelerates searches that look for words in specific elements by building additional indexes at load time. Without this index, MarkLogic Server will still perform these searches, just more slowly. Turn this index off if only a small percentage of your queries rely on finding words within specific document elements, and if conserving disk space and enhancing load speed is more important than the performance of those queries. |
|
Off (index is not built) |
Speeds up the performance of proximity queries that use the Turn this index off if you are not interested in proximity queries and if you want to conserve disk space and decrease loading time. |
|
On (index is built) |
Accelerates phrase searches on elements by building additional indexes that describe sequences of words in elements at load (or reindex) time. Without this index, MarkLogic Server will still perform phrase searches, just more slowly. Turn this index off if only a small percentage of your queries will contain phrase searches at the element level, and if conserving disk space and enhancing load speed is more important than the performance of those queries. |
|
Off (index is not built) |
Speeds up the performance of proximity queries that use the Turn this index off if you are not interested in proximity queries and if you want to conserve disk space and decrease loading time. |
|
Off (index is not built) |
Speeds up the performance of proximity queries that use the Turn this index off if you are not interested in proximity queries and if you want to conserve disk space and decrease loading time. |
|
Off (index is not built) |
Speeds up the performance of field value searches that use the Turn this index off if you are not interested in field value queries and if you want to conserve disk space and decrease loading time. |
|
Off (index is not built) |
Speeds up the performance of proximity queries that use the Turn this index off if you are not interested in proximity queries and if you want to conserve disk space and decrease loading time. |
|
Off (index is not built) |
Speeds up wildcard searches where the search pattern contains the wildcard character at the end (for example, |
|
Off (index is not built) |
Speeds up the performance proximity queries that use trailing-wildcard word searches, such as wildcard queries that use the Turn this index on if you are using trailing wildcard searches and proximity queries together in the same search. |
|
Off (index is not built) |
Faster wildcard searches with the wildcard at the end of the search pattern within a specific element, but slower document loads and larger database files. |
|
Off (index is not built) |
Speeds up wildcard searches where the search pattern contains three or more consecutive non-wildcard characters (for example, When character indexing is turned on, performance is also improved for Turn this index on if you want to enable wildcard searches that match three or more characters. If you need wildcard searches to match only two or one characters, then you should enable two character searches and/or one character searches. |
|
Off (index is not built) |
Speeds up the performance of proximity queries that use three-character word searches, such as queries that use the Turn this index on if you are using wildcard searches and proximity queries together in the same search. |
|
Off (index is not built) |
Enables wildcard searches where the search pattern contains two or more consecutive non-wildcard characters. For details on wildcard characters, see Understanding and Using Wildcard Searches in the Application Developer’s Guide. When character indexing is turned on in the database, the system also delivers higher performance for Turn this index on to speed up wildcard searches that match two or more characters (for example, |
|
Off (index is not built) |
Speeds up wildcard searches where the search pattern contains only a single non-wildcard character. For details on wildcard characters, see Understanding and Using Wildcard Searches in the Application Developer’s Guide. When character indexing is turned on in the database, the system also delivers higher performance for Turn this index on if you want to enable wildcard searches that match one or more characters (for example, |
|
Off (index is not built) |
Turn this index on to improve performance of wildcard searches that query specific XML elements or JSON properties. Also, speeds up element-based wildcard searches. Turn this index on to improve performance of wildcard searches that query specific elements. For details on wildcard characters, see Understanding and Using Wildcard Searches in the Application Developer’s Guide. |
|
Off (index is not built) |
Maintains a lexicon of all of the words in a database, with uniqueness determined by a specified collation. For details on lexicons, see Range Indexes and Lexicons and the Application Developer’s Guide. For details on collations, see Language Support in MarkLogic Server in the Search Developer’s Guide. Speeds up wildcard searches. Works in combination with any other available wildcard indexes to improve search index resolution and performance. When used in conjunction with the |
|
On (index is built) |
Maintains a lexicon of all of the URIs used in a database. The URI lexicon speeds up queries that constrain on URIs. It is like a range index of all of the URIs in the database. To access values from the URI lexicon, use the |
|
On (index is built) |
Maintains a lexicon of all of the collection URIs used in a database. The collection lexicon speeds up queries that constrain on collections. It is like a range index of all of the collection URIs in the database. To access values from the collection lexicon, use the |