Skip to main content

What's New in MarkLogic 11

Change to Classification of Some Special Symbol Tokens

The classification of some symbols changed for purposes of tokenization as of MarkLogic 9, including the symbols in the following table. These changes can affect search results.

Symbols

Old Classification

New Classification

spacing accents(5E, 60, A8, AF, B4, B8)

punctuation

diacritic

copyright (A9)registered (AE)degree (B0)

punctuation

symbol

Spanish masculine/feminine ordinals(AA, BA)

punctuation

diacritic

superscript numbers(B2, B3, B9)

punctuation

diacritic

micro (B5)

punctuation

greek

fractions (BC, BD, BE)

punctuation

symbol