Change to Classification of Some Special Symbol Tokens
The classification of some symbols changed for purposes of tokenization as of MarkLogic 9, including the symbols in the following table. These changes can affect search results.
Symbols | Old Classification | New Classification |
---|---|---|
spacing accents (5E, 60, A8, AF, B4, B8) | punctuation | diacritic |
copyright (A9), registered (AE), degree (B0) | punctuation | symbol |
Spanish masculine/feminine ordinals (AA, BA) | punctuation | diacritic |
superscript numbers (B2, B3, B9) | punctuation | diacritic |
micro (B5) | punctuation | greek |
fractions (BC, BD, BE) | punctuation | symbol |
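
To check whether existing content contains any of the reclassified characters, the table above can be encoded as a simple lookup. The following is a minimal Python sketch, not a MarkLogic API: the names `RECLASSIFIED`, `_register`, and `find_reclassified` are hypothetical, and the code assumes the hexadecimal values in the table are Unicode code points (for example, A9 is U+00A9).

```python
# Lookup built from the table above: character -> (old, new) classification.
# Assumption: the hex values in the table are Unicode code points.
RECLASSIFIED = {}


def _register(code_points, old, new):
    """Record the old and new classification for each code point."""
    for cp in code_points:
        RECLASSIFIED[chr(cp)] = (old, new)


_register([0x5E, 0x60, 0xA8, 0xAF, 0xB4, 0xB8], "punctuation", "diacritic")  # spacing accents
_register([0xA9, 0xAE, 0xB0], "punctuation", "symbol")     # copyright, registered, degree
_register([0xAA, 0xBA], "punctuation", "diacritic")        # masculine/feminine ordinals
_register([0xB2, 0xB3, 0xB9], "punctuation", "diacritic")  # superscript numbers
_register([0xB5], "punctuation", "greek")                  # micro
_register([0xBC, 0xBD, 0xBE], "punctuation", "symbol")     # fractions


def find_reclassified(text):
    """Return (index, char, old, new) for each affected character in text."""
    return [
        (i, ch, *RECLASSIFIED[ch])
        for i, ch in enumerate(text)
        if ch in RECLASSIFIED
    ]
```

Calling `find_reclassified("25\u00b0C \u00a9 2017")` reports the degree sign and the copyright sign along with their positions. Text that yields an empty list contains none of the characters listed in the table.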