How to perform a non-latin character search?
Last updated 18, Apr 2024
Question
How to perform a non-latin character search?
Answer
The problem: while partial matches can normally be done or stemming works when text is in the English language, in other languages like Chinese, only an exact match works.
The problem is a misconfiguration of the index: other languages besides English are supported. Setting LANGUAGE_FIELD
to the desired language enables the stemmer for that specific language.
The supported languages are Arabic, Basque, Catalan, Danish, Dutch, English, Finnish, French, German, Greek, Hungarian, Indonesian, Irish, Italian, Lithuanian, Nepali, Norwegian, Portuguese, Romanian, Russian, Spanish, Swedish, Tamil, Turkish, and Chinese
References
Refer to the documentation for the Chinese language support and to the description of the option LANGUAGE_FIELD
from FT.CREATE documentation.