How FTS indexing works for localization and attachment file formats
With some special considerations for attachment fields, the full text feature can index any content where the character set is compatible with the AR System server's character set. If the AR System server is running as a Unicode server, the full text feature has no restriction on the encoding format of the data content. You can index and search content in multiple languages.
With a non-Unicode AR System server, the data content must be compatible with the server's character set. When indexing and searching attachments with common formats, such as Microsoft Office documents and PDF documents, the full text feature can process the data without a dependency on the server's character set. For plain text files, the full text feature requires that the server recognize the character set of the data.
The locale of the AR System server defines the locale by which all text is processed. Language text can be indexed and searched, but the analysis (stemming, thesaurus, and root words) is applied according to the rules for the server's locale. For example, if the server is set up for English (en), all words (whether they are English or any other language) are processed as if they were English.