Indexing Chinese?

Eric Abrahamsen eric at ericabrahamsen.net
Thu Oct 4 04:18:17 BST 2018


My second (and hopefully last) question: is there any more news on
indexing Chinese characters and words? Searching online mostly returns
results from a decade ago or more, with nothing very conclusive. How
close is this to possible?

For the time being I'm doing some pre-processing on long strings of
Chinese, breaking on punctuation in order to avoid errors. But I have
some large corpora of Chinese texts that in the future I'd like to index
properly.

Thanks,
Eric



More information about the Xapian-discuss mailing list