[Xapian-discuss] Indexing Chinese

Alex Deucher alexdeucher at gmail.com
Tue Jun 27 18:17:46 BST 2006


Has anyone ever indexed documents containing Chinese text?  What's the
best way to break the text down for indexing?  I know context is
important, since words aren't separated by spaces.  My current plan is
to index each character as its own term and then run phrase queries on
combinations of characters.  Is there a better approach?
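
To make the plan concrete, here is a rough sketch in plain Python. It
does not use Xapian's API; PositionalIndex, char_tokens and is_cjk are
made-up names for illustration, and treating only the basic CJK Unified
Ideographs block as "Chinese" is a simplifying assumption.

```python
from collections import defaultdict


def is_cjk(ch):
    # Assumption: only U+4E00..U+9FFF (basic CJK Unified Ideographs) counts.
    return "\u4e00" <= ch <= "\u9fff"


def char_tokens(text):
    """Yield (offset, character) for every Chinese character in text.

    Offsets are positions in the original string, so punctuation or
    whitespace between two characters keeps them from looking adjacent.
    """
    for offset, ch in enumerate(text):
        if is_cjk(ch):
            yield offset, ch


class PositionalIndex:
    """Toy in-memory index: one term per character, with positions."""

    def __init__(self):
        # character -> doc_id -> set of offsets where it occurs
        self.postings = defaultdict(lambda: defaultdict(set))

    def add(self, doc_id, text):
        for offset, ch in char_tokens(text):
            self.postings[ch][doc_id].add(offset)

    def phrase(self, query):
        """Return ids of documents containing the query characters in a row."""
        chars = [ch for _, ch in char_tokens(query)]
        if not chars:
            return set()
        hits = set()
        for doc_id, starts in self.postings.get(chars[0], {}).items():
            for start in starts:
                # Every following character must sit at the next offset.
                if all(
                    start + i in self.postings.get(ch, {}).get(doc_id, ())
                    for i, ch in enumerate(chars)
                ):
                    hits.add(doc_id)
                    break
        return hits
```

For example, after idx.add(1, "我爱北京天安门"), the query
idx.phrase("北京") finds document 1 because the two characters are
adjacent, while idx.phrase("天门") finds nothing because 安 sits between
them.  The cost of this scheme is that very common characters get long
posting lists, so phrase queries do most of the work of restoring
context.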

Thanks,

Alex


