[Xapian-discuss] How Xapian work with Thai ?

Sakesun Roykiattisak sakesun at boonthavorn.com
Tue Jun 10 15:14:05 BST 2008


> Out of curiousity, what if anything are you using for segmentation?
> Are you doing character based indexing?  I understood that Thai has no
> standard for word segmentation.

I wonder too.  Why it work with Thai ?  The result is not very good
but it's somewhat usable and I thought it could made work with some effort,
so I began investigate this.

Can Xapian do N-gram ?


>
> 2008/6/10 Sakesun Roykiattisak <sakesun at boonthavorn.com>:
>>
>> Hi,
>>
>>   Just a minute ago I've installed Flax and I'm very surprised.  
>> Eventhough
>> it's
>> very far from perfect, it's very usable with my language, Thai,
>> out-of-the-box.
>> And I'm wondering how does Xapian handle Thai ? Tokenize or N-Gram ?
>> I've seen the list of language Xapian support (in which Thai is not
>> included)
>> But how does it handle other languages ?
>>
>>   I'm looking for way to perform full-text search for my language.
>>
>> Thanks
>>
>>
>>
>>
>> _______________________________________________
>> Xapian-discuss mailing list
>> Xapian-discuss at lists.xapian.org
>> http://lists.xapian.org/mailman/listinfo/xapian-discuss
>>
>





More information about the Xapian-discuss mailing list