[Xapian-discuss] Query parser and stemming of norwegian letters

Olly Betts olly at survex.com
Thu Jun 9 16:55:13 BST 2005


On Thu, Jun 09, 2005 at 02:11:16PM +0200, Ivar Bratberg wrote:
> Why does the queryparser produce something different than a direct 
> stemmer call ?

Because QueryParser needs to tokenise before stemming.  Currently it
isn't unicode aware, so it treats the unicode character as a word break.

Hopefully I'll be fixing this next week.

Cheers,
    Olly



More information about the Xapian-discuss mailing list