[Xapian-discuss] queryparser thinks ø is o

Marcus Ramberg marcus at startsiden.no
Mon Aug 29 10:04:49 BST 2005


hey. I'm having some problems with the Xapian QueryParser using the  
perl bindings. It turns all scandinavian characters into the english  
alphabet. See the following example:

$qp->set_stemmer($stemmer);
print $qp->parse_query('bølle')."\n";
print $stemmer->stem_word('bølle')."\n";

Returns

marcus at ds1:~/src/Horus-Indexer$ ./stemtest
Xapian::Query(bolle:(pos=1))
bølle

So, I'm pretty sure it's not the stemmer. Any other ideas?

Marcus

Ps. ( for your info, bølle eq bully, and bolle eq 'bowl' )
Pps. I've implemented the set_parser function in QueryParser. It  
should work, and I get the same results with set_stemming_options. :)

Marcus




More information about the Xapian-discuss mailing list