[Xapian-discuss] Xapian::Queryparser / Encoding Problem (Utf8)
Olly Betts
olly at survex.com
Tue Aug 16 18:49:54 BST 2005
On Wed, Aug 10, 2005 at 04:41:41PM +0200, R. Mattes wrote:
> On Wed, 2005-08-10 at 15:29 +0100, Richard Boulton wrote:
> > The query parser itself shouldn't need too much work - you'll probably
> > need to look at the accent normalising code (see accentnormalisingitor.h
> > and symboltab.h).
>
> Well, looks like this will be my next task on the stack ...
I've already done this - Gmane is using a patched version of the
QueryParser on utf-8 data (without any stemming).
As I've said before, anyone who wants the patch is welcome to it. I
can't just apply it to SVN as is though as it'll break anyone using
iso-8859-1 queries or stemming. It also currently adds a dependency on
glib which is probably something we don't want to do.
Cheers,
Olly
More information about the Xapian-discuss
mailing list