[Xapian-discuss] UTF8 support plans (without stemming)

James Aylett james-xapian at tartarus.org
Wed Jun 29 09:21:47 BST 2005


On Wed, Jun 29, 2005 at 04:19:57AM +0100, Olly Betts wrote:

> This is using a patched version of the QueryParser.  Currently I'm using
> glib's unicode routines, but I wonder if we really want to add a
> dependency on glib when we only use a very tiny part of it.
> 
> I already have C code for handling utf-8.  I'm going to see what else is
> around for unicode versions of "isalpha" etc.

IBM ICU is probably a better choice. It also supplies a whole load of
other useful features for Unicode handling, so it's not a ridiculous
thing for people to be using anyway if they're doing Xapian + Unicode
work.

<http://icu.sourceforge.net/>

J

-- 
/--------------------------------------------------------------------------\
  James Aylett                                                  xapian.org
  james at tartarus.org                               uncertaintydivision.org



More information about the Xapian-discuss mailing list