[Xapian-discuss] Accent

Robert Kaye rob at eorbit.net
Wed Jun 25 22:59:32 BST 2008


On Jun 25, 2008, at 3:14 AM, double wrote:

> Hello,
>
> The words "Guantánamo" and "Guantanamo" are indexed
> as two different words. Is there any chance to ignore
> accents?

If you run the data that you index through libunac [0] it will remove  
accents from characters so that those two words will be indexed as  
one. Just don't run the data you store in the index through libunac

[0] http://www.nongnu.org/unac/

--

--ruaok      Somewhere in Texas a village is *still* missing its idiot.

Robert Kaye     --     rob at eorbit.net     --    http://mayhem-chaos.net





More information about the Xapian-discuss mailing list