[Xapian-discuss] Accent
Robert Kaye
rob at eorbit.net
Wed Jun 25 22:59:32 BST 2008
On Jun 25, 2008, at 3:14 AM, double wrote:
> Hello,
>
> The words "Guantánamo" and "Guantanamo" are indexed
> as two different words. Is there any chance to ignore
> accents?
If you run the data that you index through libunac [0] it will remove
accents from characters so that those two words will be indexed as
one. Just don't run the data you store in the index through libunac
[0] http://www.nongnu.org/unac/
--
--ruaok Somewhere in Texas a village is *still* missing its idiot.
Robert Kaye -- rob at eorbit.net -- http://mayhem-chaos.net
More information about the Xapian-discuss
mailing list