[Xapian-discuss] stemming an irregular forms?

Sascha.Wojewsky at heinze.de Sascha.Wojewsky at heinze.de
Wed Oct 26 14:21:53 BST 2011


Dear Olly,

thanks a lot for your answer.
Is it possible to create a stemmer subclass in php/perl?

I've tried it without any success...

I'm not able to change it in c++ :-(

Regards
Sascha Wojewsky
________________________________________
Tel.:	+49 (5141) 50291
Fax:	+49 (5141) 506291
-----Ursprüngliche Nachricht-----
Von: Olly Betts [mailto:olly at survex.com] 
Gesendet: Dienstag, 18. Oktober 2011 05:38
An: Wojewsky, Sascha
Cc: xapian-discuss at lists.xapian.org
Betreff: Re: [Xapian-discuss] stemming an irregular forms?

On Fri, Oct 14, 2011 at 08:45:38AM +0200, Sascha.Wojewsky at heinze.de wrote:
> I could not find the irregular forms table in xapian.
> Please, could you tell me how to define/add words to the irregular
> forms table in xapian?

You can define your own stemmer by subclassing StemImplementation:

http://xapian.org/docs/apidoc/html/structXapian_1_1StemImplementation.html

So if you want to use an existing stemmer but handle some words
differently, just create a subclass which checks for the exceptions,
and passes all other words to the existing stemmer.

If there are words which are poorly handled by a particular stemmer,
it's worth letting us (or the snowball team) know too.

Cheers,
    Olly



More information about the Xapian-discuss mailing list