[Xapian-devel] Adding Support for Krovetz Stemmer Algo in Xapian

Olly Betts olly at survex.com
Wed Dec 3 10:56:46 GMT 2014


On Mon, Dec 01, 2014 at 01:03:13PM +0530, Abhishek Singh Kushwah wrote:
> Now that if i code a Krovetz Implementation from scratch, than
> possibly i have to use the api and backend calls from xapian api
> rather than making it as independent module and similar to stem.h in
> namespace xapian.

I don't think the backends would be useful for a stemmer.  Even for a
dictionary-based stemmer you'd probably want to have the dictionary
in memory while indexing, since you have to look up every word in
every document.

The only part of the Xapian API you're likely to find useful is the
Unicode support (and if you need such functionality it would be better
to use this than duplicate it).

Cheers,
    Olly



More information about the Xapian-devel mailing list