[Xapian-devel] Bitsize project: Krovetz Stemmer
Richhiey Thomas
richhiey.thomas at gmail.com
Sun Feb 15 07:04:17 GMT 2015
Hello Xapian devs,
I had shown interest in writing a Krovetz stemmer for Xapian and spoke to
James Aylett about it. Since it was hard to code the stemmer in Snowball, I
came up with a C++ implementation of the stemmer instead.
But since it is a dictionary-based stemmer, I'm having trouble deciding
how to create the dictionary.
I did check out some of the implementations of the Krovetz stemmer online,
but all of them have large dictionaries, and I'm not sure whether that would
be helpful in our case, since a configurable dictionary would be better for
the user.
I believe words such as exceptions and nationalities have to be treated
differently, and I have implemented that by creating a DictEntry class with a
boolean value named exception.
Any advice on how to proceed with this would be of much help :)