[Xapian-devel] Bitsize project: Krovetz Stemmer

Richhiey Thomas richhiey.thomas at gmail.com
Sun Feb 15 19:07:46 GMT 2015


Hello,
Yes James, will load a required dictionary according to the requirements of
the program. I have also structured the program accordingly so it shouldn't
be a problem :)
Also I did not mean a different class for every exception. Im using an
unordered_map to map the word against its dictionary entry for which I have
created a DictEntry class. It stores the word along with pointing out
whether it is an exception or not. Should work well right?
Also once I'm done, how can I have one of you review the code and help me
proceed?
Thanks.
On Feb 15, 2015 10:35 PM, "James Aylett" <james-xapian at tartarus.org> wrote:

> On 15 Feb 2015, at 07:04, Richhiey Thomas <richhiey.thomas at gmail.com>
> wrote:
>
> > Since [Krovetz] is a dictionary based stemmer, im having problems on
> deciding how to create the dictionary.
>
> Richhiey — I think I recommended that you load any dictionaries you need
> from a file, which could be specified when constructing the stemmer. That
> separates the need to create the dictionary from implementing the feature,
> although we’ll have to provide some initial dictionary eventually.
>
> How you then structure that in your code as you load it from file and
> later use it is entirely up to you. If it’s just a list of words that
> should be treated specially, having a class to represent each word feels
> like overkill — you can probably do it with something like an STL container
> of a base_string of some sort (std::wstring? I haven’t done much Unicode in
> C++ work, so others may want to jump in and correct me here).
>
> J
>
> --
>  James Aylett, occasional trouble-maker
>  xapian.org
>
>
> _______________________________________________
> Xapian-devel mailing list
> Xapian-devel at lists.xapian.org
> http://lists.xapian.org/mailman/listinfo/xapian-devel
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20150216/ba50b204/attachment.html>


More information about the Xapian-devel mailing list