[Xapian-discuss] stopwords

Olly Betts olly at survex.com
Wed Dec 15 16:50:26 GMT 2004


On Tue, Dec 14, 2004 at 12:26:41PM -0300, Georges Dupret wrote:
> I would like to use the lists of stopwords provided with Xapian. Are
> there some standard way to remove stopwords automatically, or should I
> implement it mysel in the indexer?

There's the Xapian::Stopper class which Xapian::QueryParser uses.  But
that's just there as a way of passing a stopping algorithm to the
QueryParser.

If you're stopping at index time it's simplest to just store the list in
a std::set (or if you're using Xapian from another language, whatever
the appropriate data structure is in that language - e.g. you'd use a
hash in Perl).

Cheers,
    Olly



More information about the Xapian-discuss mailing list