[Xapian-discuss] auto stopwords
Olly Betts
olly at survex.com
Wed Dec 15 11:02:10 GMT 2004
On Wed, Dec 15, 2004 at 09:53:55AM +0000, James Aylett wrote:
> On Wed, Dec 15, 2004 at 09:43:55AM +0000, Sam Liddicott wrote:
> > As well as within-document-frequency and within-index-frequency is there
> > any benefit in keeping the not-in-document-frequency, or the number of
> > documents that do NOT contain a given term?
>
> Isn't this Database::get_doccount() - TermIterator::get_termfreq() ?
>
> (Or Database::get_doccount() - Database::get_termfreq(tname) .)
Yes. And even if you wanted to, you can't sanely store it precomputed
since every stored value will need to be updated every time a document
is added.
Cheers,
Olly
More information about the Xapian-discuss
mailing list