[Xapian-devel] What does collection_freq means?

jiangwen jiang jiangwen127 at gmail.com
Wed Aug 28 03:13:36 BST 2013


Thanks, Matt.

And some other questions:
1.
http://xapian.org/docs/sourcedoc/html/classXapian_1_1Weight_1_1Internal.html
    Xapian::doccount collection_size
                              Number of documents in the collection.
    What's the difference bewteen collection_size and
doccount(Xapian::doccount  get_doccount() const;).

2 On this page, http://xapian.org/docs/bm25.html
    *(k3+1)q*
(k3+q) ·  *(k1+1)f*
(k1L+f) ·log *(r+0.5)(N-n-R+r+0.5)*
(n-r+0.5)(R-r+0.5)   .   f is the wdf, the within document frequency,

   But in the code BM25Weight::get_maxpart(),
   double wdf_max(get_wdf_upper_bound()) is used, what's the difference
between f(wdf) and wdf_max.
   If they are not the same, why wdf_max is used

Really appreciate your help!




2013/8/27 Matt Chaput <matt at whoosh.ca>

> > Hi, all:
> >
> > I am confused with the concept of colletion_freq
> > There's no informations about it on http://xapian.org/docs/glossary.html
> > What does it means?
>
> The number of times the term appears across all documents.
>
>
> _______________________________________________
> Xapian-devel mailing list
> Xapian-devel at lists.xapian.org
> http://lists.xapian.org/mailman/listinfo/xapian-devel
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130828/6bbb04d5/attachment.html>


More information about the Xapian-devel mailing list