[Xapian-devel] Backend for Lucene format indexes-How to get doclength

jiangwen jiang jiangwen127 at gmail.com
Mon Sep 2 02:21:48 BST 2013


TfIdfWeight and BM25 (b=0) also need wdf_upper_bound, which does not exist in
the Lucene backend.
I think this data will be calculated when running copydatabase; I will update
the code later.
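
As a rough illustration of the idea (a sketch only: the `(term, docid, wdf)` posting tuples and the function name `wdf_upper_bounds` are hypothetical, not part of the Xapian or Lucene API), the tightest wdf upper bound for a term is simply the largest wdf seen in any of its postings, so it can be accumulated in one pass while the postings are copied:

```python
def wdf_upper_bounds(postings):
    """Return {term: largest wdf seen} for an iterable of (term, docid, wdf)."""
    bounds = {}
    for term, _docid, wdf in postings:
        # Keep the running maximum wdf per term.
        bounds[term] = max(bounds.get(term, 0), wdf)
    return bounds


# Hypothetical postings, as they might be seen during a copy.
postings = [("xapian", 1, 3), ("xapian", 2, 7), ("lucene", 1, 2)]
bounds = wdf_upper_bounds(postings)
```

Any value at least as large as the true maximum would also be a valid bound; the maximum is just the tightest one.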

Regards


2013/8/26 Olly Betts <olly at survex.com>

> On Mon, Aug 26, 2013 at 09:41:07AM +0800, jiangwen jiang wrote:
> > > For now, using weighting schemes which don't use document length is
> > > probably the simplest answer.
> >
> > There's tf-idf weighting scheme on svn master, is it suitable for lucene
> > backend?
>
> Yes - TfIdfWeight doesn't ever use the document length (at least with
> the normalisations currently implemented).
>
> You could also use BM25 with parameter b=0.
>
> Cheers,
>     Olly
>
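
To illustrate why b=0 removes the need for document lengths, here is a minimal sketch of the textbook BM25 per-term component. It is not Xapian's exact implementation (which has further parameters), and the function and argument names are chosen for illustration only. With b=0 the length-normalisation factor collapses to 1, so the document length no longer affects the weight:

```python
import math


def bm25_term_weight(wdf, doc_len, avg_len, termfreq, num_docs, k1=1.0, b=0.5):
    """Textbook BM25 weight of one term in one document."""
    idf = math.log((num_docs - termfreq + 0.5) / (termfreq + 0.5))
    norm_len = doc_len / avg_len
    # With b == 0 the bracket below is exactly 1, whatever doc_len is.
    return idf * (wdf * (k1 + 1)) / (wdf + k1 * (1 - b + b * norm_len))


common = dict(wdf=3, avg_len=100, termfreq=10, num_docs=1000)

# b = 0: a short and a long document get the same weight.
short_b0 = bm25_term_weight(doc_len=50, b=0.0, **common)
long_b0 = bm25_term_weight(doc_len=500, b=0.0, **common)

# b = 0.5: the longer document is penalised.
short_b05 = bm25_term_weight(doc_len=50, b=0.5, **common)
long_b05 = bm25_term_weight(doc_len=500, b=0.5, **common)
```

The weight still depends on wdf when b=0, which is why a bound on wdf is still needed even when document lengths are not.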