[Xapian-devel] Backend for Lucene format indexes-How to get doclength
jiangwen127 at gmail.com
Sun Jun 16 05:32:31 BST 2013
I have wrote a demo patch for Backend for Lucene format indexes, Lucene
version is 3.6.2.
Now, this demo patch just support the basic features in Lucene. Compound
delete document(.del) are not supported, skip list in .fdx is not supported
example/quest.cc is used to test this demo. query like this:
field_name:term, or file_name:term1 AND field_name:term2
Until now, I found some data needed for BM25 in Xapian are not existed in
4. doclength(for each document)
1-3 are statistics data, can be caculated when doing copydatabase, and
store them in somewhere. But doclengh is
hard to do this way.
1. some other data instead of doclength?
2. Xapian support other rank algorithm which does not need doclength?
Is there some suggestions to solve this problem?
And the demo patch is here:
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Xapian-devel