[Xapian-discuss] Lucene ranking
James Aylett
james-xapian at tartarus.org
Thu Oct 28 15:46:38 BST 2004
Kevin Burton has posted about poor ranking in Lucene preferring
shorter documents over longer ones[1]. A similar search in Xapian
returns documents in the expected order:
Performing query `Xapian::Query(foo)'
3 results found
ID 3 99% [foo foo foo]
ID 2 94% [foo foo]
ID 1 80% [foo]
Anyone know what Lucene is doing here? Their FAQ doesn't mention what
weighting scheme they use, and I don't have time to investigate
further right now ...
[1] <http://www.peerfear.org/rss/permalink/2004/10/26/PoorLuceneRankingForShortText/>
J
--
/--------------------------------------------------------------------------\
James Aylett xapian.org
james at tartarus.org uncertaintydivision.org
More information about the Xapian-discuss
mailing list