[Xapian-discuss] Lucene ranking

James Aylett james-xapian at tartarus.org
Thu Oct 28 15:46:38 BST 2004


Kevin Burton has posted about poor ranking in Lucene preferring
shorter documents over longer ones[1]. A similar search in Xapian
returns documents in the expected order:

Performing query `Xapian::Query(foo)'
3 results found
ID 3 99% [foo foo foo]
ID 2 94% [foo foo]
ID 1 80% [foo]

Anyone know what Lucene is doing here? Their FAQ doesn't mention what
weighting scheme they use, and I don't have time to investigate
further right now ...

[1] <http://www.peerfear.org/rss/permalink/2004/10/26/PoorLuceneRankingForShortText/>

J

-- 
/--------------------------------------------------------------------------\
  James Aylett                                                  xapian.org
  james at tartarus.org                               uncertaintydivision.org



More information about the Xapian-discuss mailing list