[Xapian-tickets] [Xapian] #216: Inconsistent return values for percentage weights

Xapian nobody at xapian.org
Tue Mar 3 10:31:18 GMT 2009


#216: Inconsistent return values for percentage weights
---------------------+------------------------------------------------------
 Reporter:  richard  |        Owner:  olly     
     Type:  defect   |       Status:  assigned 
 Priority:  normal   |    Milestone:  1.0.11   
Component:  Matcher  |      Version:  SVN trunk
 Severity:  normal   |   Resolution:           
 Keywords:           |    Blockedby:           
 Platform:  All      |     Blocking:           
---------------------+------------------------------------------------------

Comment(by olly):

 Even with better bounds on the weight, we're rarely going to get the
 attained maximum equalling the bound.  And users are (quite reasonably)
 going to be surprised if 100% is not attained in most cases where a
 document matches all the terms in the query.  I don't think that approach
 is really workable.

 There isn't actually a lot of special case code for changes to percentage
 cutoff weights - that's all handled in a single if statement.  But it does
 feature quite a lot in the post-match adjusting of bounds, etc.

 Several people have seemed confused that two matches with the same
 percentage score don't count as having the same "relevance" so I think if
 we're going to give up on the current approach to percentages, we should
 just leave it to the user to calculate them if they want them.  Ironically
 this is how it used to be - the percentage calculating code was originally
 part of Omega.

 > We could also provide an interface which returns the term weights for
 each of the terms in a query (generally useful).

 Already exists - MSet::get_termweight(tname)!

 > they could get hold of the weight of the top document (either by asking
 for it to be included in the mset, if they're doing a relevance-sorted
 search [...])

 MSet::get_max_attained() should report that weight even if first > 0.

 Anyway, for the moment, the question is really if we should apply this
 patch for 1.0.11.  I'll try running some tests to see if the patch slows
 things down much.

-- 
Ticket URL: <http://trac.xapian.org/ticket/216#comment:18>
Xapian <http://xapian.org/>
Xapian



More information about the Xapian-tickets mailing list