[Xapian-tickets] [Xapian] #756: Implement dice coefficient weight metric

Xapian nobody at xapian.org
Thu Apr 5 22:01:10 BST 2018


#756: Implement dice coefficient weight metric
--------------------+---------------------------
 Reporter:  gp1308  |             Owner:  gp1308
     Type:  task    |            Status:  new
 Priority:  normal  |         Milestone:
Component:  Other   |           Version:
 Severity:  normal  |        Resolution:
 Keywords:          |        Blocked By:
 Blocking:          |  Operating System:  All
--------------------+---------------------------

Comment (by olly):

 If I follow, you want a lower bound on the number of unique terms in any
 document in the collection. We don't track that, but we know `1` is a
 lower bound for it (because documents without any terms wouldn't get
 weighted).

 It's not necessarily a tight bound, but it's the best we can do right now.

 We could certainly start to track bounds on the number of unique terms -
 you could usefully make that a stretch goal of your project (and for older
 databases without that information, we can fall back to returning 1 for
 the lower bound and the doclength upper bound for the unique terms upper
 bound).
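
 A minimal sketch of that fallback, assuming a hypothetical `DbStats`
 struct and helper names (none of these are existing Xapian API; they only
 illustrate the logic described above):

```cpp
#include <cassert>

// Hypothetical per-database statistics; field names are illustrative only.
struct DbStats {
    unsigned doclength_upper_bound;   // longest document length in the database
    bool has_unique_term_bounds;      // false for older databases without the data
    unsigned tracked_lower;           // tracked lower bound on unique terms (if present)
    unsigned tracked_upper;           // tracked upper bound on unique terms (if present)
};

// Lower bound on the number of unique terms in any weighted document.
// Without tracked data, fall back to 1: a document with no terms is
// never weighted, so 1 is always valid (though not necessarily tight).
unsigned get_unique_terms_lower_bound(const DbStats& s) {
    if (!s.has_unique_term_bounds) return 1;
    assert(s.tracked_lower >= 1);
    return s.tracked_lower;
}

// Upper bound on the number of unique terms in any document.
// Without tracked data, fall back to the doclength upper bound.
unsigned get_unique_terms_upper_bound(const DbStats& s) {
    if (!s.has_unique_term_bounds) return s.doclength_upper_bound;
    return s.tracked_upper;
}
```

 With this shape, a weighting scheme could call the two helpers without
 caring whether the database actually tracks the bounds.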

--
Ticket URL: <https://trac.xapian.org/ticket/756#comment:6>
Xapian <https://xapian.org/>