[Xapian-discuss] Re: (Very)Wrong number in get_matches_estimated
Olly Betts
olly at survex.com
Mon Feb 5 05:20:05 GMT 2007
On Mon, Jan 22, 2007 at 02:21:16PM -0800, Rafael SDM Sierra wrote:
> Sorry, it's a stupid mail, I'm using match decider, and have some documents
> that don't pass on it, so, the number will be really different
Actually, it's not so stupid.
It's harder to make a good estimate in the presence of a matchdecider,
but currently we just return the same estimate we would in the absence
of the matchdecider.
I think we ought to scale this estimate by the ratio of documents the
matchdecider liked to the number it tested (or something like that).
It seems that would generally improve the estimate in this case and
we do something similar when collapsing or using a percentage cutoff.
Does this patch help:
http://oligarchy.co.uk/xapian/patches/xapian-matcher-better-mdecider-estimates.patch
Cheers,
Olly
More information about the Xapian-discuss
mailing list