[Xapian-devel] Explanation of how Eset works
aarshkshah1992 at gmail.com
Tue Jan 15 08:56:49 GMT 2013
Hi there Olly. :) Are you talking about the user being able to specify
the weighting scheme, or the user being able to override Xapian's
weighting scheme by coding their own (in the context of generating the
ESet)?
On Sat, Jan 12, 2013 at 4:05 AM, Olly Betts <olly at survex.com> wrote:
> On Fri, Jan 11, 2013 at 12:42:32AM +0530, aarsh shah wrote:
> > So basically an ESet is formed by ranking terms based on the combined
> > weights (by using something similar to BM25) assigned to the documents
> > in the RSet (formed by the top 5 entries in the MSet or selected by us)
> > which are present in the term's posting list, right?
> Yes, that's an accurate summary.
> The weighting formula used for generating the ESet is currently
> hard-coded to be the original probabilistic formula, which is
> essentially BM25 with particular parameters.
> We should probably allow this formula to be specified by the user, like
> we do for document weights (I've just added that to the ideas list, as
> it didn't seem to be there already).
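
For readers following the thread, the term ranking summarised above can be sketched roughly as follows. This is a minimal illustration, not Xapian's code: it assumes the "original probabilistic formula" is the Robertson/Sparck Jones relevance weight with 0.5 smoothing, multiplied by r (the number of RSet documents containing the term). It also treats every RSet document as contributing equally. The function names and the toy statistics are hypothetical.

```python
import math


def rsj_weight(r, R, n, N):
    """Robertson/Sparck Jones relevance weight with 0.5 smoothing.

    r: RSet documents containing the term
    R: size of the RSet
    n: documents in the whole collection containing the term
    N: size of the whole collection
    """
    return math.log(((r + 0.5) * (N - n - R + r + 0.5)) /
                    ((n - r + 0.5) * (R - r + 0.5)))


def rank_expansion_terms(rset_docs, term_freqs, N, max_terms=10):
    """Rank candidate expansion terms drawn from the RSet documents.

    rset_docs: one collection of terms per relevant document (hypothetical input)
    term_freqs: term -> number of collection documents containing it
    """
    R = len(rset_docs)
    # Count, for each term, how many RSet documents contain it.
    r_counts = {}
    for doc_terms in rset_docs:
        for term in set(doc_terms):
            r_counts[term] = r_counts.get(term, 0) + 1
    # Score each term as r * w(t); this scaling by r is an assumption.
    scored = [(r * rsj_weight(r, R, term_freqs[term], N), term)
              for term, r in r_counts.items()]
    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(term, score) for score, term in scored[:max_terms]]


# Toy statistics, invented purely for illustration.
rset = [
    {"xapian", "search", "index"},
    {"xapian", "search", "the"},
    {"xapian", "the", "query"},
]
freqs = {"xapian": 20, "search": 100, "index": 50, "the": 900, "query": 80}
ranked = rank_expansion_terms(rset, freqs, N=1000)
for term, score in ranked:
    print(f"{term:8s} {score:7.3f}")
```

With these toy numbers, a term that is common in the RSet but rare in the collection ("xapian") ranks first, while a term that is common everywhere ("the") gets a negative score and ranks last. Letting the user specify the formula, as suggested above, would amount to making `rsj_weight` replaceable.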