[Xapian-devel] Explanation of how Eset works

aarsh shah aarshkshah1992 at gmail.com
Tue Jan 15 08:56:49 GMT 2013


Hi Olly, :)  Are you talking about the user being able to select one of
Xapian's existing weighting schemes, or about the user being able to
override Xapian's weighting scheme by coding their own (in the context
of generating the ESet)?

-Regards
-Aarsh

On Sat, Jan 12, 2013 at 4:05 AM, Olly Betts <olly at survex.com> wrote:

> On Fri, Jan 11, 2013 at 12:42:32AM +0530, aarsh shah wrote:
> > So basically an ESet is formed by ranking terms based on the combined
> > weights (computed using something similar to BM25) assigned to the
> > documents in the RSet (formed by the top 5 entries in the MSet, or
> > selected by us) which are present in the term's posting list, right?
>
> Yes, that's an accurate summary.
>
> The weighting formula used for generating the ESet is currently
> hard-coded to be the original probabilistic formula, which is
> essentially BM25 with particular parameters.
>
> We should probably allow this formula to be specified by the user, like
> we do for document weights (I've just added that to the ideas list, as
> it didn't seem to be there already).
>
> Cheers,
>     Olly
>
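
For reference, the ranking described in the quoted summary can be sketched
roughly as below. This is only a minimal illustration, assuming the classic
Robertson/Sparck Jones relevance weight with 0.5 smoothing, and assuming
terms are ranked by r times that weight. It is not Xapian's actual code, and
the names relevance_weight, rank_expansion_terms, rset_docs and term_freqs
are made up for the example.

```python
import math


def relevance_weight(r, R, n, N):
    """Robertson/Sparck Jones relevance weight with 0.5 smoothing.

    r: number of relevant (RSet) documents containing the term
    R: number of documents in the RSet
    n: number of documents in the collection containing the term
    N: number of documents in the collection
    """
    numerator = (r + 0.5) * (N - R - n + r + 0.5)
    denominator = (R - r + 0.5) * (n - r + 0.5)
    return math.log(numerator / denominator)


def rank_expansion_terms(rset_docs, term_freqs, N, max_terms=10):
    """Rank candidate expansion terms drawn from the RSet documents.

    rset_docs:  list of term lists, one per relevant document (hypothetical input)
    term_freqs: dict mapping term -> collection document frequency n
    Assumption: the score is r * relevance_weight, a common term selection value.
    """
    R = len(rset_docs)

    # Count how many RSet documents each term occurs in.
    r_counts = {}
    for doc_terms in rset_docs:
        for term in set(doc_terms):
            r_counts[term] = r_counts.get(term, 0) + 1

    scored = [
        (term, r * relevance_weight(r, R, term_freqs[term], N))
        for term, r in r_counts.items()
    ]
    # Highest score first; break ties alphabetically for a stable order.
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored[:max_terms]


if __name__ == "__main__":
    rset = [["xapian", "search"], ["xapian", "index"]]
    freqs = {"xapian": 5, "search": 50, "index": 40}
    for term, score in rank_expansion_terms(rset, freqs, N=100):
        print(term, round(score, 3))
```

Letting the user supply the formula would amount to swapping out
relevance_weight in a sketch like this.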

