Hi there Olly . :) So like, are you talking about the user being able to specify the weighing scheme or the user being able to override Xapian's weighing scheme and coding his own ? (in context with generating the Eset ) <div>
<br>-Regards <br>-Aarsh<br><br><div class="gmail_quote">On Sat, Jan 12, 2013 at 4:05 AM, Olly Betts <span dir="ltr"><<a href="mailto:olly@survex.com" target="_blank">olly@survex.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div class="im">On Fri, Jan 11, 2013 at 12:42:32AM +0530, aarsh shah wrote:<br>
> So basically an ESET is formed by ranking terms based on the combined<br>
> weights((by using something similar to BM25) assigned to the documents<br>
> in the Rset (formed by the top 5 entries in the MSET or selected by us<br>
> ) which are present in the term's posting list,right ?<br>
<br>
</div>Yes, that's an accurate summary.<br>
<br>
The weighting formula used for generating the ESet is currently<br>
hard-coded to be the original probabilistic formula, which is<br>
essentially BM25 with particular parameters.<br>
<br>
We should probably allow this formula to be specified by the user, like<br>
we do for document weights (I've just added that to the ideas list, as<br>
it didn't seem to be there already).<br>
<br>
Cheers,<br>
Olly<br>
</blockquote></div><br></div>