<div dir="ltr"><div><div>Hi Olly,<br><br></div>Wouldn't setting the weight of terms in title back to normal (e.g. 5 to 1) by below line, automatically adjust the wdfs and field lengths? <br><br>indexer.index_text(title, 5, "S"); ->
indexer.index_text(title, 1, "S"); <br><br></div><div>if it does not then we should include that part in the patch too. I like to create a patch for xapian-letor for resolving common code of xapian.<br><br></div>
<div>Cheers,<br></div><div>Parth. <br></div><div><br></div><br><br><br></div><div class="gmail_extra"><br><br><div class="gmail_quote">On Wed, Mar 12, 2014 at 3:13 AM, Jiarong Wei <span dir="ltr"><<a href="mailto:vcamx3@gmail.com" target="_blank">vcamx3@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Thank you Parth and Olly! I’ll try it :)<br>
<span class="HOEnZb"><font color="#888888"><br>
Jiarong Wei<br>
</font></span><div class="HOEnZb"><div class="h5"><br>
On Mar 11, 2014, at 16:57, Olly Betts <<a href="mailto:olly@survex.com">olly@survex.com</a>> wrote:<br>
<br>
> On Tue, Mar 11, 2014 at 03:20:31PM +0100, Parth Gupta wrote:<br>
>>><br>
>>> On current trunk, we index the title with prefix "S" by default in<br>
>>> omindex, though with a wdf inc of 5 rather than 1:<br>
>>><br>
>>> indexer.index_text(title, 5, "S");<br>
>>><br>
>>> So I don't think you need that change to omindex now.<br>
>><br>
>> Yes, but please make sure to change 5 to 1 otherwise divide the final count<br>
>> statistics by 5 . :)<br>
><br>
> We really need to resolve any instances where letor requires code in<br>
> other parts of Xapian to be patched.<br>
><br>
> In this case, possibly the bias on the title should be done differently,<br>
> but won't this just mean both the wdfs and the field length for the S<br>
> prefix are 5 times larger, and it won't matter?<br>
><br>
> Cheers,<br>
> Olly<br>
<br>
</div></div></blockquote></div><br></div>