Hello guys.I am working on implementing the PL2 weighting scheme of the DFR framework by Gianni Amati.<br>It

 uses the Poisson approximation of the Binomial as the probabilistic 

model (P), the Laplace law of succession to calculate the after effect 

of sampling or the risk gain (L) and within document frequency  normalization H2(2) (as proposed by

 Amati in his PHD thesis). <br><br>The formula for w(t,d) in this scheme is given by::-<br><br>w(t,d) = wqf * L * P<br>where <br>         wqf = within query frequency<br>         L = Laplace law of after effect sampling =1 / (wdfn + 1)<br>

         P = wdfn * log (wdfn / lamda) + (lamda - wdfn) log(e) + 0.5 * log (2 * pi * wdfn) <br>         wdfn = wdf * (1+c * log(average length of document in database / length of document d )) (H2 Normalization )<br>         lamda = mean of the Poisson distrubution = Collection frequency of the term   / Size of the database    <br>

         and the base of all logarithms is 2.<br>         c is a constant  parameter<br>       <br>The code is almost complete but I am stuck at a few places which are as follows:-<br><br>1.) Calculating the upper bound of the weight for the get_maxpart( ) function<br>

              This one calculation has been giving me sleepless nights for a couple of days now.The problem is that L is<br>             

 a decreasing function for wdfn and P as per my calculations is a 

increasing function . I arrived at this conclusion<br>              

because the derivative of L is always negative and the derivative of P 

is always positive (In the derivative of P, log (lamda)<br>             

 will always be negative as in his thesis,Amati states that for the PL2 

model, collection frequency of term &lt;&lt; Collection <br>             

 Size and so lamda will always be less than one .) .So, in order to find

 the upper bound,I simply substituted wdf=1 for L <br>              and used wdf = wdf_upper_bound for P and multiplied them by using upper doc length bound and lower doc length <br>              for wdfn of L and P respectively.However,this does not give that tight a bound.Not a word has been spoken about<br>

              upper bounds on DFR weights in Amati&#39;s thesis or on his papers on DFR .I even tried differentiating the product of <br>              L and P and equated that to zero as discussed on IRC but that landed me with a complicated equation with no answer <br>

              in sight.Please tell me what you&#39;ll think.<br><br>2.) Documentation <br><br>              This scheme belongs to a family of weighting schemes.Please do let me if I need to write any additional documentation<br>

               to introduce this new framework and the new weighting schemes.<br><br>Please do let know if you&#39;ll need additional information to help me out.Want to finish this scheme and move on the DPH scheme,which is yet another interesting weighting scheme of the DFR framework.<br>

<br>-Regards<br>-Aarsh<br><br><br><br><br>