[Xapian-devel] Learning to rank

pankaj singhal pankajsinghal at ieee.org
Mon Mar 26 13:40:53 BST 2012


Hey Parth,

Would the inclusion of
WEKA<http://www.cs.waikato.ac.nz/%7Eml/weka/index.html>be encouraged,
referring to the update "If there is a public library
available for the ML part of the algorithm then it is advisable to use it".

regards,

On Sun, Mar 25, 2012 at 9:49 PM, Parth Gupta <parthg.88 at gmail.com> wrote:

> Hello Pankaj,
>
> I think based on the updated info of the LTR project (below link) you
> should be able to think specifically about your idea and if you want you
> can discuss it here or on IRC about the fine details involved.
>
> http://trac.xapian.org/wiki/GSoCProjectIdeas#Project:LearningtoRank
>
> Moreover, for the general details you may want to refer
> http://trac.xapian.org/wiki/GSoC2012
>
> Thanks for your interest.
>
> Parth.
>
>
>
>
> On Sun, Mar 25, 2012 at 2:27 AM, pankaj singhal <pankajsinghal at ieee.org>wrote:
>
>>  Dear Sir,
>>                  I am Pankaj Singhal from Jaipur, India. I am very much
>> interested and strongly looking forward in getting involved in this project
>> Learning-to-Rank.
>>
>> My previous experience in this field is good. Last semester I did a
>> similar job of ranking the URLs of the given huge dataset based on their
>> attribute values. The dataset consisted hundreds of thousands of URLs and
>> each url consisted of around 33000 features and a binary class label with
>> +1 OR -1 value. I applied the Decision Tree induction(GINI INDEX) Approach
>> for filtering out the URLs and then applying a RANKSUM[1] metric, which
>> uses weighted sum approach, to rank the URLs accordingly.
>>
>> The current implementation involves firstly the unsupervised ranking of a
>> query and then applying a supervised learning algorithm, SVM, on the first
>> 'n' documents retrieved.
>> A similar approach can be incorporated while extending the problem of
>> ranking with a better supervised learning algorithm and probabilistic model
>> viz. Bayesian Belief Networks i.e. it can be applied after fetching 'n'
>> documents from either of the two approaches, unsupervised ranking or SVM
>> ranking.
>>
>> Incorporating pairwise approach would also be a good idea, there are
>> various algorithms available.
>>
>>
>>
>>
>> [1] - Rank-Order Weighting of Web Attributes for Website Evaluation -
>> Mehri Saeid, Abdul Azim Abd Ghani, and Hasan Selamat
>>
>>
>> regards,
>>
>> --
>> Pankaj Singhal
>> III Year, CSE
>> The LNMIIT, Jaipur, India.
>>
>> Mob: +918875053936
>>
>>
>>
>>
>> _______________________________________________
>> Xapian-devel mailing list
>> Xapian-devel at lists.xapian.org
>> http://lists.xapian.org/mailman/listinfo/xapian-devel
>>
>>
>


-- 
Pankaj Singhal
III Year, CSE
The LNMIIT, Jaipur, India.

Mob: +918875053936
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20120326/4ffe9c9c/attachment.htm>


More information about the Xapian-devel mailing list