[Xapian-devel] GSoc Project Idea Weighting Schemes (Ranking)

Abhishek Singh Kushwah abhishek18kushwah at gmail.com
Sun Nov 23 08:29:53 GMT 2014


Hi,
I am Abhishek

Currently Xapian::Weight follows BM25 scheme, many models such as the
Divergence from Randomness (DfR) family of models, Unigram Language Model
and the Bi-gram Language Model implemented two years ago in GSoc 2012 yet
not merged to the master.

The new weighing schemes or improvement in implementing the previous models
to change the default scheme of BM25 from SMART with reference to this
paper www.aclweb.org/anthology/P10-1141

After skimming through the schemes implemented in Xapian::weight. There
seems a considerable hope in editing the algorithms to increase efficiency
and speed and implementing new ones in use.

I would need mentors point of view regarding new schemes for the project
wrt SMART and others.

Thank You
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20141123/3e2aff8e/attachment.html>


More information about the Xapian-devel mailing list