[Xapian-devel] GSoc 2015 project ideas

aditi gupta aditiguptabits at gmail.com
Sat Feb 14 19:06:32 GMT 2015


Hey all,
I am Aditi Gupta, an aspiring Google Summer of Code 2015 participant.  I
had browsed through the plausible project ideas for GSoC 2015 on the wiki
page and particularly found two ideas very interesting. The  projects
'Weighting Schemes' and  'Learning to Rank' have seemed to capture my
imagination.
Having done the Information retrieval course at my university last semester
these two projects appealed to me the most. As a part of my project for the
course I had developed an automatic image annotation system prototype
modeled on the Latent Dirichlet Allocation  (LDA) probabilistic topic
modelling approach. Probably as a part of extending the features taken into
account when weighting terms by the Xapian library, the way LDA scores are
assigned can be considered to add to the extension to improve precision and
recall of search results. Also if some context specific parameters can be
modeled in the weighting schemes it might help in improving the performance
of the system.
I am also currently doing a study oriented projects under one of my
professors on Automatic text summarization and have done a literature
review on various techniques being employed for allotting relevance scores
to phrases/words in a sentence for picking salient sentences from a text.
Hopefully this background can get me started on finding relevant extensions
to weighting and ranking schemes.
Any suggestions and guidance on this front will be really appreciated.

I have got Xapian 1.2.19 built on my machine and am currently going through
the Getting started guide to get into the nitty-gritty of things.

Looking forward to a constructive discussion.

Cheers
Aditi
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20150215/6553360a/attachment.html>


More information about the Xapian-devel mailing list