Guidance for GSoC

Olly Betts olly at survex.com
Tue Feb 20 20:41:32 GMT 2018


Hi Aasheesh,

Apologies for the delay - your email was held for moderation (the list
would be full of spam if we just accepted email from non-subscribers)
and I must have failed to process the queue for a week.  I've added
you to the whitelist, so you should be able to post directly now.

On Wed, Feb 14, 2018 at 01:31:49AM +0530, Aasheesh Tiwari wrote:
>  I am Aasheesh Tiwari from India. Currently, I am pursuing Masters in
> Geoinformatics from CSRE , Indian Institute of Technology Bombay (IITB). I
> have my bachelors in Computer Science. Currently, i am learning machine
> learning and natural language processing. In my coursework, i have been
> reading research papers on ' Semantic web ' to present a seminar at the end
> of the semester.
> 
> In your GSoC wiki page I found the topic 'Diversification of Results' to be
> interesting. But i could not understand what is the expectation as a GSoC
> project. It would be very helpful if you could shed some more light on the
> topic.

The high-level end goal is a new feature in Xapian which can be used to
provide more diversified results.

There are different aspects to diversification - an important one is the
"query with different meanings" such as the "jaguar" example in the
project idea - but it could also try to present results for a variety of
sources (e.g. a web search which returns all results from one site is
less useful), or in a variety of formats, etc.

For this project, we expect applicants to review the academic literature
and find an approach which has already been shown to work well in a
variety of situations and which can be implemented efficiently within
Xapian.  The project idea give a couple of links for possible starting
points.

Then (as for any project) you need to write a proposal which explains
what you plan to implement and how you plan to go about it, including a
timeline.

Cheers,
    Olly



More information about the Xapian-devel mailing list