Regarding GSoC 2016 project idea

James Aylett james-xapian at tartarus.org
Tue Mar 22 12:46:26 GMT 2016


On Tue, Mar 22, 2016 at 11:12:12AM +0530, Ainish Dave wrote:

> If I take up the 'clustering the search results' task and opt for a
> new method to cluster the results, roughly how much data will be
> clustered? Knowing that would help in choosing an appropriate
> clustering mechanism.

The intention is to use this in 'live search' environments, so you'd
probably want to pull a medium number of documents from the top of the
search. It's a little difficult to say what a good number is, because
it will depend somewhat on how broad the search is. But maybe in the
100..1000 range?
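
As a rough illustration of "pull the top of the search and cluster only
that", here is a minimal Python sketch. It is not Xapian code: the
function name, the default of 500 and the (docid, weight) tuples are all
made up for the example, and the 100 and 1000 bounds are just the rough
range suggested above, not tuned values.

```python
from typing import List, Sequence, Tuple

# Assumed bounds, taken from the rough 100..1000 range suggested above.
MIN_DOCS = 100
MAX_DOCS = 1000


def top_docs_for_clustering(
    ranked: Sequence[Tuple[int, float]], wanted: int = 500
) -> List[Tuple[int, float]]:
    """Return the best-ranked (docid, weight) pairs to hand to a clusterer.

    `ranked` is assumed to be sorted best match first, as a search
    engine would return it. `wanted` is clamped into MIN_DOCS..MAX_DOCS.
    """
    limit = max(MIN_DOCS, min(MAX_DOCS, wanted))
    # Slicing never over-runs: a narrow search simply yields fewer documents.
    return list(ranked[:limit])


if __name__ == "__main__":
    # Hypothetical result list: 5000 matches with decreasing weights.
    results = [(docid, 1.0 / docid) for docid in range(1, 5001)]
    print(len(top_docs_for_clustering(results)))        # 500
    print(len(top_docs_for_clustering(results, 5000)))  # 1000
    print(len(top_docs_for_clustering(results[:40])))   # 40
```

In Xapian itself the equivalent step would most likely be asking
Enquire::get_mset() for a limited number of items rather than slicing a
list afterwards, so a narrow search just gives the clusterer a smaller
set to work with.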

J

-- 
  James Aylett, occasional trouble-maker
  xapian.org


