[Xapian-discuss] Is there a better way to do a distributed search server with xapian?

Richard Boulton richard at tartarus.org
Tue Sep 14 14:46:52 BST 2010


On 14 September 2010 14:26, hightman <hightman at zuaa.zju.edu.cn> wrote:
> In fact, Xapian reads the data (termlist, postlist ...) from each database in a linear mode
> and then calculates the matched result.  So when I have too many remote databases,
> this implementation will be terrible and inefficient, because it can't really search on
> these machines simultaneously.

If you perform a search across multiple machines using the remote
database protocol, Xapian will perform the search in parallel across
those machines.  It uses a two-pass protocol.  In the first pass, the
query is sent to all the sub-machines, and each sub-machine returns
statistics for the frequencies of the query terms on that machine.  In
the second pass, the global statistics are sent to all the
sub-machines, and each sub-machine returns its top results.
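
As a rough illustration of the two-pass exchange described above, here
is a toy simulation in plain Python.  It does not use Xapian's API or
wire protocol: the Shard class, the search() function, and the simple
tf * idf score are all hypothetical stand-ins, and the final merge of
the per-machine results is an assumption about how the top hits get
combined.  It only shows why global statistics have to be gathered
before any machine can score its documents comparably.

```python
import math
from concurrent.futures import ThreadPoolExecutor


class Shard:
    """Hypothetical stand-in for one sub-machine (not Xapian's API)."""

    def __init__(self, docs):
        self.docs = docs  # maps doc id -> list of terms in that document

    def local_stats(self, terms):
        # Pass 1: report the local document count and, for each query
        # term, how many local documents contain it.
        freqs = {t: sum(1 for words in self.docs.values() if t in words)
                 for t in terms}
        return len(self.docs), freqs

    def top_results(self, terms, total_docs, global_freqs, k):
        # Pass 2: score local documents using the *global* statistics,
        # so scores are comparable across machines.  The tf * idf
        # formula is an illustrative assumption, not Xapian's weighting.
        scored = []
        for doc_id, words in self.docs.items():
            score = sum(
                words.count(t) * math.log(1 + total_docs / global_freqs[t])
                for t in terms if global_freqs[t] > 0
            )
            if score > 0:
                scored.append((score, doc_id))
        scored.sort(key=lambda hit: (-hit[0], hit[1]))
        return scored[:k]


def search(shards, terms, k=3):
    # Both passes are fanned out to all shards concurrently.
    with ThreadPoolExecutor(max_workers=len(shards)) as pool:
        stats = list(pool.map(lambda s: s.local_stats(terms), shards))

        # Combine the per-shard statistics into global statistics.
        total_docs = sum(count for count, _ in stats)
        global_freqs = {t: sum(freqs[t] for _, freqs in stats)
                        for t in terms}

        partial = list(pool.map(
            lambda s: s.top_results(terms, total_docs, global_freqs, k),
            shards))

    # Assumed final step: merge each shard's top hits and keep the best k.
    merged = sorted((hit for hits in partial for hit in hits),
                    key=lambda hit: (-hit[0], hit[1]))
    return merged[:k]


if __name__ == "__main__":
    shards = [
        Shard({"a1": ["xapian", "search"], "a2": ["remote"]}),
        Shard({"b1": ["xapian", "xapian", "remote"]}),
    ]
    for score, doc_id in search(shards, ["xapian"], k=2):
        print(doc_id, round(score, 3))
```

Each pass costs one round trip to every machine, but the round trips
to different machines overlap rather than running one after another.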

-- 
Richard


