[Xapian-discuss] Xapian and quartz scalability - feedback of current users

Olly Betts olly at survex.com
Tue Mar 22 12:00:10 GMT 2005


On Tue, Mar 22, 2005 at 11:52:37AM +0000, Olly Betts wrote:
> > On what basis are the quartz databases separated?
> 
> If you're searching over several unmerged databases, try to make each
> one a representative sample of the whole corpus, because by default
> Xapian approximates term frequencies by looking at those in just one
> database (the first, I think, but check to be sure!).  This is for
> efficiency.

Oh, I forgot to say that this approximation is only used for "expand"
operations (which Omega uses to implement "topterms").  So this doesn't
matter if you don't plan to use "expand".
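
To illustrate why a representative sample matters for this kind of
approximation, here is a toy sketch in plain Python.  It does not use
Xapian's API and is not a description of Xapian's internals: the
function names, the sample documents, and the idea of scaling the first
database's counts by the ratio of corpus size to database size are all
assumptions made for illustration only.

```python
from collections import Counter


def term_doc_freqs(docs):
    """Count, for each term, how many documents in `docs` contain it."""
    freqs = Counter()
    for doc in docs:
        # set() so a term repeated within one document counts once
        freqs.update(set(doc.split()))
    return freqs


def exact_freqs(shards):
    """Document frequencies computed over every shard (the costly way)."""
    freqs = Counter()
    for shard in shards:
        freqs.update(term_doc_freqs(shard))
    return dict(freqs)


def estimated_freqs(shards):
    """Approximate frequencies from the first shard only.

    ASSUMPTION: the first shard's counts are scaled up by
    total_docs / len(first_shard).  This is a hypothetical scheme for
    illustration, not necessarily what Xapian does.
    """
    total_docs = sum(len(shard) for shard in shards)
    first = shards[0]
    scale = total_docs / len(first)
    return {term: freq * scale for term, freq in term_doc_freqs(first).items()}


# Hypothetical corpus split by topic: shard 0 is NOT representative.
skewed = [
    ["xapian quartz index", "xapian quartz search"],
    ["omega cgi search", "omega template search"],
]

# The same four documents, split so each shard mixes both topics.
mixed = [
    ["xapian quartz index", "omega cgi search"],
    ["xapian quartz search", "omega template search"],
]
```

With the topic-based split, the estimate never sees "omega" at all,
because that term does not occur in the first shard; with the mixed
split, the estimate for "omega" matches the exact count.  Comparing
estimated_freqs(skewed) and estimated_freqs(mixed) against
exact_freqs(skewed) shows the difference.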

Cheers,
    Olly


