[Xapian-discuss] query within value range

Richard Boulton richard at lemurconsulting.com
Tue Jun 5 20:57:42 BST 2007


Mark Clarkson wrote:
> So for making the programmers life that much easier is the performance
> expected to be better or worse than boolean when using a value
> processor?

Yes. :)

That's a particularly hard question to answer, because it depends on the 
  context.  Briefly:

Hopefully, the value table will be quite small, so if it's being used 
reasonably frequently, it should get cached in memory quite well. 
Therefore, the overhead of using a value instead of a boolean shouldn't 
be too high.

On the other hand, the information required for a single search using 
the value table will be more spread out than a postlist for a single 
boolean term, so you if the value table hasn't got cached well, you 
might well find that searches are slower.

On the third hand, to do a range query using booleans will often require 
an OR of lots of boolean terms, so the value option might be quite a bit 
faster, due to having to read less from disk.

Finally, if your search only returns a few results, the value will only 
be checked for those results, so the overhead should be minimal.  If the 
search returns lots of results, the value could be checked quite 
frequently, so you might get a much higher overhead.

So - without more information, your guess is as good as mine.

I'm working on some performance tests at this very moment, which include 
some tests of the performance of a search using a value range 
restriction, so I'll let the list know what I find.

-- 
Richard



More information about the Xapian-discuss mailing list