[Xapian-devel] term duplication among index tables

Olly Betts olly at survex.com
Fri Nov 3 22:14:03 GMT 2006


On Fri, Nov 03, 2006 at 12:35:29PM -0800, Peter Friend wrote:
> Since the backends are basically B+ trees, I figured that
> space saved by using a term ID might allow more of the index pages
> for the other tables to be cached in memory (and possibly reduce disk
> hits)

This is definitely an important consideration.  Smaller keys mean more
entries per block, so more branching in the first few levels of the
Btree.  That reduces the number of levels overall, which in turn reduces
the number of disk blocks that must be read to get to an entry.
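
As a rough illustration of the effect, here is a back-of-the-envelope
sketch.  The block size, entry sizes, and entry count are made-up
assumptions for the example, not measurements from any Xapian backend,
and btree_levels is a hypothetical helper, not part of Xapian.  It
treats every block as full and every entry as the same size, which a
real Btree will not match exactly.

```python
def btree_levels(num_entries: int, block_size: int, entry_size: int) -> int:
    """Estimate how many levels a B-tree needs to address num_entries.

    Simplifying assumptions (not taken from Xapian): every block is
    completely full and every entry occupies exactly entry_size bytes.
    """
    fanout = block_size // entry_size  # entries that fit in one block
    if fanout < 2:
        raise ValueError("entry is too large for the block to branch")
    levels = 1
    capacity = fanout  # entries reachable with the current number of levels
    while capacity < num_entries:
        capacity *= fanout
        levels += 1
    return levels


if __name__ == "__main__":
    entries = 10**8   # hypothetical number of entries in a table
    block = 8192      # assumed block size in bytes
    # Assumed 40 bytes per entry when the key embeds the term string.
    print(btree_levels(entries, block, 40))  # 4
    # Assumed 8 bytes per entry when the key is a compact term ID.
    print(btree_levels(entries, block, 8))   # 3
```

With these assumed numbers the shorter key saves one level, i.e. one
block read per lookup when nothing is cached.  The upper levels are also
smaller, so they are more likely to stay in memory.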

> If I manage the time to attempt such an overhaul, I'll share what I
> find.

It would be interesting to hear the outcome of any experiments.

Cheers,
    Olly


