[Xapian-devel] Something to think about

James Aylett james-xapian at tartarus.org
Sun Oct 14 13:11:21 BST 2007

On Fri, Oct 12, 2007 at 05:27:00PM +0100, Olly Betts wrote:

> > Could we have a mechanism where the size of each opened database is
> > taken into account, perhaps doubled to provide padding, and if any one
> > overflows its padding a new exception is raised? If that were an
> > optional strategy, it would work in the majority of cases (could even
> > be the default).
> Obviously we could, but it wouldn't help in the common case where the
> database is opened afresh by each search process.

If the db is opened each time, the document ids aren't guaranteed to
be consistent anyway, so does that case actually matter?

> One approach (rather a long term one) is to allow arbitrary docids -
> then searching over multiple databases can simple prefix the docids from
> each.  Probably the main challenge there is to maintain the ability to
> store them compactly.

I think if we want to do something like that, we'd want to separate
the idea of a user-level identifying docid, managed by the system, and
an underlying db-internal id. (Kind of like oids in postgresql.)
Although that has some nasty consequences, so maybe not. Hmm :-/


  James Aylett                                                  xapian.org
  james at tartarus.org                               uncertaintydivision.org

More information about the Xapian-devel mailing list