[Xapian-discuss] too slow when create index

Olly Betts olly at survex.com
Sun Oct 7 23:14:47 BST 2012


On Fri, Aug 31, 2012 at 10:56:09AM +0800, Eric Yue wrote:
> I am create index for some files,in my program,a document is  a line in a
> file.   i create index for very lines in a file.  is there any method to
> speed up this ??????

You don't give us much information about how you're indexing, so it's
hard to give specific advice, but in general increasing the batch size
will make indexing go faster - provided you have enough memory.  The
default is to auto-commit every 10000 document changes, which is fairly
conservative for modern hardware.  And if each line in a file is a
document, then your documents are presumably very small so you should be
able to raise this a lot.

To set it, you just set XAPIAN_FLUSH_THRESHOLD in the environment (and
make sure it is exported) - e.g.:

env XAPIAN_FLUSH_THRESHOLD=1000000 ./my-indexer

Cheers,
    Olly



More information about the Xapian-discuss mailing list