[Xapian-discuss] last_mod performance

Olly Betts olly at survex.com
Tue Mar 3 00:53:58 GMT 2009


On Fri, Feb 20, 2009 at 04:57:37PM +1030, Frank J Bruzzaniti wrote:
> I found that last mod from that patch ends up deleting documents.
> But I found another patch on trac that worked, I've been testing it and 
> have found indexing time for me went down from 4 hours to about 8 minutes.

Clearly checking the last modified time is going to be a win when
nothing has changed.

The more interesting questions are what the overhead is when all the
documents need reindexing anyway, and if that overhead is measurable
what the break-even point is in terms of the proportion of documents
which need to have changed to make it a win.

If the overhead isn't measurable, this can just be always on.  If
it's significant, then perhaps the option should default off.

> I use a slightly modified version in here.
> 
> http://trac.xapian.org/attachment/ticket/290/office2007.patch

It would be helpful to have the patch you're actually testing, without
any other unrelated changes

Cheers,
    Olly



More information about the Xapian-discuss mailing list