[Xapian-discuss] Ticket #342: Omega: Add option to avoid reindexing unchanged files

Richard Boulton richard at lemurconsulting.com
Wed May 20 20:16:37 BST 2009


2009/5/20 Olly Betts <olly at survex.com>:
>> Maybe the test for changed content should depend on the md5sum and not on
>> the date (even though this does add more burden than just checking the last
>> mod date). Something roughly like this:
>
> Yes, it's quite a lot more work, but it would save some work.  A fuller
> solution to ticket #250 would reduce the gain here, but there would
> probably still be some:

Another approach is to check whether the file size has changed as well as
the date. It won't catch every change, of course (an edit that leaves both
the size and the date unchanged still slips through), but it's a lot
cheaper than computing the MD5 sum of the file: if you've done a stat(),
you've already got the size available.
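
To make the idea concrete, here is a rough sketch in Python rather than
Omega's own C++. It is not how Omega does (or would do) this; the names
FileStamp, stamp_of and needs_reindex are made up for illustration, and
it assumes the size and modification time from the previous indexing run
have been stored somewhere alongside the document.

```python
import os
from typing import NamedTuple, Optional


class FileStamp(NamedTuple):
    """Size and modification time recorded when a file was last indexed.

    Hypothetical record type; where it gets stored is left open.
    """
    size: int
    mtime_ns: int


def stamp_of(path: str) -> FileStamp:
    # A single stat() call yields both fields, so comparing the size
    # costs nothing extra once the mtime is being checked anyway.
    st = os.stat(path)
    return FileStamp(size=st.st_size, mtime_ns=st.st_mtime_ns)


def needs_reindex(path: str, previous: Optional[FileStamp]) -> bool:
    # Reindex if the file has never been seen, or if either the size
    # or the mtime differs from what was recorded last time.
    # An edit that preserves both size and mtime is still missed;
    # only a content hash such as MD5 would catch that case.
    if previous is None:
        return True
    return stamp_of(path) != previous
```

The size comparison catches the case where a file is modified but its
timestamp is reset or copied from elsewhere, without ever reading the
file contents.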

-- 
Richard


