[Xapian-discuss] BUG IN XAPIAN_FLUSH_THRESHOLD

Kevin Duraj kevin.softdev at gmail.com
Tue Jul 17 19:36:10 BST 2007


There is is bug when setting XAPIAN_FLUSH_THRESHOLD=20000000

When trying for force Xapian flush documents to flush after 20 million
documents Xapian ignores the size and flush it after only 10,000
documents.

Data captured from delve after 60 seconds interval when has been set as follow:
XAPIAN_FLUSH_THRESHOLD=20000000

perl -e ' while(1) { system("delve ."); sleep(60); } '

number of documents = 8510000
average document length = 13.5538
number of documents = 8520000
average document length = 13.5537
number of documents = 8530000
average document length = 13.5543
number of documents = 8530000
average document length = 13.5543
number of documents = 8540000
average document length = 13.5548
number of documents = 8550000
average document length = 13.5548
number of documents = 8550000
average document length = 13.5548
number of documents = 8560000
average document length = 13.5545
number of documents = 8570000
average document length = 13.5549
number of documents = 8570000
average document length = 13.5549
number of documents = 8580000
average document length = 13.5563
number of documents = 8590000
average document length = 13.5568

PS: Please do not ask me create smaller index and then merge them. I
am indexing 500 million documents. 20 million is my small index.

-- 
Cheers,
   Kevin Duraj



More information about the Xapian-discuss mailing list