[Xapian-discuss] My new record: Indexing 20 millions docs = 79m9.378s

Kevin Duraj kevin.softdev at gmail.com
Wed Feb 7 21:21:06 GMT 2007


Gentoo Linux 2.6
8 AMD Opteron 64-bit Processors
32GB Memory
--------------------------------------------------------------------------------

Environment:
------------------
XAPIAN_FLUSH_THRESHOLD=21000000
XAPIAN_FLUSH_THRESHOLD_LENGTH=16000000
XAPIAN_PREFER_FLINT=True
Indexing 20 million documents:
--stemmer=none
-------------------------------------------
real    79m9.378s
user    77m28.696s
sys     1m36.654s

# delve /home/kevin/index
---------------------------------------
number of documents = 19999995
average document length = 8.18631


PS: In my scenario after 25 million records the indexing significantly slows
down (2x-4x)
I do not know why? Could it be because of the B-Tree become very complex?

- Kevin Duraj


More information about the Xapian-discuss mailing list