[Xapian-discuss] Open source search engines compared

Henry henka at cityweb.co.za
Mon Jul 13 05:26:07 BST 2009


Quoting "Kevin Duraj" <kevin.softdev at gmail.com>:
> Here is proof how fast search goes on 500GB of data using Xapian, can
> Lucene do that on single server? ... of course not.
> http://myhealthcare.com

I had a look, and I must say the performance is very encouraging  
indeed.  I say this since we're busy with an indexing run of almost a  
TB of gzipped (mostly) HTML data with no idea of how it's going to  
perform on a search cluster, or how big the final cluster will need to  
be.  I just wished it indexed faster...

May I chuck a few questions at you?

Are you indeed using a /single/ machine for search on 500GB?

What's the spec?  Main memory, RAID0, using SSD...?

Do you use spelling?

How many documents are talking about (representing 500GB)?


Thanks
Henry

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: PGP Digital Signature
Url : http://lists.xapian.org/pipermail/xapian-discuss/attachments/20090713/4646a9ef/attachment.pgp 


More information about the Xapian-discuss mailing list