[Xapian-devel] Indexing INEX collection for your GSoC Project

Olly Betts olly at survex.com
Mon May 19 12:26:39 BST 2014


On Mon, May 19, 2014 at 11:58:36AM +0200, Parth Gupta wrote:
> To index these XML documents, you can simply treat them as HTML by
> passing "--mime-type xml:text/html". This is not the correct way, but
> it does the job and gets you started.

While that's fine for the letor projects, the point of Aarsh's work on
this is to produce a performance test suite, so indexing the data via
omindex probably isn't a good approach.

Cheers,
    Olly


