[Xapian-devel] [GSOC 2014] Indexing INEX dataset

Olly Betts olly at survex.com
Tue Mar 11 14:17:56 GMT 2014


On Tue, Mar 11, 2014 at 12:02:15PM +0100, Parth Gupta wrote:
> During indexing with omindex, the only thing you need to make sure of is
> that the title is indexed with prefix 'S', as explained in the Letor
> documentation: xapian-letor/docs/letor.rst
> 
> Previously, when I edited omindex.cc, it was modified as can be seen here
> <http://trac.xapian.org/browser/svn/branches/gsoc2011-parth/xapian-applications/omega/omindex.cc>
> on line 838 and in the block at lines 1532-1559.
> 
> But now the same is provided by xapian-letor/bin/xapian-letor-update.cc,
> so before starting with questletor.cc you need to run it once for each db.
> In this case, all you need to make sure of is that the line below is in
> omindex.cc while indexing.
> 
> indexer.index_text(title, 1, "S");

On current trunk, we index the title with prefix "S" by default in
omindex, though with a wdf inc of 5 rather than 1:

            indexer.index_text(title, 5, "S");

So I don't think you need that change to omindex now.
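
To illustrate what the second argument changes, here is a toy model of the
bookkeeping. It is a sketch only and does not use Xapian: WdfTable and
toy_index_text are hypothetical names made up for this example, and the
tokenisation (split on whitespace, lowercase) is a simplifying assumption
rather than what the real term generator does. It only shows that each
occurrence of a word adds wdf_inc to the wdf of the prefixed term.

```cpp
#include <cctype>
#include <map>
#include <sstream>
#include <string>

// Toy stand-in for a per-document "term -> wdf" table.
// Hypothetical type for illustration; not Xapian's real data structure.
using WdfTable = std::map<std::string, unsigned>;

// Hypothetical helper mirroring the argument order of
// index_text(text, wdf_inc, prefix). Assumption: words are split on
// whitespace and lowercased, which is much simpler than real tokenisation.
void toy_index_text(WdfTable& wdf, const std::string& text,
                    unsigned wdf_inc, const std::string& prefix) {
    std::istringstream words(text);
    std::string word;
    while (words >> word) {
        for (char& c : word) {
            c = static_cast<char>(std::tolower(static_cast<unsigned char>(c)));
        }
        // Each occurrence adds wdf_inc to the prefixed term's wdf.
        wdf[prefix + word] += wdf_inc;
    }
}
```

In this model, calling toy_index_text(table, title, 5, "S") gives every
title word five times the weight that a wdf inc of 1 would, under the same
"S" prefix.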

Cheers,
    Olly


