[Xapian-discuss] Ranking and term proximity

goran kent gorankent at gmail.com
Sun Sep 4 14:43:07 BST 2011


Hi,

I was reading an article recently about how google ranks results
(among many other things of course) based on the proximity of the
search terms in the source documents.  In addition, the position of
the search terms in the search query string itself is also taken into
consideration when determining how important each term is.

Does Xapian do something similar - at least for the first part?

For example, if I search for 'Olly Betts' - without double quotes in
two documents the first of which the terms 'Olly' and 'Betts' are
widely separated, and the second contains the terms 'Olly Betts' right
next to each other, will the latter document score higher?  Please
tell me it is.

I can understand the position information in the search string itself
not being used, but surely term proximity is used?

Thanks



More information about the Xapian-discuss mailing list