[Xapian-discuss] add_posting(): term position significance - line or offset?

Olly Betts olly at survex.com
Tue Nov 18 20:04:58 GMT 2008


On Tue, Nov 18, 2008 at 08:29:34PM +0200, Henry wrote:
> This to my mind would be a rather important aspect of  
> scoring, and one which I'd like to explore with a view to possible  
> sponsored development (small personal purse, so don't get too excited:).

If you're looking for a simple but probably effective approach, then an
option to automatically turn:

    a b c
    
into something like this might work:

    (a b c) OR "a b c"

The phrase could have a window size (to allow non-exact matches) or
use OP_NEAR instead of OP_PHRASE.

And it's fairly easy to convert some examples by hand to see if the
results are good and how well it performs.

Cheers,
    Olly



More information about the Xapian-discuss mailing list