[Xapian-discuss] add_posting(): term position significance - line or offset?
Olly Betts
olly at survex.com
Tue Nov 18 20:04:58 GMT 2008
On Tue, Nov 18, 2008 at 08:29:34PM +0200, Henry wrote:
> This to my mind would be a rather important aspect of
> scoring, and one which I'd like to explore with a view to possible
> sponsored development (small personal purse, so don't get too excited:).
If you're looking for a simple but probably effective approach, then an
option to automatically turn:
a b c
into something like this might work:
(a b c) OR "a b c"
The phrase could have a window size (to allow non-exact matches), or the
second branch could use OP_NEAR instead of OP_PHRASE.
And it's fairly easy to convert some examples by hand to see if the
results are good and how well it performs.
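For trying a few examples by hand, the rewrite above can be sketched as a
plain string transformation applied before the query string reaches the
QueryParser. This is only an illustration, not an existing Xapian option:
the helper name expand_with_phrase and its near_window parameter are
hypothetical. It assumes the input is a simple whitespace-separated list of
terms with no quotes or operators, and that the parser has boolean and
phrase support enabled so that OR, NEAR and quoted phrases are recognised.

```python
def expand_with_phrase(query, near_window=None):
    """Rewrite 'a b c' as '(a b c) OR "a b c"'.

    Hypothetical helper: if near_window is given, the second branch is
    built with the query parser's NEAR/n syntax instead of a quoted phrase.
    """
    terms = query.split()
    if len(terms) < 2:
        # A single term has no phrase to boost, so return it unchanged.
        return " ".join(terms)
    plain = " ".join(terms)
    if near_window is None:
        # Exact phrase branch, as in the example above.
        return '(%s) OR "%s"' % (plain, plain)
    # Proximity branch: terms within near_window positions, in any order.
    op = " NEAR/%d " % near_window
    return "(%s) OR (%s)" % (plain, op.join(terms))


if __name__ == "__main__":
    print(expand_with_phrase("a b c"))
    print(expand_with_phrase("a b c", near_window=5))
```

Running a handful of real queries through both the original and the
rewritten form, and comparing the top results and timings, should show
whether the extra phrase branch is worth it.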
Cheers,
Olly