Improving partial lookup results

Peter Bowyer peter at mapledesign.co.uk
Fri Sep 20 16:31:43 BST 2019


On Thu, 19 Sep 2019 at 21:45, Olly Betts <olly at survex.com> wrote:

> Incidentally, if you're actually aiming to match different forms of a
> name (Peter vs Pete, Ann vs Anne vs Annette) then you might find the
> synonym feature a better option than wildcarding.
>
> You'd need to give it a list of names to treat as synonyms, but it
> should have many fewer false positives, and can also handle cases
> which aren't just a substring (e.g. Robert vs Rob vs Bob vs Bobby), or
> which look entirely different (e.g. Terence vs Terrence vs Spike, or
> Margaret vs Peggy vs Daisy).
>

This is exactly what I want to do - pending finding a suitable dataset
that's free to use.
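
As a rough sketch of what such a dataset could be turned into, the
snippet below builds a symmetric synonym table from groups of names.
The groups, the function names and the table layout are all
hypothetical, made up for illustration; a real list would come from
whatever dataset turns up. It uses plain Python only and does not call
Xapian itself. The assumption is that each (term, synonym) pair would
then be stored with the database's add_synonym() method and picked up
by the QueryParser when its synonym flags are enabled.

```python
from collections import defaultdict

# Hypothetical hand-written name groups, for illustration only.
# A real list would be loaded from a suitable dataset.
NAME_GROUPS = [
    ["robert", "rob", "bob", "bobby"],
    ["margaret", "peggy", "daisy"],
    ["ann", "anne", "annette"],
]


def build_synonym_table(groups):
    """Map each name to the sorted list of the other names in its group."""
    table = defaultdict(set)
    for group in groups:
        for name in group:
            table[name].update(n for n in group if n != name)
    return {term: sorted(syns) for term, syns in table.items()}


def expand(term, table):
    """Return the term followed by its synonyms (lookup is case-insensitive)."""
    return [term] + table.get(term.lower(), [])


SYNONYMS = build_synonym_table(NAME_GROUPS)

# Each (term, synonym) pair in SYNONYMS is what would be handed to the
# search engine's synonym store (assumed here to be add_synonym()).
```

With this table, expand("Bob", SYNONYMS) gives Bob plus bobby, rob and
robert, while a name that is in no group comes back on its own, which
is the "fewer false positives" behaviour compared with wildcarding.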

Peter

