Reparsing queries (was Re: [Xapian-devel] Re: [Xapian-commits] Changes in xapian/xapian-applications/queryserver/ xapian/xapian-applications/queryserver/source/)

Olly Betts olly at survex.com
Thu May 13 17:55:44 BST 2004


On Tue, May 11, 2004 at 06:30:47PM +0100, Olly Betts wrote:
> On Tue, May 11, 2004 at 05:41:58PM +0100, Richard Boulton wrote:
> > * If the queryserver can't parse a query, strip out all special
> >   characters, and then retry.  This means that we get some kind of
> >   result even if the query is broken.
> 
> That's probably a feature worth pushing down into Xapian::QueryParser...

I've now done this.  It's essentially the same as the queryserver patch,
except that "@" is also stripped.  It seems arbitrary to leave "@" in
while stripping the other phrase generators (especially "'", since
stripping it breaks up contractions such as "isn't").  As it stands, this
works well on the sample of real-world queries from tweakers.net, whereas
not stripping *any* phrase generators seems to do slightly less well.

I think this is worth revisiting once the following bug is addressed:

http://www.xapian.org/cgi-bin/bugzilla/show_bug.cgi?id=22

Thoughts?

Cheers,
    Olly



