[Xapian-discuss] Newbie question: ESets, finding similar documents

Olly Betts olly at survex.com
Wed Dec 10 02:39:14 GMT 2008


On Wed, Dec 10, 2008 at 01:08:51AM +0000, Olly Betts wrote:
> On Tue, Dec 09, 2008 at 05:54:17PM +0000, Ben Campbell wrote:
> > But get_eset often returns me useless terms, eg:
> > ['Zsay', 'are', 'Zare', 'says', 'but', 'Zbut', 'be', 'it', 'Zyear', 
> > 'Zthat', 'that', 'is', 'Zis', 'Zit', 'Zbe', 'Zthere', 'on', 'Zon', 
> > 'for', 'Zfor']
> 
> I'm surprised that the list is so bad.

I took a look in case the code was wrong.  I think it is when handling
multiple databases, though it's unclear to me what effect the bug would
have.  But if you're expanding over multiple databases, this may not be
helping...

Cheers,
    Olly



More information about the Xapian-discuss mailing list