[Xapian-discuss] Newbie question: ESets, finding similar documents
Olly Betts
olly at survex.com
Wed Dec 10 02:39:14 GMT 2008
On Wed, Dec 10, 2008 at 01:08:51AM +0000, Olly Betts wrote:
> On Tue, Dec 09, 2008 at 05:54:17PM +0000, Ben Campbell wrote:
> > But get_eset often returns me useless terms, eg:
> > ['Zsay', 'are', 'Zare', 'says', 'but', 'Zbut', 'be', 'it', 'Zyear',
> > 'Zthat', 'that', 'is', 'Zis', 'Zit', 'Zbe', 'Zthere', 'on', 'Zon',
> > 'for', 'Zfor']
>
> I'm surprised that the list is so bad.
I took a look in case the code was wrong. I think it is when handling
multiple databases, though it's unclear to me what effect the bug would
have. But if you're expanding over multiple databases, this may not be
helping...
Cheers,
Olly
More information about the Xapian-discuss
mailing list