[Xapian-discuss] different stemming
james cauwelier
james.cauwelier at gmail.com
Sat May 9 00:12:09 BST 2009
Hmm, the problem is that the site itself is in one language. But it's
perfectly possible that somebody Belgian wants to buy a Spanish book, and so
this person will search with Spanish keywords.
I was already afraid that I would have to get rid of the stemming.
2009/5/8 James Aylett <james-xapian at tartarus.org>
> On Fri, May 08, 2009 at 11:43:16AM +0200, james cauwelier wrote:
>
> > The site I am working on has products in different languages (Dutch,
> > English, French, Italian, Spanish). I want to search these products, but
> > while indexing I should use the correct stemmer. No problem, because I
> > know the language of a product description.
> >
> > But when somebody queries the database I have no information about the
> > language. Thus, I am not able to select the correct stemmer for queries.
> > How should I solve this? Skip stemming altogether? That's what I am
> > doing now.
>
> I know this isn't the most helpful answer, but "it depends". You could
> disable stemming, but this may have unhelpful effects on the quality
> of your results. That's almost certainly the simplest thing to do,
> though.
>
> If you can figure out what language they care about most, you can stem
> to that language and restrict the search to documents (products) that
> were in that language in the first place. You may be able to infer
> this from the same source you use to choose site localisation.
>
> J
>
> --
> James Aylett
>
> talktorex.co.uk - xapian.org - uncertaintydivision.org
>
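One way to picture the "stem to that language and restrict the search to documents in that language" idea when the query language is unknown is to run the query once per language and merge the results. The sketch below is plain Python and only illustrates the shape of the approach. The toy suffix-stripping `stem` function, the `SUFFIXES` table and the `Index` class are hypothetical stand-ins and not Xapian's API. With Xapian itself, one would presumably use its per-language stemmers plus a language filter term instead.

```python
from collections import defaultdict

# Toy suffix lists standing in for real stemmers (hypothetical, and far
# cruder than the Snowball stemmers a real search library provides).
SUFFIXES = {
    "english": ("ing", "es", "s"),
    "spanish": ("os", "as", "es", "o", "a", "s"),
    "dutch": ("en", "e", "s"),
}


def stem(word, language):
    """Strip the first matching suffix, leaving at least three characters."""
    word = word.lower()
    for suffix in SUFFIXES[language]:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


class Index:
    def __init__(self):
        # (language, stemmed term) -> set of product ids
        self.postings = defaultdict(set)

    def add(self, product_id, text, language):
        """Index a product using the stemmer for its known language."""
        for word in text.split():
            self.postings[(language, stem(word, language))].add(product_id)

    def search(self, query):
        """Try every language: stem the query for it, match only products
        indexed in that language, then union the per-language results."""
        hits = set()
        for language in SUFFIXES:
            matches = None
            for word in query.split():
                ids = self.postings.get((language, stem(word, language)), set())
                matches = ids if matches is None else matches & ids
            hits |= matches or set()
        return hits


index = Index()
index.add(1, "libros antiguos", "spanish")
index.add(2, "old books", "english")
index.add(3, "oude boeken", "dutch")

print(index.search("libro"))
print(index.search("books"))
```

Because each stemmed query is only matched against products of the same language, a Spanish stem cannot accidentally hit a Dutch or English product. The cost is one sub-query per supported language.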