[Xapian-discuss] Japanese stemming

Olly Betts olly at survex.com
Mon Apr 18 03:00:54 BST 2005


On Sun, Apr 17, 2005 at 10:07:41PM +0900, Seo Sanghyeon wrote:
> "A stemming algorithm is a process of linguistic normalisation, in which
> the variant forms of a word are reduced to a common form... For many of
> the world's languages, Chinese and Japanese for example, this concept is
> irrelevant,"
> 
> Which I found very strange. Of course, stemming is very valuable in
> Japanese language.

Thanks for pointing this out.  I've removed Japanese as an example here.

> Yes, as you can see, I started to learn Japanese recently. :-) I am
> not sure I may try to write Japanese stemmer myself... Can anyone
> help?
> 
> I visited the Snowball site and read the manual there. It was an
> interesting read.

If you're interested in collaborating on a Japanese stemmer, I'd suggest
asking on the snowball mailing list.

Cheers,
    Olly



More information about the Xapian-discuss mailing list