<div dir="ltr">Hi everyone,
<div><br></div><div>I have been working on the Paice-Husk Stemmer, which is listed as a Bite Size Project for Xapian, and I have written both a C++ and a Snowball version of it.</div><div><br></div><div>I read through the algorithm and took the rules from here: <a href="http://www.comp.lancs.ac.uk/computing/research/stemming/paice/descript.htm">http://www.comp.lancs.ac.uk/computing/research/stemming/paice/descript.htm</a> </div>
<div><br></div><div>The C++ code reads the rules from a file and generates the stem of a given word, whereas the Snowball version has the rules written directly into it. This is because Snowball has no file handling, so I wrote a C++ program that generates the Snowball code (Code-ception :P).</div>
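<div><br></div><div>For anyone unfamiliar with the rule format, here is a minimal sketch of how a single rule string could be parsed and applied. It is not the code from the repository; the names Rule, parse_rule, acceptable and apply_rule are made up for illustration, the rule layout (reversed ending, optional "*" intact flag, remove count, optional append string, then ">" or ".") and the acceptability check follow my reading of the description linked above, and words are assumed to be lowercase ASCII.</div>
<pre>
```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Hypothetical representation of one rule such as "ytic3s." or "ai*2.".
struct Rule {
    std::string ending;   // suffix to match, stored in normal reading order
    bool intact_only;     // '*': apply only if the word has not been stemmed yet
    int remove_count;     // number of trailing characters to delete
    std::string append;   // string appended after deletion (may be empty)
    bool continue_after;  // '>' means keep stemming, '.' means stop
};

// Parse a rule string; assumes the text is well formed.
Rule parse_rule(const std::string& text) {
    Rule r{};
    std::size_t i = 0;
    while (i < text.size() && std::isalpha(static_cast<unsigned char>(text[i]))) ++i;
    r.ending = text.substr(0, i);
    std::reverse(r.ending.begin(), r.ending.end());  // endings are written reversed
    if (i < text.size() && text[i] == '*') { r.intact_only = true; ++i; }
    if (i < text.size() && std::isdigit(static_cast<unsigned char>(text[i]))) {
        r.remove_count = text[i] - '0';
        ++i;
    }
    std::size_t j = i;
    while (j < text.size() && std::isalpha(static_cast<unsigned char>(text[j]))) ++j;
    r.append = text.substr(i, j - i);
    r.continue_after = (j < text.size() && text[j] == '>');
    return r;
}

bool is_vowel(char c) {
    return c == 'a' || c == 'e' || c == 'i' || c == 'o' || c == 'u';
}

// Acceptability check as I understand it: a stem starting with a vowel needs
// at least 2 letters; otherwise at least 3 letters including a vowel or 'y'.
bool acceptable(const std::string& stem) {
    if (stem.empty()) return false;
    if (is_vowel(stem[0])) return stem.size() >= 2;
    if (stem.size() < 3) return false;
    for (char c : stem) {
        if (is_vowel(c) || c == 'y') return true;
    }
    return false;
}

// Try one rule on `word`; returns true and rewrites `word` only if the rule fires.
bool apply_rule(const Rule& r, std::string& word, bool intact) {
    if (r.intact_only && !intact) return false;
    if (word.size() < r.ending.size()) return false;
    if (static_cast<std::size_t>(r.remove_count) > word.size()) return false;
    if (word.compare(word.size() - r.ending.size(), r.ending.size(), r.ending) != 0)
        return false;
    std::string stem = word.substr(0, word.size() - r.remove_count) + r.append;
    if (!acceptable(stem)) return false;
    word = stem;
    return true;
}
```
</pre>
<div>With this sketch, applying parse_rule("ytic3s.") to "electricity" gives "electrics", while parse_rule("gni3>") leaves "sing" unchanged because the stem "s" would not be acceptable. A full stemmer would loop over the rule table, using continue_after to decide whether to keep going.</div>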
<div><br></div><div>Since the algorithm has many steps, my code might contain some mistakes.</div><div><br></div><div>Both versions are located here: <a href="https://github.com/satwantrana/codes">https://github.com/satwantrana/codes</a></div>
<div><br></div><div>I will be integrating this into my Xapian fork and will release a patch soon. Meanwhile, if anyone finds a bug or mistake in it, please reply.</div><div><br></div><div>Also, I hope this implementation helps my GSoC application.</div>
<div><br></div><div>Thanks,</div><div>Satwant Rana</div></div>