[Xapian-tickets] [Xapian] #465: Stemmers which can produce multiple stems
Xapian
nobody at xapian.org
Thu Apr 15 01:25:09 BST 2010
#465: Stemmers which can produce multiple stems
-------------------------+--------------------------------------------------
Reporter: olly | Owner: olly
Type: defect | Status: new
Priority: normal | Milestone: 1.3.0
Component: Library API | Version:
Severity: normal | Blockedby:
Platform: All | Blocking:
-------------------------+--------------------------------------------------
The current API assumes exactly one stem per word, but some stemming
algorithms can produce multiple stems (and allowing a stemmer to produce
no stems at all would possibly be useful too).
For example:
* [http://thread.gmane.org/gmane.comp.search.xapian.devel/1512/focus=1513 Hebrew stemmers in libhspell]
* [http://en.wikipedia.org/wiki/Double_Metaphone Double Metaphone]
* [http://snowball.tartarus.org/otherapps/schinke/intro.html Schinke Latin stemming algorithm]
Likely to require API adjustments to handle well, so marking as 1.3.0
material for now.
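To make the shape of the problem concrete, here is a rough sketch of what a "zero or more stems" interface might look like. None of this is existing Xapian API: MultiStemmer, SingleStemAdapter, ToyAmbiguousStemmer and stemmed_terms are hypothetical names invented for illustration, and the toy stemming rule is not a real algorithm. The only Xapian convention assumed is the "Z" prefix used for stemmed terms.

```cpp
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical interface (NOT part of Xapian): a stemmer which may return
// zero, one, or several stems for a single word.
struct MultiStemmer {
    virtual ~MultiStemmer() {}
    virtual std::vector<std::string> stems(const std::string& word) const = 0;
};

// Adapter for an existing one-stem-per-word function, i.e. the shape the
// current API assumes. It always yields exactly one stem.
class SingleStemAdapter : public MultiStemmer {
    std::function<std::string(const std::string&)> stem_;

  public:
    explicit SingleStemAdapter(std::function<std::string(const std::string&)> f)
        : stem_(std::move(f)) {}

    std::vector<std::string> stems(const std::string& word) const override {
        return {stem_(word)};
    }
};

// Toy stemmer for illustration only: no stems for an empty word, two
// candidate stems for a word ending in 's', otherwise the word itself.
class ToyAmbiguousStemmer : public MultiStemmer {
  public:
    std::vector<std::string> stems(const std::string& word) const override {
        if (word.empty()) return {};
        if (word.size() > 1 && word.back() == 's')
            return {word, word.substr(0, word.size() - 1)};
        return {word};
    }
};

// Build one "Z"-prefixed term per stem, so a caller indexing or querying
// has to cope with any number of stemmed terms per word.
std::vector<std::string> stemmed_terms(const MultiStemmer& stemmer,
                                       const std::string& word) {
    std::vector<std::string> terms;
    for (const std::string& stem : stemmer.stems(word))
        terms.push_back("Z" + stem);
    return terms;
}
```

With something like this, callers iterate over whatever stems come back instead of assuming a single string, and existing single-stem algorithms could be wrapped rather than rewritten. How this would actually be exposed in the library is an open question.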
--
Ticket URL: <http://trac.xapian.org/ticket/465>
Xapian <http://xapian.org/>