[Xapian-tickets] [Xapian] #609: term generation for some French elisions produces imperfect results.

Xapian nobody at xapian.org
Fri Jan 31 00:56:40 GMT 2025


#609: term generation for some French elisions produces imperfect results.
-------------------------+-------------------------------
 Reporter:  Paul Rudin   |             Owner:  Olly Betts
     Type:  enhancement  |            Status:  assigned
 Priority:  highest      |         Milestone:  1.5.0
Component:  QueryParser  |           Version:  git master
 Severity:  normal       |        Resolution:
 Keywords:               |        Blocked By:
 Blocking:               |  Operating System:  All
-------------------------+-------------------------------
Changes (by Olly Betts):

 * status:  new => assigned

Comment:

 I've pushed a change to Snowball to implement this:
 https://github.com/snowballstem/snowball/commit/664b9893ee16f4d5aa63f9898046f832976f98c4

 So far only ASCII apostrophe is handled (`'`) - we ought to handle Unicode
 apostrophe's too but that's a bit more fiddly because it's not in
 iso-8859-1 which Snowball upstream still supports.  For Xapian we only
 build UTF-8 stemmers so we could patch this in for now if necessary.
-- 
Ticket URL: <https://trac.xapian.org/ticket/609#comment:7>
Xapian <https://xapian.org/>
Xapian


More information about the Xapian-tickets mailing list