[Xapian-tickets] [Xapian] #609: term generation for some French elisions produces imperfect results.
Xapian
nobody at xapian.org
Fri Jan 31 00:56:40 GMT 2025
#609: term generation for some French elisions produces imperfect results.
-------------------------+-------------------------------
Reporter: Paul Rudin | Owner: Olly Betts
Type: enhancement | Status: assigned
Priority: highest | Milestone: 1.5.0
Component: QueryParser | Version: git master
Severity: normal | Resolution:
Keywords: | Blocked By:
Blocking: | Operating System: All
-------------------------+-------------------------------
Changes (by Olly Betts):
* status: new => assigned
Comment:
I've pushed a change to Snowball to implement this:
https://github.com/snowballstem/snowball/commit/664b9893ee16f4d5aa63f9898046f832976f98c4
So far only ASCII apostrophe is handled (`'`) - we ought to handle Unicode
apostrophe's too but that's a bit more fiddly because it's not in
iso-8859-1 which Snowball upstream still supports. For Xapian we only
build UTF-8 stemmers so we could patch this in for now if necessary.
--
Ticket URL: <https://trac.xapian.org/ticket/609#comment:7>
Xapian <https://xapian.org/>
Xapian
More information about the Xapian-tickets
mailing list