[Xapian-tickets] [Xapian] #150: Enhancements to Unicode support

Xapian nobody at xapian.org
Tue Dec 10 03:08:44 GMT 2024


#150: Enhancements to Unicode support
-------------------------+-------------------------------
 Reporter:  Olly Betts   |             Owner:  Olly Betts
     Type:  enhancement  |            Status:  assigned
 Priority:  normal       |         Milestone:  2.0.0
Component:  QueryParser  |           Version:  git master
 Severity:  minor        |        Resolution:
 Keywords:               |        Blocked By:
 Blocking:               |  Operating System:  All
-------------------------+-------------------------------
Comment (by Olly Betts):

 > I think this needs a deeper analysis, but possibly we could define a
 > subset of the Unicode compatibility equivalent forms to use here.

 Thinking about this more, defining our own subset is unhelpful: user code
 can use existing libraries (or language support) to convert to NFC or NFKC,
 but to get "Xapian-NFC" they'd need to write their own conversion code, or
 (more sensibly) we'd need to provide conversion functionality.  If we're
 going to have to provide it, it seems better to just convert the cases we
 want converted internally, so we might pick NFC and then internally
 handle cases such as the "ffi" ligature.
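
 To illustrate the difference being discussed, here is a rough sketch in
 Python (not Xapian code) using the standard unicodedata module.  NFC leaves
 the "ffi" ligature (U+FB03) alone, while NFKC expands it along with many
 other compatibility characters.  The EXTRA_FOLDS table and normalise_term()
 are hypothetical names, and the set of ligatures listed is only an assumed
 example of "the cases we want converted", not a decided list.

```python
import unicodedata

# Hypothetical selection of compatibility characters to expand; the real
# set would need to be decided as part of the ticket.
EXTRA_FOLDS = {
    "\ufb00": "ff",   # LATIN SMALL LIGATURE FF
    "\ufb01": "fi",   # LATIN SMALL LIGATURE FI
    "\ufb02": "fl",   # LATIN SMALL LIGATURE FL
    "\ufb03": "ffi",  # LATIN SMALL LIGATURE FFI
    "\ufb04": "ffl",  # LATIN SMALL LIGATURE FFL
}
_FOLD_TABLE = str.maketrans(EXTRA_FOLDS)


def normalise_term(text: str) -> str:
    """Expand the selected ligatures, then convert the result to NFC.

    The expansion is applied first so that the returned string is still in
    NFC even if a combining mark follows an expanded ligature.
    """
    return unicodedata.normalize("NFC", text.translate(_FOLD_TABLE))


if __name__ == "__main__":
    word = "o\ufb03ce"  # "office" written with the ffi ligature
    print(unicodedata.normalize("NFC", word) == word)   # NFC keeps the ligature
    print(unicodedata.normalize("NFKC", word))          # NFKC expands it
    print(normalise_term(word))                         # NFC plus chosen folds
    print(normalise_term("x\u00b2") == "x\u00b2")       # superscript two untouched
```

 With this approach callers only ever need to supply standard NFC (or
 un-normalised) text, and characters that NFKC would also fold, such as
 superscript digits, are left as they are unless explicitly listed.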
-- 
Ticket URL: <https://trac.xapian.org/ticket/150#comment:14>
Xapian <https://xapian.org/>