[Xapian-tickets] [Xapian] #150: Enhancements to Unicode support
Xapian
nobody at xapian.org
Tue Dec 10 03:08:44 GMT 2024
#150: Enhancements to Unicode support
-------------------------+-------------------------------
Reporter: Olly Betts | Owner: Olly Betts
Type: enhancement | Status: assigned
Priority: normal | Milestone: 2.0.0
Component: QueryParser | Version: git master
Severity: minor | Resolution:
Keywords: | Blocked By:
Blocking: | Operating System: All
-------------------------+-------------------------------
Comment (by Olly Betts):
> I think this needs a deeper analysis, but possibly we could define a
> subset of the Unicode compatibility equivalent forms to use here.
Thinking about this more, defining our own subset is unhelpful. User code
can use existing libraries (or language support) to convert to NFC or NFKC,
but to get "Xapian-NFC" users would need to write their own conversion
code, or (more sensibly) we'd need to provide conversion functionality. If
we're going to have to provide it, it seems better to just convert the
cases we want converted internally. So we might pick NFC and then
internally handle cases such as the "ffi" ligature.
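
As a rough illustration of the idea (not Xapian code, and not a proposed
API), the sketch below uses Python's standard unicodedata module to apply
NFC and then folds a small, hand-picked set of compatibility characters.
The function name normalize_term and the choice of ligatures in the table
are assumptions made for the example; the comment above only names the
"ffi" ligature.

```python
import unicodedata

# Hypothetical set of compatibility characters to fold after NFC.
# Only the "ffi" ligature is mentioned in the ticket comment; the
# other Latin ligatures are included here purely for illustration.
LIGATURE_FOLDS = {
    "\uFB00": "ff",   # LATIN SMALL LIGATURE FF
    "\uFB01": "fi",   # LATIN SMALL LIGATURE FI
    "\uFB02": "fl",   # LATIN SMALL LIGATURE FL
    "\uFB03": "ffi",  # LATIN SMALL LIGATURE FFI
    "\uFB04": "ffl",  # LATIN SMALL LIGATURE FFL
}
_FOLD_TABLE = str.maketrans(LIGATURE_FOLDS)


def normalize_term(text: str) -> str:
    """Apply NFC, then fold the selected compatibility characters."""
    nfc = unicodedata.normalize("NFC", text)
    return nfc.translate(_FOLD_TABLE)
```

With this approach callers only need a standard NFC conversion. Characters
that NFKC would also rewrite, such as SUPERSCRIPT TWO (U+00B2), are left
alone unless they are explicitly added to the table.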
--
Ticket URL: <https://trac.xapian.org/ticket/150#comment:14>
Xapian <https://xapian.org/>