[Xapian-tickets] [Xapian] #618: Omega: Improved indexing of leafname (intelligent split into several words)
Xapian
nobody at xapian.org
Thu Dec 7 01:16:45 GMT 2023
#618: Omega: Improved indexing of leafname (intelligent split into several words)
-------------------------+-------------------------------
Reporter: peterpan | Owner: Olly Betts
Type: enhancement | Status: assigned
Priority: normal | Milestone: 2.0.0
Component: Omega | Version: 1.2.14
Severity: normal | Resolution:
Keywords: | Blocked By:
Blocking: | Operating System: All
-------------------------+-------------------------------
Changes (by Olly Betts):
* milestone: 1.5.0 => 2.0.0
Comment:
We need an algorithm that handles camel-case suitable, without doing
stupid things to other cases.
Perhaps "word-split before an upper case character if it's followed by
either a lower case character, or by another upper case character and then
a lower case character, so:
* `thisIsAFile` -> `this` `Is` `A` `File`
* `AndThis` -> `And` `This`
* `README` -> `README`
* `nothandled` -> `nothandled`
This would be reasonable to backport to a stable release series
(especially early in the series) so not a blocker.
--
Ticket URL: <https://trac.xapian.org/ticket/618#comment:7>
Xapian <https://xapian.org/>
Xapian
More information about the Xapian-tickets
mailing list