[Xapian-tickets] [Xapian] #618: Omega: Improved indexing of leafname (intelligent split into several words)

Xapian nobody at xapian.org
Thu Dec 7 01:16:45 GMT 2023


#618: Omega: Improved indexing of leafname (intelligent split into several words)
-------------------------+-------------------------------
 Reporter:  peterpan     |             Owner:  Olly Betts
     Type:  enhancement  |            Status:  assigned
 Priority:  normal       |         Milestone:  2.0.0
Component:  Omega        |           Version:  1.2.14
 Severity:  normal       |        Resolution:
 Keywords:               |        Blocked By:
 Blocking:               |  Operating System:  All
-------------------------+-------------------------------
Changes (by Olly Betts):

 * milestone:  1.5.0 => 2.0.0

Comment:

 We need an algorithm that handles camel-case suitable, without doing
 stupid things to other cases.

 Perhaps "word-split before an upper case character if it's followed by
 either a lower case character, or by another upper case character and then
 a lower case character, so:

 * `thisIsAFile` -> `this` `Is` `A` `File`
 * `AndThis` -> `And` `This`
 * `README` -> `README`
 * `nothandled` -> `nothandled`

 This would be reasonable to backport to a stable release series
 (especially early in the series) so not a blocker.
-- 
Ticket URL: <https://trac.xapian.org/ticket/618#comment:7>
Xapian <https://xapian.org/>
Xapian


More information about the Xapian-tickets mailing list