[Xapian-tickets] [Xapian] #618: Omega: Improved indexing of leafname (intelligent split into several words)

Xapian nobody at xapian.org
Tue Apr 16 20:03:24 BST 2013


#618: Omega: Improved indexing of leafname (intelligent split into several words)
----------------------+-----------------------------------------------------
 Reporter:  peterpan  |       Owner:  olly  
     Type:  defect    |      Status:  new   
 Priority:  normal    |   Milestone:        
Component:  Omega     |     Version:  1.2.14
 Severity:  normal    |   Blockedby:        
 Platform:  All       |    Blocking:        
----------------------+-----------------------------------------------------
 Reference: http://article.gmane.org/gmane.comp.search.xapian.general/9561

 Omega indexes file names. The file name seems to indexed as
 several words if the name contains space characters or hyphens.

 In my NAS share I often separate words in the file name using "-" , "_" or
 even using a capital letter at the beginning of each word (I guess this is
 also the case for many other users):

 Examples:
 this_is_a_file.txt
 thisIsAFile.txt

 In those cases, a noticed that omega does not index the individual words,
 but only the full basename as one single word.

 Therefore, omega should index each respective word (i.e. "this" "is" "a"
 "file") in addition to the full basename (i.e. "this_is_a_file"), in order
 to ease the search.

-- 
Ticket URL: <http://trac.xapian.org/ticket/618>
Xapian <http://xapian.org/>
Xapian



More information about the Xapian-tickets mailing list