[Xapian-tickets] [Xapian] #618: Omega: Improved indexing of leafname (intelligent split into several words)

Xapian nobody at xapian.org
Tue Apr 16 20:06:02 BST 2013


#618: Omega: Improved indexing of leafname (intelligent split into several words)
----------------------+-----------------------------------------------------
 Reporter:  peterpan  |       Owner:  olly  
     Type:  defect    |      Status:  new   
 Priority:  normal    |   Milestone:        
Component:  Omega     |     Version:  1.2.14
 Severity:  normal    |    Keywords:        
Blockedby:            |    Platform:  All   
 Blocking:            |  
----------------------+-----------------------------------------------------
Description changed by peterpan:

Old description:

> Reference: http://article.gmane.org/gmane.comp.search.xapian.general/9561
>
> Omega indexes file names. The file name seems to indexed as
> several words if the name contains space characters or hyphens.
>
> In my NAS share I often separate words in the file name using "-" , "_"
> or even using a capital letter at the beginning of each word (I guess
> this is also the case for many other users):
>
> Examples:
> this_is_a_file.txt
> thisIsAFile.txt
>
> In those cases, a noticed that omega does not index the individual words,
> but only the full basename as one single word.
>
> Therefore, omega should index each respective word (i.e. "this" "is" "a"
> "file") in addition to the full basename (i.e. "this_is_a_file"), in
> order to ease the search.

New description:

 Reference: http://article.gmane.org/gmane.comp.search.xapian.general/9561

 Omega indexes file names. The file name seems to indexed as
 several words if the name contains space characters or hyphens.

 In my NAS share I often separate words in the file name using "-" , "_" or
 even using a capital letter at the beginning of each word (I guess this is
 also the case for many other users):

 Examples:

 "this_is_a_file.txt"

 "thisIsAFile.txt"

 In those cases, a noticed that omega does not index the individual words,
 but only the full basename as one single word.

 Therefore, omega should index each respective word (i.e. "this" "is" "a"
 "file") in addition to the full basename (i.e. "this_is_a_file"), in order
 to ease the search.

--

-- 
Ticket URL: <http://trac.xapian.org/ticket/618#comment:1>
Xapian <http://xapian.org/>
Xapian



More information about the Xapian-tickets mailing list