[Xapian-tickets] [Xapian] #719: Tokenized CJK query terms wrongly combined with respect to prefixes

Xapian nobody at xapian.org
Thu May 5 01:15:44 BST 2016


#719: Tokenized CJK query terms wrongly combined with respect to prefixes
-------------------------+-----------------------------
 Reporter:  liweitianux  |             Owner:  olly
     Type:  defect       |            Status:  assigned
 Priority:  normal       |         Milestone:  1.4.x
Component:  QueryParser  |           Version:  1.2.23
 Severity:  normal       |        Resolution:
 Keywords:  CJK, prefix  |        Blocked By:
 Blocking:               |  Operating System:  Linux
-------------------------+-----------------------------
Changes (by olly):

 * status:  new => assigned
 * milestone:   => 1.4.x


Comment:

 I think I agree that this is the better option:

 {{{
 Xapian::Query(((B中:(pos=1) AND B中文:(pos=1) AND B文:(pos=1)) OR
                (S中:(pos=2) AND S中文:(pos=2) AND S文:(pos=2))))
 }}}

 This case seems more analogous to a phrase search, which we handle more
 like that:

 {{{
 >>> print(qp.parse_query('"hello world"'))
 Query(((Bhello at 1 PHRASE 2 Bworld at 2) OR (Shello at 1 PHRASE 2 Sworld at 2)))
 }}}

 Though either would be an improvement.

 Marking for 1.4.x (once fixed there we can consider backporting).

--
Ticket URL: <https://trac.xapian.org/ticket/719#comment:2>
Xapian <//xapian.org/>
Xapian



More information about the Xapian-tickets mailing list