[Xapian-tickets] [Xapian] #671: Performance issues when querying over large number of local databases (shards)

Xapian nobody at xapian.org
Tue Mar 24 03:26:08 GMT 2015


#671: Performance issues when querying over large number of local databases
(shards)
----------------------------------+--------------------------
 Reporter:  wgreenberg            |             Owner:  olly
     Type:  defect                |            Status:  new
 Priority:  normal                |         Milestone:
Component:  Matcher               |           Version:
 Severity:  normal                |        Resolution:
 Keywords:  sharding performance  |        Blocked By:
 Blocking:                        |  Operating System:  Linux
----------------------------------+--------------------------
Changes (by olly):

 * component:  Other => Matcher


Comment:

 Discussion on IRC reveals that wgreenbeeg's postlists only have one level
 (plus root) for wgreenberg's situation, so the patch does do what's
 intended there (as the root block should be in `C[level]`.  For the
 general case, it either needs to not descend the tree, to actually read
 the branch blocks touched, or to make multiple passes to read a level and
 pre-read the one below.

 Gmane's postlist.DB is 3 levels (plus the root I think) for 120 million
 documents, so the trees don't get very deep.  A pre-read of just the
 wanted blocks in the level below the root is likely to help even for those
 with deeper trees.

--
Ticket URL: <http://trac.xapian.org/ticket/671#comment:4>
Xapian <http://xapian.org/>
Xapian



More information about the Xapian-tickets mailing list