[Xapian-tickets] [Xapian] #782: Improve encode_length() performance

Xapian nobody at xapian.org
Thu Jun 6 04:54:13 BST 2019


#782: Improve encode_length() performance
----------------------------+-------------------------
 Reporter:  Kronuz          |             Owner:  olly
     Type:  enhancement     |            Status:  new
 Priority:  normal          |         Milestone:
Component:  Backend-Remote  |           Version:
 Severity:  minor           |        Resolution:
 Keywords:                  |        Blocked By:
 Blocking:                  |  Operating System:  All
----------------------------+-------------------------

Comment (by olly):

 Working on getting some real world data, but in the test suite we see
 larger sizes:

 {{{
 Data points: 1584397
 Top 20 values with frequencies:
  590387 1
  230343 4
  149371 3
   99777 0
   25724 2
   17371 6
   13834 18
   13548 8
   11340 5
    8428 7
    8179 40
    7944 61
    6418 9
    5941 11
    5646 96
    5243 21
    5082 4294967295
    4801 10
    4376 17
    3804 16
 }}}

 (4294967295 is 0xffffffff so something suspect is going on there...)

 127058 values (including those 5082 0xffffffff values) are > 254, which is
 8%.  Excluding the 0xffffffff values, the mean is 36717.040639771.

 But the testsuite data is typically rather artificial.

--
Ticket URL: <https://trac.xapian.org/ticket/782#comment:10>
Xapian <https://xapian.org/>
Xapian



More information about the Xapian-tickets mailing list