[Xapian-discuss] Re: Japanese / UTF-8 support

Jeff Breidenbach breidenbach at gmail.com
Sun Aug 13 05:34:50 BST 2006


This is looking promising. Running down my Omega checklist:

  * The patch is still too crude to submit, but I've beaten htmlparse.cc
    into respecting <!--htdig_noindex--><!--/htdig_noindex-->

  * I've located the 300 character limit on sample size in omindex.cc,
    but am leaving that alone for the time being. Will keep in mind for
    improving summary results later. [1]

  * Getting filesize and last modification date in summary results is
    nice to have, but not critical. Putting on backburner.

  * I'm now building some flint indices for testing. This will probably
    take about a week to complete. When finished, this may provide
    some interesting benchmarks.

  * How can I best help with CJK? The more concrete the suggestion,
    the better.
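
To make the first item concrete, here is a rough standalone sketch of the
behaviour, not the patch itself and not how htmlparse.cc is structured.
It assumes the markers appear exactly as written above. The function name
strip_noindex is hypothetical, and dropping everything after an unterminated
opening marker is my own assumption about the sensible fallback.

```cpp
#include <string>

// Hypothetical helper (not Omega code): return `html` with every
// <!--htdig_noindex--> ... <!--/htdig_noindex--> region removed, so the
// text inside those regions never reaches the indexer.
std::string strip_noindex(const std::string& html) {
    static const std::string open_tag = "<!--htdig_noindex-->";
    static const std::string close_tag = "<!--/htdig_noindex-->";

    std::string out;
    std::string::size_type pos = 0;
    while (true) {
        // Find the next opening marker at or after the current position.
        std::string::size_type start = html.find(open_tag, pos);
        if (start == std::string::npos) {
            // No more markers: keep the remainder of the document.
            out.append(html, pos, std::string::npos);
            break;
        }
        // Keep the text before the marker.
        out.append(html, pos, start - pos);

        // Skip ahead to just past the matching closing marker.
        std::string::size_type end =
            html.find(close_tag, start + open_tag.size());
        if (end == std::string::npos) {
            // Assumption: an unterminated region suppresses the rest.
            break;
        }
        pos = end + close_tag.size();
    }
    return out;
}
```

For example, strip_noindex("keep <!--htdig_noindex-->skip<!--/htdig_noindex-->this")
yields "keep this". A real parser would handle this while walking comments
rather than as a separate pass over the raw text.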

Cheers,
Jeff

[1] http://lists.tartarus.org/pipermail/xapian-discuss/2006-January/001471.html


