[Xapian-discuss] Re: Japanese / UTF-8 support
Jeff Breidenbach
breidenbach at gmail.com
Sun Aug 13 05:34:50 BST 2006
This is looking promising. Running down my Omega checklist:
* The patch is still too crude to submit, but I'v beaten htmlparse.cc
into respecting <!--htdig_noindex--><!--/htdig_noindex-->
* I've located the 300 character limit on sample size in omindex.cc,
but am leaving that alone for the time being. Will keep in mind for
improving summary results later. [1]
* Getting filesize and last modification date in summary results is
nice to have, but not critical. Putting on backburner.
* I'm now building some flint indices for testing. This will probably
take about a week to complete. When finished, this may provide
some interesting benchmarks.
* How can I best help with CJK ? The more concrete the suggestion,
the better.
Cheers,
Jeff
[1] http://lists.tartarus.org/pipermail/xapian-discuss/2006-January/001471.html
More information about the Xapian-discuss
mailing list