[Xapian-devel] Proposed changes to omindex

Sun Aug 27 19:22:37 BST 2006

On Sun, Aug 27, 2006 at 07:12:45PM +0100, Olly Betts wrote:

> Not all file formats can be sanely decoded without seeking though (and
> some are more efficient to decode if you can seek).

True, but there aren't all that many of them. You can be clever with
some of the in-between ones, which use content tables, and still avoid
seeking, even if the tables are recursive.

> I'm not sure I can visualise how a merged indexer would look right
> now, but I think this isn't something for the short term anyway -
> sorting out utf-8 and flint are more important currently.

Absolutely.

> > > > I'd certainly favour having a way of running the query parser that
> > > > didn't need R-terms, [...]
> > > 
> > > There already is: QueryParser::set_stemming_strategy() can be called
> > > with STEM_NONE or STEM_ALL (the default is STEM_SOME).
> > 
> > Ah, excellent. Is this documented anywhere? Can't remember seeing it...
> 
> Hmm, only rather tersely:
> 
> http://www.xapian.org/docs/apidoc/html/classXapian_1_1QueryParser.html#c7dc3b55b6083bd3ff98fc8b2726c8fd
> 
> I'll try to flesh that out.

Cool.
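
For the archive, here is a rough sketch of how I understand the three
strategies to differ. It is a toy model, not Xapian code: the
StemStrategy enum, toy_stem() and query_terms() are all made-up names,
the "stemmer" only strips a trailing "ing" or "s", and the only
STEM_SOME rule modelled is "leave capitalised words alone" (the real
parser has further exceptions, and R-terms aren't modelled at all).

```cpp
#include <cassert>
#include <cctype>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical stand-in for the three strategies; this enum only mirrors
// the names STEM_NONE / STEM_SOME / STEM_ALL, it is not Xapian's type.
enum class StemStrategy { NONE, SOME, ALL };

// Toy stemmer (assumption): strips a trailing "ing" or "s".
// A real stemmer does far more than this.
std::string toy_stem(const std::string& word) {
    if (word.size() > 5 && word.compare(word.size() - 3, 3, "ing") == 0)
        return word.substr(0, word.size() - 3);
    if (word.size() > 3 && word.back() == 's')
        return word.substr(0, word.size() - 1);
    return word;
}

// Lowercase a word, byte by byte (ASCII only in this sketch).
std::string lowercase(std::string s) {
    for (char& c : s)
        c = static_cast<char>(std::tolower(static_cast<unsigned char>(c)));
    return s;
}

// Split a query on whitespace and turn each word into a term.
//   NONE: never stem.
//   ALL:  always stem.
//   SOME: stem unless the word starts with a capital letter.
std::vector<std::string> query_terms(const std::string& query,
                                     StemStrategy strategy) {
    std::vector<std::string> terms;
    std::istringstream in(query);
    std::string word;
    while (in >> word) {
        bool capitalised =
            std::isupper(static_cast<unsigned char>(word[0])) != 0;
        bool stem = strategy == StemStrategy::ALL ||
                    (strategy == StemStrategy::SOME && !capitalised);
        std::string term = lowercase(word);
        terms.push_back(stem ? toy_stem(term) : term);
    }
    return terms;
}
```

With these toy rules, the query "Indexing files" comes out as
"indexing files" under NONE, "indexing file" under SOME, and
"index file" under ALL.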

James

-- 
/--------------------------------------------------------------------------\
  James Aylett                                                  xapian.org
  james at tartarus.org                               uncertaintydivision.org