[Xapian-discuss] search result context.

James Aylett james-xapian at tartarus.org
Mon Jan 23 18:04:09 GMT 2006


On Mon, Jan 23, 2006 at 10:42:49AM -0500, Charles Brandt wrote:

> It sounds like saving the whole document to the xapian Document's
> data field would bog down the search database.  That makes sense
> that you would need the whole document for context display, I just
> wasn't sure if there was some built in function that could rebuild
> pieces based on the position information.

The way omega does something *similar* to this is to store a summary
of the original page in the Document's data, and run a highlighting
algorithm on that. That's acceptable for most people.

If you don't have stemming enabled, but are storing positional data,
you could rebuild the text of the document from the position list of
the document (I think - I've never tried!).

The document data is stored in a different table to the stuff used in
the actual search, so putting more data in won't affect the actual
search time, although it will have an impact on building the results
display of course. It will also have an impact on index times I'd
guess, as there'll be more btree blocks per document - not sure quite
what effect you'll see there, though.

J

-- 
/--------------------------------------------------------------------------\
  James Aylett                                                  xapian.org
  james at tartarus.org                               uncertaintydivision.org



More information about the Xapian-discuss mailing list