[Xapian-discuss] using xapian for indexing mails

James Aylett james-xapian at tartarus.org
Sat Aug 30 13:21:30 BST 2008


On Sat, Aug 30, 2008 at 10:29:27AM +0100, Matthew Somerville wrote:

> > Or is there some easier way to simply provide blobs of text, and being
> > able to search for them later?
> 
> You want XapianTermGenerator, which takes a blob of text and adds all the
> words in it to Xapian.

Since you're looking at indexing email, you may like to take a look at
my (unreleased) proof-of-concept email search in Python:

<http://svn.tartarus.org/sja/woodpecker/>

It's based around mboxes, not maildirs, so you'd need to make some
changes if you wanted to use it. However, it's probably more useful
as an example of one way to lay out your document data and term
prefixes. It's all under the GPL.
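
To give a rough idea of what such a layout can look like, here is a
minimal sketch. It is not taken from woodpecker: the function names
(message_fields, index_mbox), the format of the document data and the
choice of prefixes are all assumptions, with A for author, S for
subject and Q for a unique id following Xapian's usual prefix
conventions. It needs the Xapian Python bindings for the indexing
step.

```python
import email
import mailbox


def message_fields(msg):
    """Pull out the parts of an email message we want to index.

    Hypothetical helper: returns a dict with id, from, subject and body.
    """
    body_parts = []
    for part in msg.walk():
        if part.get_content_type() != "text/plain":
            continue
        payload = part.get_payload(decode=True) or b""
        charset = part.get_content_charset() or "utf-8"
        try:
            text = payload.decode(charset, errors="replace")
        except LookupError:
            # Unknown charset name in the mail; fall back to UTF-8.
            text = payload.decode("utf-8", errors="replace")
        body_parts.append(text)
    return {
        "id": str(msg.get("Message-ID", "")).strip(),
        "from": str(msg.get("From", "")),
        "subject": str(msg.get("Subject", "")),
        "body": "\n".join(body_parts),
    }


def index_mbox(mbox_path, db_path):
    """Index every message in an mbox into a Xapian database."""
    import xapian  # imported here so message_fields works without it

    db = xapian.WritableDatabase(db_path, xapian.DB_CREATE_OR_OPEN)
    indexer = xapian.TermGenerator()
    indexer.set_stemmer(xapian.Stem("english"))

    for msg in mailbox.mbox(mbox_path):
        fields = message_fields(msg)
        doc = xapian.Document()
        indexer.set_document(doc)

        # Prefixed terms, so searches can be restricted to one field.
        indexer.index_text(fields["subject"], 1, "S")
        indexer.index_text(fields["from"], 1, "A")

        # Unprefixed terms for general free-text search.
        indexer.index_text(fields["subject"])
        indexer.increase_termpos()  # keep phrases from spanning fields
        indexer.index_text(fields["body"])

        # Document data: whatever is needed to display a result.
        doc.set_data("subject=%s\nfrom=%s" % (fields["subject"],
                                              fields["from"]))

        if fields["id"]:
            # Q-prefixed unique term: re-indexing replaces the old copy.
            # (Very long Message-IDs could exceed Xapian's term length
            # limit; this sketch does not handle that.)
            idterm = "Q" + fields["id"]
            doc.add_boolean_term(idterm)
            db.replace_document(idterm, doc)
        else:
            db.add_document(doc)

    db.commit()
```

On the search side you would then map field names to the same
prefixes with QueryParser's add_prefix(), for example "subject" to S
and "from" to A.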

J

-- 
/--------------------------------------------------------------------------\
  James Aylett                                                  xapian.org
  james at tartarus.org                               uncertaintydivision.org


