[Xapian-discuss] using xapian for indexing mails
James Aylett
james-xapian at tartarus.org
Sat Aug 30 13:21:30 BST 2008
On Sat, Aug 30, 2008 at 10:29:27AM +0100, Matthew Somerville wrote:
> > Or is there some easier way to simply provide blobs of text, and being
> > able to search for them later?
>
> You want XapianTermGenerator, which takes a blob of text and adds all the
> words in it to Xapian.
Since you're looking at indexing email, you may like to take a look at
my (unreleased) proof-of-concept email search in Python:
<http://svn.tartarus.org/sja/woodpecker/>
It's based around mboxes, not maildirs, so you'd need to make some
changes if you wanted to use it. However, it's probably more useful as
an example of one way to lay out your document data and term
prefixes. It's all under the GPL.
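
As a rough illustration of what "document data and term prefixes" can
mean for email, here is a minimal sketch. It is not the layout
woodpecker itself uses. The prefixes A (author), S (subject) and Q
(unique id) follow the conventions Omega uses; XTO is a made-up user
prefix, and the helper names layout_message and index_mbox are
hypothetical. It assumes the Xapian Python bindings are installed and
reasonably recent (add_boolean_term and commit are not in very old
releases).

```python
import email
from email.message import Message

# A, S and Q follow Omega's prefix conventions; XTO is an assumed,
# user-defined prefix chosen for this sketch only.
PREFIXES = {"from": "A", "subject": "S", "to": "XTO"}


def layout_message(msg: Message) -> dict:
    """Split one email into an id term, prefixed fields, body and data."""
    msgid = (msg.get("Message-ID") or "").strip()
    fields = {}
    for header, prefix in PREFIXES.items():
        value = msg.get(header)
        if value:
            fields[prefix] = str(value)
    body = ""
    for part in msg.walk():
        if part.get_content_type() != "text/plain":
            continue
        payload = part.get_payload(decode=True) or b""
        try:
            body += payload.decode(part.get_content_charset() or "utf-8",
                                   errors="replace")
        except LookupError:  # unknown charset name in the message
            body += payload.decode("utf-8", errors="replace")
    # Document data is what you show in results; keep it small.
    data = "\n".join([fields.get("S", ""), fields.get("A", "")])
    return {"idterm": "Q" + msgid if msgid else None,
            "fields": fields, "body": body, "data": data}


def index_mbox(mbox_path: str, db_path: str) -> None:
    """Index every message of an mbox into a Xapian database."""
    import mailbox
    import xapian  # Xapian Python bindings, installed separately

    db = xapian.WritableDatabase(db_path, xapian.DB_CREATE_OR_OPEN)
    termgen = xapian.TermGenerator()
    termgen.set_stemmer(xapian.Stem("en"))
    for msg in mailbox.mbox(mbox_path):
        item = layout_message(msg)
        doc = xapian.Document()
        doc.set_data(item["data"])
        termgen.set_document(doc)
        for prefix, text in item["fields"].items():
            termgen.index_text(text, 1, prefix)  # searchable per field
            termgen.increase_termpos()  # no phrase matches across fields
        termgen.index_text(item["body"])  # unprefixed free text
        if item["idterm"]:
            # Re-indexing the same message replaces the old document.
            # Note: very long Message-IDs can exceed Xapian's term
            # length limit and would need hashing or truncating.
            doc.add_boolean_term(item["idterm"])
            db.replace_document(item["idterm"], doc)
        else:
            db.add_document(doc)
    db.commit()
```

You would call it as index_mbox("inbox.mbox", "maildb"), with both
paths being placeholders. Keeping the layout step separate from the
Xapian calls makes it easy to swap the mbox reader for a maildir one.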
J
--
/--------------------------------------------------------------------------\
James Aylett xapian.org
james at tartarus.org uncertaintydivision.org