[Xapian-discuss] omega and "text/x-mail" support
Olly Betts
olly at survex.com
Mon Dec 15 22:22:57 GMT 2014
On Sat, Dec 13, 2014 at 08:32:58PM +0100, Emmanuel Garette wrote:
> I would like to add "text/x-mail" support to omega. I'm using mhonarc to
> export mail to HTML format and I'm using HTML parse to index mail
> content (largely inspired by "application/vnd.ms-outlook" format).
>
> The problem is that files attached to the mail are not indexing at all.
> I think it's not possible in "index_file" function to index 2 files as
> one document.
>
> I can't find easily solution for my problem. I think I must spit this
> function to separate document's creation and file indexing.
I've done some work on indexing attachments and files inside archives
(like tar and zip files), but I haven't merged it yet as it's not
entirely satisfactory in various ways, most of which require some
refactoring of omindex to address.
The approach I took to attachments was to index them as separate
documents - if I follow you correctly, you seem to be trying to treat
them as part of a single document. Is there a particular reason why
you are taking that approach?
I don't think my code is anywhere public currently, but I can rebase
it onto current master and put it on a git branch if it's potentially
useful to others in its current form.
Cheers,
Olly
More information about the Xapian-discuss
mailing list