[Xapian-tickets] [Xapian] #324: A Script that uses OpenOffice to filter text for Xapian Omega (was: A Script that users OpenOffice to filter text for Xapian Omega)
Xapian
nobody at xapian.org
Fri Jun 12 07:52:10 BST 2009
#324: A Script that uses OpenOffice to filter text for Xapian Omega
-------------------------+--------------------------------------------------
Reporter: frankjb | Owner: olly
Type: enhancement | Status: new
Priority: normal | Milestone:
Component: Omega | Version:
Severity: normal | Keywords:
Blockedby: | Platform: Linux
Blocking: |
-------------------------+--------------------------------------------------
Changes (by olly):
* keywords: openoffice convert =>
Comment:
I can see some people might prefer to use openoffice for such things. I
don't think it's a suitable ubiquitous replacement for antiword -
openoffice is just too heavyweight as a default. I think the best
candidate for a replacement for antiword as the default is wvWare.
This script is problematic in a few ways though. E.g. just killing off
any existing openoffice processes is very hostile - what if I'm writing a
report and cron kicks off an index update on the same machine? Also, it
doesn't appear to clean up temporary files, and since it does {{{cat
*.html}}} that seems to mean it will produce the output from every
previous file processed!
Also, openoffice is rather slow to start up, so I think we really would
want to use a persistent instance.
--
Ticket URL: <http://trac.xapian.org/ticket/324#comment:4>
Xapian <http://xapian.org/>
Xapian
More information about the Xapian-tickets
mailing list