[Xapian-tickets] [Xapian] #324: A Script that uses OpenOffice to filter text for Xapian Omega (was: A Script that users OpenOffice to filter text for Xapian Omega)

Xapian nobody at xapian.org
Fri Jun 12 07:52:10 BST 2009


#324: A Script that uses OpenOffice to filter text for Xapian Omega
-------------------------+--------------------------------------------------
 Reporter:  frankjb      |       Owner:  olly 
     Type:  enhancement  |      Status:  new  
 Priority:  normal       |   Milestone:       
Component:  Omega        |     Version:       
 Severity:  normal       |    Keywords:       
Blockedby:               |    Platform:  Linux
 Blocking:               |  
-------------------------+--------------------------------------------------
Changes (by olly):

  * keywords:  openoffice  convert =>


Comment:

 I can see that some people might prefer to use openoffice for such things,
 but I don't think it's suitable as a general replacement for antiword -
 openoffice is just too heavyweight to be the default.  I think the best
 candidate to replace antiword as the default is wvWare.

 This script is problematic in a few ways though.  For example, killing off
 any existing openoffice processes is very hostile - what if I'm writing a
 report and cron kicks off an index update on the same machine?  It also
 doesn't appear to clean up its temporary files, and since it runs
 {{{cat *.html}}}, that seems to mean the output for each file will include
 the output from every file processed before it!
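
 As a rough illustration of how the temporary-file and {{{cat *.html}}}
 problems could be avoided, here is a minimal Python sketch.  It is not the
 submitted script: the function name filter_document and the converter
 command contract (the command is given an output directory and a source
 file, and writes <stem>.html into that directory) are assumptions made for
 the example.  Each call gets its own scratch directory, reads back only the
 one file it expects, and removes the directory afterwards.

```python
import subprocess
import tempfile
from pathlib import Path


def filter_document(source, converter_cmd):
    """Convert one document and return only that document's HTML.

    converter_cmd is a hypothetical command prefix (a list of strings).
    It is assumed to accept "<outdir> <source>" as its last two arguments
    and to write "<stem>.html" into <outdir>.
    """
    source = Path(source).resolve()
    # A private directory per call: nothing from earlier runs can be in it,
    # and TemporaryDirectory deletes it on exit, even if conversion fails.
    with tempfile.TemporaryDirectory(prefix="omega-filter-") as scratch:
        expected = Path(scratch) / (source.stem + ".html")
        subprocess.run(converter_cmd + [scratch, str(source)], check=True)
        # Read the one expected file rather than globbing *.html.
        return expected.read_text(encoding="utf-8", errors="replace")
```

 Because the scratch directory is private to the call, two index runs at the
 same time would not see each other's output either.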

 Also, openoffice is rather slow to start up, so I think we really would
 want to use a persistent instance.
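
 The persistent-instance idea can be sketched in general terms as follows.
 This is only an illustration of paying the startup cost once: the worker
 here is a stand-in Python child process, not OpenOffice, and the class name
 PersistentConverter and the one-line-per-request protocol are assumptions
 made for the example.

```python
import subprocess
import sys

# Stand-in worker.  A real worker would start the heavyweight converter once
# and then serve requests; this one just echoes a marker for each path.
WORKER_SOURCE = r"""
import sys
for line in sys.stdin:                      # one request per line
    path = line.rstrip("\n")
    print("converted:" + path, flush=True)  # one reply per line
"""


class PersistentConverter:
    """Keep one worker process alive across many conversion requests."""

    def __init__(self):
        # Started once; it belongs to this object, so no other process on
        # the machine ever needs to be killed.
        self.proc = subprocess.Popen(
            [sys.executable, "-c", WORKER_SOURCE],
            stdin=subprocess.PIPE,
            stdout=subprocess.PIPE,
            text=True,
        )

    def convert(self, path):
        # Send one request line and wait for the matching reply line.
        self.proc.stdin.write(path + "\n")
        self.proc.stdin.flush()
        return self.proc.stdout.readline().rstrip("\n")

    def close(self):
        # Closing stdin ends the worker's loop; then reap the process.
        self.proc.stdin.close()
        self.proc.wait()
```

 An indexer would create one PersistentConverter at the start of a run, call
 convert() for each file, and call close() at the end.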

-- 
Ticket URL: <http://trac.xapian.org/ticket/324#comment:4>
Xapian <http://xapian.org/>