[Xapian-tickets] [Xapian] #569: Generate omindex docs and code relating to file types

Xapian nobody at xapian.org
Mon Mar 20 03:34:09 GMT 2023


#569: Generate omindex docs and code relating to file types
---------------------+-------------------------------
 Reporter:  Charles  |             Owner:  Olly Betts
     Type:  defect   |            Status:  assigned
 Priority:  low      |         Milestone:  2.0.0
Component:  Omega    |           Version:  git master
 Severity:  normal   |        Resolution:
 Keywords:           |        Blocked By:
 Blocking:           |  Operating System:  All
---------------------+-------------------------------
Changes (by Olly Betts):

 * priority:  normal => low
 * version:  1.2.5 => git master
 * milestone:  1.4.x => 2.0.0

Comment:

 Reviewing the remaining thing raised here that isn't currently generated
 is the list starting:

   • HTML (.html, .htm, .shtml, .shtm, .xhtml, .xhtm)
   • PHP (.php) - our HTML parser knows to ignore PHP code
   • text files (.txt, .text)
   • SVG (.svg)
   • Compressed SVG (.svgz)
   • CSV (Comma-Separated Values) files (.csv)
   • PDF (.pdf) if pdftotext (comes with poppler or xpdf) or libpoppler (in
     particular libpoppler-glib-dev) are available
   • !PostScript (.ps, .eps, .ai) if ps2pdf (from ghostscript) and
 pdftotext
     (comes with poppler or xpdf) or libpoppler (in particular
     libpoppler-glib-dev) are available

 In the table in the `.odt` attachment, this information is in the same
 table that lists the extension to MIME content-type mappings.  I worry
 though that the table ends up being too wide (in the `.odt` the "MIME
 type" column has had to be wrapped in many cases which makes it harder to
 read).

 Also where there are multiple extensions for a MIME content-type the
 `.odt` has a row for each, repeating the other fields - this helps keep
 the width under control, but makes it harder to see at a glance which
 extensions are essentially the same type.  Perhaps using column spans for
 the other fields would work for this (grid table support column spans in
 `.rst`).

 Bumping the milestone as this isn't a blocker for the next release series.
-- 
Ticket URL: <https://trac.xapian.org/ticket/569#comment:15>
Xapian <https://xapian.org/>
Xapian


More information about the Xapian-tickets mailing list