[Xapian-tickets] [Xapian] #327: Power Point 2007 Filter
Xapian
nobody at xapian.org
Tue Feb 10 00:16:37 GMT 2009
#327: Power Point 2007 Filter
-------------------------+--------------------------------------------------
Reporter: frankjb | Owner: olly
Type: enhancement | Status: new
Priority: normal | Milestone:
Component: Other | Version: 1.0.10
Severity: normal | Blockedby:
Platform: All | Blocking:
-------------------------+--------------------------------------------------
Here's is a filter I wrote for powerpoint 2007. It will extract all the
text from the main slides.
FYI: it works in functionality like:
unzip -p "Test sample 20090201.pptx" ppt/slides/slide*.xml
If you wanted the notes and comments you could extend it to add:
unzip -p "Test sample 20090201.pptx" ppt/slides/slide*.xml
ppt/notesSlides/notesSlide*.xml ppt/comments/comment*.xml
But you must first check that motes or common files exists as not all pptx
*.files contain them (you can get unzip to list the contents of a zip file
without uncompressing the zip if you wanted this functionality.
--
Ticket URL: <http://trac.xapian.org/ticket/327>
Xapian <http://xapian.org/>
Xapian
More information about the Xapian-tickets
mailing list