In order to ma开发者_运维知识库ke PowerPoint presentations 1. readable, and 2. searchable, I'd like to somehow convert them to wiki markup (we're using ScrewTurn).
I'm expecting some manual steps.
One idea was to upload a slide as a PDF to Google Docs and make it use its native doc format, and then use Google's HTML in I love wiki, but Google Docs erred when trying to convert the PDF file.
Just an idea...Not very sure about the feasibility. Maybe you can export the contents of slides by using VBA to a plain text file and just add some simple wiki markup for titiles, sections... if you don't care much about the font, style, pictures in it. I guess VBA can provide easy way to traverse and judge the objects in PowerPoint slides easier.
You'd need to create an XSLT or use another transform technology such as Linq in .NET to convert PresentationML and DrawingML (assuming PowerPoint 2007 and later) to a different mark up. To be clear, there is nothing easy about doing this - the PowerPoint format is the most complex of all the Office MLs.
You can start by looking at Eric White's blog on Transforming Open XML WordprocessingML to XHtml - this would be one way to do it (Linq). Certainly for the textual portions of DrawingML (which PowerPoint uses for text) there are similarities between that and WordprocessingML). You an also look at the OOXML->ODF converter for inspiration (XSLT).
精彩评论