Docvert 4.0 - Word to HTML converter
November 7, 2010 5:36 PM

I announced the previous version back in 2006, and it has progressed quite a lot since then. Docvert 4.0 is a free web service that takes Microsoft Office files and converts them to standards-compliant HTML. You can control every single tag and attribute of the conversion using W3C standards. It's now 3× faster than the 2006 version because it runs OpenOffice in server mode, and it's the only Linux-based converter that can handle EMF and WMF files. It also provides unit tests, FTP/WebDAV uploads, and lossless optimisation of images. It's easy to use, with a simple, minimalist interface.

If you're on Debian/Ubuntu, you can easily use the previous version by installing the package docvert-openoffice.org

Docvert is licensed under the GPLv3.
posted by holloway (4 comments total) 2 users marked this as a favorite

Very cool! I was interested to read about pipelines.

Don't remember ever having a client who used Word styles though :-(
posted by circular at 6:43 PM on November 7, 2010


By 'Word styles' I just mean headings and the like. All conversion software needs some way of deriving structure so that it can, for example, break up content over multiple pages.

Pipelines are an excellent way of controlling markup :)
posted by holloway at 7:31 PM on November 7, 2010


Very interesting, thanks!
posted by odinsdream at 8:13 AM on November 9, 2010


... just because someone made the mistake of doing their work in a proprietary format doesn't mean they should have to suffer for it forever!
posted by Twang at 5:05 PM on November 27, 2010

