Docvert 4.0 - Word to HTML converter
November 7, 2010 5:36 PM Subscribe
Docvert 4.0 - Word to HTML converter
I announced the previous version back in 2006 and it's progressed quite a lot since then. Docvert 4.0 is a free web service that takes Microsoft Office files and converts them to standards compliant HTML. You can control every single tag and attribute of the conversion with W3C standards. Since 2006 it's now 3× faster because it uses OpenOffice in a server mode, and it's the only Linux based converter that can handle EMF and WMF files, as well as providing unit tests, FTP/WebDAV uploads, and lossless optimisation of images. It's also easy to use, with a simple minimalist interface.
If you're on Debian/Ubuntu you can use the former version easily by installing the package: docvert-openoffice.org
Docvert is licensed under the GPLv3.
I announced the previous version back in 2006 and it's progressed quite a lot since then. Docvert 4.0 is a free web service that takes Microsoft Office files and converts them to standards compliant HTML. You can control every single tag and attribute of the conversion with W3C standards. Since 2006 it's now 3× faster because it uses OpenOffice in a server mode, and it's the only Linux based converter that can handle EMF and WMF files, as well as providing unit tests, FTP/WebDAV uploads, and lossless optimisation of images. It's also easy to use, with a simple minimalist interface.
If you're on Debian/Ubuntu you can use the former version easily by installing the package: docvert-openoffice.org
Docvert is licensed under the GPLv3.
By 'word styles' I just mean headings and so on. All conversion software needs some way of deriving structure so that it can break up content over multiple pages and so on.
Pipelines are an excellent way of controlling markup :)
posted by holloway at 7:31 PM on November 7, 2010
Pipelines are an excellent way of controlling markup :)
posted by holloway at 7:31 PM on November 7, 2010
... because just someone made the mistake of doing their work in a proprietary format doesn't mean they should have to suffer for it forever!
posted by Twang at 5:05 PM on November 27, 2010
posted by Twang at 5:05 PM on November 27, 2010
« Older LGBT etcetera... | My Father's Moustache... Newer »
Don't remember ever having a client who used Word styles though :-(
posted by circular at 6:43 PM on November 7, 2010