[odf-discuss] ODF support
marbux
marbux at gmail.com
Thu Jan 18 13:00:01 EST 2007
From today's XML.org Daily NewsLinks:
Docvert Version 3.0 Supports Word Processor Format Conversion to HTML
Matthew Cruickshank, Developer Announcement
Version 3.0 of Docvert has been announced. The software is described
as web service software that takes multiple word processor files
(typically .doc) and converts them to OASIS OpenDocument and HTML.
The web service receives a .doc file and converts it to an OASIS
OpenDocument 1.0 file, which can then optionally be converted to HTML,
RSS, or any other XML format. This is done with XML Pipelines, an approach that
supports XSLT, breaking up content over headings or sections, and
saving those results to multiple files (e.g., chapter1.html,
chapter2.html); the result is returned in a .zip file. Docvert is easy
to integrate as it uses a simple REST-style interface, and it's
released under the LGPL, so although it's open source there are no legal
problems developing proprietary software on top of it. The XML produced
is easier to understand and more structured than the WordML or .DOC
formats. New in version 3.0: (1) Post-conversion editing. Upload a
word processing file, preview, and edit it in the browser. Remove
chapters, correct typos, and rebuild the document to download the
resulting HTML. (2) FTP/WebDAV upload. Send the results of a conversion
to your website. (3) Control every tag and attribute with XSLT, PHP,
and XML Pipelines. (4) Image format conversion. Convert WMF files to
PNG and GIF. (5) Microsoft Word Plugin, which installs a toolbar and
allows one-way conversion of the Word document to OpenDocument and HTML
(multiple pages, based on document structure, controlling any tag or
attribute). Successful use requires disciplined application of Word
styles: "For all but trivial examples you'll need to use Word Styles
for any conversion software so that it knows how to section your
document and format everything correctly. Like most conversion software,
Docvert ignores font sizes and background colours and instead makes
decisions based on structural Word Styles that describe paragraphs,
headings, lists, tables, etc."
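
To illustrate the quoted point about structural styles, here is a minimal
Python sketch of a converter that chooses output elements from the paragraph
style name alone and never looks at font size. This is not Docvert's code
(the announcement describes Docvert as using XSLT, PHP, and XML Pipelines);
the STYLE_TO_TAG table, the function names, and the dict layout of a
paragraph are all assumptions made up for the example.

```python
from html import escape

# Hypothetical mapping from Word style names to HTML elements.
# The table is an assumption for illustration, not Docvert's mapping.
STYLE_TO_TAG = {
    "Heading 1": "h1",
    "Heading 2": "h2",
    "List Bullet": "li",
    "Normal": "p",
}


def paragraph_to_html(style_name, text):
    """Pick an element from the paragraph's style name only."""
    tag = STYLE_TO_TAG.get(style_name, "p")  # unknown styles fall back to <p>
    return f"<{tag}>{escape(text)}</{tag}>"


def convert(paragraphs):
    """Convert a list of paragraph dicts with 'style' and 'text' keys."""
    # Any 'font_size' key is deliberately ignored: structure comes from style.
    return [paragraph_to_html(p["style"], p["text"]) for p in paragraphs]
```

With a mapping like this, a paragraph set in large bold type but styled
"Normal" still comes out as an ordinary paragraph, which is why documents
without consistent styles convert poorly.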
http://docvert.org/
See also the FAQ document: http://holloway.co.nz/docvert/faq.html
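
The "breaking up content over headings" step described in the announcement,
with the results returned in a .zip file, can be sketched roughly as below.
This is only an illustration in Python under stated assumptions: Docvert does
this with XML Pipelines and XSLT, whereas the sketch uses a simple regular
expression on top-level <h1> elements, and the function names
split_on_headings and zip_chapters are hypothetical.

```python
import io
import re
import zipfile


def split_on_headings(html):
    """Split an HTML fragment into chunks, each starting at an <h1>."""
    # Zero-width lookahead keeps each <h1> with the content that follows it
    # (splitting on an empty match needs Python 3.7 or later).
    parts = re.split(r"(?=<h1[\s>])", html)
    return [part for part in parts if part.strip()]


def zip_chapters(html):
    """Return the bytes of a .zip holding chapter1.html, chapter2.html, ..."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        for number, chunk in enumerate(split_on_headings(html), start=1):
            archive.writestr(f"chapter{number}.html", chunk)
    return buffer.getvalue()
```

A real pipeline would parse the XML properly rather than use a regular
expression; the sketch only shows the shape of the split-and-package step.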