[odf-discuss] Metadata issues in ODF
Soren Roug
soren.roug at eea.europa.eu
Sun Sep 2 06:10:57 EDT 2007
I apologize for not talking about OOXML in this mail...
I've been writing a script that can convert the 20,000 Gutenberg texts
(www.gutenberg.org) to ODT, and I've discovered that ODF is not quite
adequate for simple *published* texts. First problem is that there is no
Publisher metadata field in meta.xml. The second more significant
problem is that there is no metadata field for the copyright information.
You can say, just put it in the text body. But there are now search
engines that look for *Creative Commons* licensed documents
(http://search.yahoo.com/web/advanced), and these search engines won't
have a chance to find the license, unless there is a metadata field for
it. Yes, you can say; just stick it in meta.xml anyway. But the spec
only says implementations *should* preserve unknown elements. OOo does
not, and the copyright is information you don't want to loose.
Finally, I converted Herodotus' Histories to ODT. You know him from the
movie The English Patient. He published his book in 430 BC, and
OpenDocument simply doesn't understand negative years.
I'm thinking of writing my issues up as use-cases and submit them to
OASIS, but I found out that just a week ago the metadata subcommittee
released a file called 07-08-22-ODF-Metadata-Proposal.odt for ODF 1.2. I
would like to read it to see if they have already seen the issues, but
it is password protected, and I don't want to pay the membership fee for
what could be a one-off. Does anyone have the file? And would you send
it to me (without breaking copyright laws and bylaws etc.)?
Sincerely yours,
Søren Roug
More information about the odf-discuss
mailing list