[odf-discuss] Mars: XMLisation of PDF - opportunity for ODF?
Lars D. Noodén
lars at umich.edu
Thu Nov 9 10:29:27 EST 2006
On Thu, 9 Nov 2006, Arend van Beelen wrote:
> There are certainly conceivable advantages. For example, a big advantage
> PDF has over its paper counterpart is that it can be searched by a
> computer, and PDFs can be indexed by search engines. Extracting searchable
> text from an XML-based format is far easier than writing a PDF parser and
> searching the text you get out of that.
Yes, it is probably easier to parse text out of an XML document, but PDF is
just a wrapper (if I understand correctly) and often holds a bitmapped
image or other material that is useless from a searching perspective.
Wrapping that same material in XML would not make it any more searchable.
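
As a rough illustration of both points, here is a minimal Python sketch
using the standard library's xml.etree.ElementTree. The element names
(doc, p, b, image) and the file name page1.png are made up for the
example and are not taken from Mars or any real format. It shows that
pulling text out of XML takes only a few lines, and that a document
holding nothing but an image reference yields no searchable text at all.

```python
import xml.etree.ElementTree as ET


def extract_text(xml_source: str) -> str:
    """Return all character data in the document, whitespace-normalised."""
    root = ET.fromstring(xml_source)
    # itertext() walks every element and yields its text content.
    # Joining with a space keeps adjacent elements from running together;
    # split()/join() then collapses any repeated whitespace.
    return " ".join(" ".join(root.itertext()).split())


# Hypothetical documents; the markup is invented for illustration only.
text_doc = "<doc><p>Searchable <b>text</b> here.</p></doc>"
scan_doc = '<doc><image href="page1.png"/></doc>'

print(repr(extract_text(text_doc)))  # 'Searchable text here.'
print(repr(extract_text(scan_doc)))  # ''
```

The second document parses just as easily as the first, but there is
nothing in it for a search engine to index.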
-Lars
Lars Noodén
Ensure access to your data in the future
http://opendocumentfellowship.org/about_us/contribute