[odf-discuss] Mars: XMLisation of PDF - opportunity for ODF?

Lars D. Noodén lars at umich.edu
Thu Nov 9 10:29:27 EST 2006


On Thu, 9 Nov 2006, Arend van Beelen wrote:
> There are certainly advantages thinkable.  For example, a big advantage PDF
> has over its paper counterpart is that it can be searched by a computer and
> PDF's can be indexed into search engines. Now extracting searchable text
> from an XML-based format is incredibly more easy than it is to write a
> PDF-parser and search the text you can get out of that.

Yes it is probably easier to parse text out of an XML document, but PDF is 
just a wrapper (if I understand correctly) and often holds a bitmapped 
image or other, from a searching perspective, useless material.

-Lars
Lars Noodén
 	Ensure access to your data in the future
 	http://opendocumentfellowship.org/about_us/contribute


More information about the odf-discuss mailing list