[odf-discuss] ODF and UTF-8/16/32-HEX/DEC/DECHTML

Alex Hudson alex at stratagia.co.uk
Wed Feb 21 12:47:54 EST 2007


Damon,

Damon Anderson wrote:
> Where I take issue is with using < and >. HTML supports UTF-8 
> Decimal fully, so should XML. < and > should be XML extensions, 
> not defaults. If our standards don't interact with each what good are 
> they? These same characters can be properly 'identified' as special 
> characters using UTF-8 (a full and international standard) as follows: 
> < = &lt; = &#60; | > = &gt; = &#62; | & = &amp; = &#38; (this is UTF-8 
> Decimal in HTML).

I think you're confused, I'm afraid :)  < isn't the same as &lt; and &#60;.

&lt; and &#60; are representations of the < character in an encoded 
form, but they are not the < itself (although from the data point of 
view, they _are_ equivalent). However, if you parse "a &gt; b", the 
actual UTF text you get out of the parser is "a < b".

Is there a specific problem you're hitting?

Cheers,

Alex.




More information about the odf-discuss mailing list