Re: Mix encodings in a document?

John Cowan (cowan@locke.ccil.org)
Wed, 23 Sep 1998 16:21:49 -0400


Deke Smith wrote:

> And what is the implications of this (if any) for XML rendering? I'm not
> sure of what you mean by "surrogates are correctly processed."

Essentially it means that the two 16-bit values that form a
surrogate-pair (representing a Unicode character on the Astral
Plane) is always treated as a single character.

In XML, surrogate-pairs can appear only in attribute values, #PCDATA
content, PIs, and comments; they are not allowed in element GIs,
attribute names, or the like.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)