Re: Mix encodings in a document?

John Cowan (cowan@locke.ccil.org)
Wed, 23 Sep 1998 16:23:09 -0400


Michael Kay wrote:

> No. UTF-16 is an encoding of ISO 10646 that uses 16 bits to
> represent the characters in the Basic MultiLingual Plane
> (BMP, equivalent to Unicode) and longer sequences to
> represent characters outside the BMP. It is thus a pure
> superset of UCS-2 or Unicode. See
> http://osiris.dkuug.dk/jtc1/sc2/wg2/docs/N1334.html

Almost. Unicode = UTF-16; Unicode applications are not
allowed to support only the BMP, although there are no
characters on the Astral Planes yet.

-- 
John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)