Re: Character Encoding and the XML PR (was Re: PR.xml)

David Megginson (ak117@freenet.carleton.ca)
Sat, 17 Jan 1998 07:06:32 -0500


James Clark writes:

> Are you saying that Java's 16-bit characters prevent complete support
> for some of those encodings in an XML parser? If so, I don't see why,
> since XML doesn't allow characters >= 0x110000, all legal XML characters
> are representable in UTF-16 and hence in Java.

Quite right, I wasn't connecting the two -- Java supports UCS-4 only
to the extent allowed by surrogates in UTF-16, but that's the limit in
XML as well, so there should be no problem (at least, not until
Unicode starts assigning codes >= 0x110000, in which case the problem
will be both Java's and XML's).

All the best,

David

-- 
David Megginson                 ak117@freenet.carleton.ca
Microstar Software Ltd.         dmeggins@microstar.com
      http://home.sprynet.com/sprynet/dmeggins/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@ic.ac.uk the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)