Re: Character Encoding and the XML PR (was Re: PR.xml)

James Clark (jjc@jclark.com)
Sat, 17 Jan 1998 10:35:19 +0700


David Megginson wrote:

> AElfred accepts the following encodings, and to my
> knowledge, supports them completely and correctly to the extent
> allowed by Java's 16-bit characters and by surrogates:
>
> - UTF-8
> - ISO-10646-UCS-2 (both byte orders)
> - ISO-10646-UCS-4 (four byte orders)
> - UTF-16
> - ISO-8859-1

Are you saying that Java's 16-bit characters prevent complete support
for some of those encodings in an XML parser? If so, I don't see why.
Since XML doesn't allow characters >= 0x110000, all legal XML characters
are representable in UTF-16 and hence in Java.
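
To make the arithmetic concrete, here is a minimal sketch of how a
character above 0xFFFF maps onto a pair of 16-bit Java chars using the
standard UTF-16 surrogate scheme. The class and method names are made
up for illustration; this is not taken from AElfred or any other parser.

```java
class SurrogateDemo {
    // Split a code point in 0x10000..0x10FFFF into a UTF-16 surrogate pair.
    static char[] toSurrogates(int codePoint) {
        if (codePoint < 0x10000 || codePoint > 0x10FFFF) {
            throw new IllegalArgumentException("not in 0x10000..0x10FFFF");
        }
        int v = codePoint - 0x10000;               // 20-bit offset
        char high = (char) (0xD800 + (v >> 10));   // top 10 bits
        char low  = (char) (0xDC00 + (v & 0x3FF)); // bottom 10 bits
        return new char[] { high, low };
    }

    // Recombine a high/low surrogate pair into the original code point.
    static int fromSurrogates(char high, char low) {
        return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00);
    }

    public static void main(String[] args) {
        // Largest character XML allows: 0x10FFFF.
        char[] pair = toSurrogates(0x10FFFF);
        System.out.printf("%04X %04X%n", (int) pair[0], (int) pair[1]);
        System.out.printf("%X%n", fromSurrogates(pair[0], pair[1]));
    }
}
```

The largest legal character, 0x10FFFF, comes out as the pair DBFF DFFF
and round-trips back to the same value, so nothing below 0x110000 falls
outside what two Java chars can hold.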

James