Imagine a plain text file that I want to mark up using XML. The file may
contain characters whose code is greater than 127, i.e. outside the
7-bit ASCII range (in PCDATA sections).
If I run expat on the generated XML file, it reports that the document
is not well-formed at the position where such a character occurs. Does
the XML spec say anything about not permitting characters outside the
ASCII range? If so, where?
I guess that, to interpret and display those characters correctly, I
have to know the character set that was used to encode the original
text file. How can I communicate this character set to an XML parser?
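
To make the problem concrete, here is a minimal sketch of what I am
seeing and of what I suspect the fix might look like. It uses Python's
bundled expat binding (xml.parsers.expat) rather than the C API. The
sample bytes, the helper name parse_text, and the guess that the
original file is ISO-8859-1 are all my assumptions. The first document
carries no encoding information and is rejected. The second adds an
encoding declaration to the XML prolog, which seems to be the way to
tell the parser which character set was used.

```python
import xml.parsers.expat


def parse_text(data: bytes) -> str:
    """Parse an XML document with expat and return its character data."""
    chunks = []
    parser = xml.parsers.expat.ParserCreate()
    # Collect every piece of character data (PCDATA) the parser reports.
    parser.CharacterDataHandler = chunks.append
    parser.Parse(data, True)  # True: this is the final chunk of input
    return "".join(chunks)


# 0xE9 is "e with acute accent" in ISO-8859-1 (assumed source encoding).
undeclared = b"<doc>caf\xe9</doc>"
# Same document, but with the encoding named in the XML declaration.
declared = b'<?xml version="1.0" encoding="ISO-8859-1"?>' + undeclared

try:
    parse_text(undeclared)
except xml.parsers.expat.ExpatError as err:
    # Without a declaration the input is treated as UTF-8, and a lone
    # 0xE9 byte is not a valid UTF-8 sequence.
    print("undeclared:", err)

print("declared:", parse_text(declared))
```

If this is right, the alternative would be to convert the text file to
UTF-8 before marking it up, so that no declaration is needed.
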
I would be happy if anybody could point me to a good place to start
reading about this issue.
Thanks and have a nice weekend,
-- Steffen Rodig rodig@sdm.de