XML passes all whitespace in the source to the application.
<P> We took <VAR TYPE="float">23.03+e02</VAR><UNIT>gram</UNIT> of water</P>
has no space. An Author can be told to enter either one, depending on what
they want. If you want the effect of my markup with the source you gave,
that's a CML convention, to the effect that VAR and UNIT "eat" adjacent
whitespace...
or a formatting convention.
The worst problem I see with whitespace is one that can't be solved by a
parser easily:
if I have a document bit like:
This is an end of paragraph.</P><p>And this is the start of another.
There's no way to tell that there isn't a word "paragraph.And" in the
document, without knowing the meaning of the tags. Of course there is only
one word in:
<font size=+5>L</font>arge initial letter
But this tends to bear out my view that whitespace handling is just the tip
of an iceberg only soluble with a lot of semantic knowledge -- that it is
the duty of stylesheet and DTD authors to determine.
_________________________________________
David Durand dgd@cs.bu.edu \ david@dynamicDiagrams.com
Boston University Computer Science \ Sr. Analyst
http://www.cs.bu.edu/students/grads/dgd/ \ Dynamic Diagrams
--------------------------------------------\ http://dynamicDiagrams.com/
MAPA: mapping for the WWW \__________________________