It worries me too! Here is a concrete example of a CRLF bug that I hit
today.
I have just used an offline browser called Snake to download a web site
authored in MS FrontPage. Some of the links have been correctly munged to
local links and some have not. Inspecting the HTML showed that the
correctly munged links looked like this:-
<AREA ... HREF="http://www.a.com/foo.htm">
whilst un-munged links looked like this:-
<AREA ...
HREF = "http://www.a.com/foo.htm">
It is easy to see what has happened here. The s/w developers have
a pattern for matching AREA elements that does not countenance the presence
of a CRLF.
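
To make the failure concrete, here is a minimal sketch in Python. Both patterns are my own guesses at the kind of matching involved (I have not seen Snake's code), and the names BRITTLE, TOLERANT and find_hrefs are hypothetical. The brittle pattern assumes the whole tag sits on one line with no space around "="; the tolerant one lets any whitespace, CRLF included, appear between attributes and around "=".

```python
import re

# The two forms quoted above; the "..." is kept literally, as in the post.
SAME_LINE = '<AREA ... HREF="http://www.a.com/foo.htm">'
SPLIT_LINE = '<AREA ...\r\nHREF = "http://www.a.com/foo.htm">'

# Hypothetical brittle pattern: "." never matches "\n" by default, and
# no whitespace is allowed around "=".
BRITTLE = re.compile(r'<AREA.*HREF="([^"]*)"')

# Whitespace-tolerant pattern: [^>]* spans line breaks inside the tag and
# \s* accepts spaces, CR and LF on either side of "=".
TOLERANT = re.compile(r'<AREA\b[^>]*?\bHREF\s*=\s*"([^"]*)"', re.IGNORECASE)


def find_hrefs(pattern, html):
    """Return every HREF value the given pattern finds in html."""
    return pattern.findall(html)
```

With these definitions the brittle pattern finds the link in SAME_LINE but returns nothing for SPLIT_LINE, while the tolerant pattern finds it in both.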
How should analogous problems in XML be addressed? Doing WS processing makes
pattern matching/state space handling easier, but at the expense of making it
very difficult to reproduce the elided WS to ensure lossless transformation.
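
One way to picture the trade-off, purely as a sketch and not a proposal for how an XML processor should work: keep the whitespace runs as tokens of their own rather than discarding them. Matching can then look only at the non-whitespace tokens, while output is rebuilt from the full token list, so nothing is lost. The function names tokenize, significant and rewrite are hypothetical, and splitting on whitespace alone is a simplifying assumption (it ignores whitespace inside quoted attribute values, for instance).

```python
import re

# Alternating runs of whitespace and non-whitespace; nothing is dropped.
TOKEN = re.compile(r'\s+|[^\s]+')


def tokenize(text):
    """Split text into whitespace and non-whitespace runs, in order."""
    return TOKEN.findall(text)


def significant(tokens):
    """The view a pattern matcher would see: whitespace tokens filtered out."""
    return [t for t in tokens if not t.isspace()]


def rewrite(text, old, new):
    """Replace one non-whitespace token, re-emitting all whitespace verbatim."""
    return ''.join(new if t == old else t for t in tokenize(text))
```

Joining the output of tokenize gives back the input byte for byte, so a rewrite that changes only the link token leaves the original CRLF and the spaces around "=" untouched.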