Re: String character range

From: Doug Tody <dtody-at-nrao.edu>
Date: Fri, 1 Aug 2008 08:32:04 -0600 (MDT)


Right. Probably what is allowable in string data should be restricted as you suggest. Other byte data is binary data (although it may contain printable characters) and would want to be hex encoded or whatever to be transmitted as text. In any case you may still need to process character data further for an XML (or other) transport, even if character data is in the range allowable for strings.

On Fri, 1 Aug 2008, Mark Taylor wrote:

> On Fri, 1 Aug 2008, Doug Tody wrote:
>
> > Why not just enclose character data in CDATA sections when they pass
> > through XML? This is the mechanism XML provides for pass through
> > of arbitrary data (also encoding individual chars such as &lt;).
>
> That's what I thought at first, but the characters with Unicode code points
> 0x01-0x08, 0x0B, 0x0C, 0x0E-0x1F are not permitted *at all* in an XML
> document. CDATA sections provide for escaping characters such as '<' and '&'
> which might be interpreted as XML markup, but that is not the problem here.
> See productions [18] - [21] in the XML spec
> (http://www.w3.org/TR/2006/REC-xml-20060816/ sec 2.7).
>
>
Received on 2008-08-01Z16:33:16