[XSL-LIST Mailing List Archive Home]
[By Thread]
[By Date]
RE: [xsl] Re: Character 150 withs Windows-1252 output
Subject: RE: [xsl] Re: Character 150 withs Windows-1252 output From: "Michael Kay" <mike@xxxxxxxxxxxx> Date: Fri, 21 Apr 2006 11:53:06 +0100 |
> Reading around a bit 150 is a control character... so does > that mean it shouldn't appear in source XML document > (unresolved) where the encoding is specified as ISO-8859-1 ?? I believe that in the ISO standard ISO 8859/1, the control blocks C0 and C1 (which includes 150) are unused - they are not part of the character set. However, according to Wikipedia [1], "the character map ISO_8859-1:1987, more commonly known by its preferred MIME name of ISO-8859-1 ... assigns the C0 and C1 control characters to the code values 00-1F, 7F, and 80-9F. The XML recommendation defines encodings in terms of their IANA definitions not their ISO definitions, so on that basis ISO-8859-1 does include the control character 150. In XML 1.1, there is a requirement that C0 and C1 characters (with obvious exceptions such as TAB) must be represented as character references. This is primarily to catch the common error where a Windows 1252 file is mislabelled as ISO-8859-1. [1] http://en.wikipedia.org/wiki/ISO_8859-1 Michael Kay http://www.saxonica.com/
Current Thread |
---|
|
<- Previous | Index | Next -> |
---|---|---|
[xsl] Re: Character 150 withs Windo, andrew welch | Thread | Re: [xsl] Re: Character 150 withs W, andrew welch |
RE: [xsl] Character 150 withs Windo, Michael Kay | Date | [xsl] Phonetic Sorting for Japanese, Subhojit Chatterjee |
Month |
Keywords