Re: [xsl] Charsets in Saxon


Subject: Re: [xsl] Charsets in Saxon
From: David Carlisle <davidc@xxxxxxxxx>
Date: Thu, 5 Feb 2004 13:21:56 GMT

  >  .. The characters in the document looks good though (as if 
  > the actual encoding was ISO-8859-1)

  They would do. ISO-8859-1 is essentially a subset of UTF-8==Unicode. UTF-8 is 
  I expect the common default.

The set of characters in ISO-8859-1 is a subset of the characters that
UTF-8 can encode, but the encodings of those characters are different.
The lower half of ISO-8859-1 (that is, ASCII) has the same encoding in
ISO-8859-1 and UTF-8, but the upper half is encoded differently (taking
one byte in ISO-8859-1 and two in UTF-8). So if you get these encodings
mixed up you would not expect the characters to "look good"; in fact
you'd expect to see a lot of spurious accented A symbols such as Â and Ã
(questions about which account for a fair amount of the messages on
this list :-)
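
The byte counts described above can be checked with a short sketch. This is
Python rather than anything Saxon-specific, and the helper names
byte_lengths and misread_utf8_as_latin1 are made up for illustration; it
only shows what happens when UTF-8 bytes are read as if they were
ISO-8859-1.

```python
def byte_lengths(ch):
    """Return (ISO-8859-1 length, UTF-8 length) in bytes for one character."""
    return len(ch.encode("iso-8859-1")), len(ch.encode("utf-8"))


def misread_utf8_as_latin1(text):
    """Encode text as UTF-8, then wrongly decode those bytes as ISO-8859-1."""
    return text.encode("utf-8").decode("iso-8859-1")


if __name__ == "__main__":
    # Lower half (ASCII): one byte in both encodings.
    print(byte_lengths("e"))        # (1, 1)
    # Upper half: one byte in ISO-8859-1, two bytes in UTF-8.
    print(byte_lengths("\u00e9"))   # (1, 2)
    # The two UTF-8 bytes 0xC3 0xA9 show up as two ISO-8859-1 characters.
    print(misread_utf8_as_latin1("caf\u00e9"))  # cafÃ©
```

Every character in the upper half of ISO-8859-1 gets a UTF-8 lead byte of
0xC2 or 0xC3, and those two bytes are Â and Ã in ISO-8859-1, which is
where the accented A symbols come from.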

David

-- 
http://www.dcarlisle.demon.co.uk/matthew

