Re: [xsl] Character substitution

Subject: Re: [xsl] Character substitution
From: David Carlisle <davidc@xxxxxxxxx>
Date: Mon, 10 Jan 2005 11:49:41 GMT

me> Your input has a reference to Unicode 128. That is a control character
me> (on the meaning of which you explicitly shouldn't depend).

  I would call it an ANSI character number 128 (using windows-1252 is just 
  too newfangled for me), with its associated Unicode number being 8364

Ah, but it would be wrong to call it that. Your input was XML and used a
character reference, and character references never depend on the
encoding of the file: they always refer to Unicode characters, and 128
is an undefined control character. That's the whole point of character
references: they allow you to refer to characters that are not in your
file encoding.
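
To make that concrete, here is a minimal sketch in Python (my choice of
language for illustration; the helper name resolve_reference is made up).
It parses the same numeric reference under three different declared
encodings, assuming the standard library's expat-based ElementTree parser,
and shows that the resulting code point does not change.

```python
import unicodedata
import xml.etree.ElementTree as ET


def resolve_reference(number, encoding):
    """Parse a tiny document containing &#number; and return the text.

    Hypothetical helper for illustration only.
    """
    doc = f'<?xml version="1.0" encoding="{encoding}"?><a>&#{number};</a>'
    # The document itself is pure ASCII, so it is valid in all three encodings.
    return ET.fromstring(doc.encode("ascii")).text


encodings = ["ISO-8859-1", "UTF-8", "US-ASCII"]
results_128 = {enc: resolve_reference(128, enc) for enc in encodings}
results_8364 = {enc: resolve_reference(8364, enc) for enc in encodings}

for enc in encodings:
    ch = results_128[enc]
    # Category "Cc" means a control character.
    print(enc, hex(ord(ch)), unicodedata.category(ch))
```

Whatever the declared encoding, the reference to 128 resolves to the
control character U+0080 and the reference to 8364 resolves to U+20AC.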

  isn't ANSI what you wordy folks call windows-1252 

No. ANSI is the American standards body that represents the USA in ISO.
Windows-1252 is a private encoding invented by some company in Seattle.
These can't be the same thing, surely.

> (once again for me this 
> was once known as CP1252)? doesn't ANSI define ANSI char 128 as the euro?

ANSI has nothing to do with the encoding that Microsoft sometimes call
ansi-new. Microsoft do more or less define euro to be character 128 in
their encoding, yes, but that is irrelevant to your input document, as
that uses &#128; which refers to Unicode 128, which definitely
isn't a euro.
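
The difference between "byte 128 in a Microsoft code page" and "Unicode
128" can be checked with a short Python sketch (Python is my choice for
illustration; the variable names are made up). It relies only on the
cp1252 and iso-8859-1 codecs that ship with the standard library.

```python
# One byte with value 128, as it might appear in a legacy-encoded file.
byte_128 = bytes([128])

# Interpreted as windows-1252 (cp1252), that byte is the euro sign.
as_cp1252 = byte_128.decode("cp1252")

# Interpreted as ISO-8859-1, the same byte is the control character U+0080.
as_latin1 = byte_128.decode("iso-8859-1")

print(ord(as_cp1252))   # Unicode number of the cp1252 interpretation
print(ord(as_latin1))   # Unicode number of the ISO-8859-1 interpretation

# Unicode character number 128 itself is not the euro sign.
print(chr(128) == "\u20ac")
```

So the byte value 128 only means "euro" once you have chosen windows-1252
as the encoding; a character reference never makes that choice, it names
the Unicode number directly.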

  btw the same forgiveness occurs when using &#8364; ....it renders into 
  euro symbol in Mozilla when char encoding is  ISO-8859-1...I must admit 
  that I find it difficult to determine the default behavior.

that's not forgiveness, that's what's supposed to happen.
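
As a rough illustration of why this is the expected behaviour, the Python
sketch below (my choice of language; the element name "price" is made up)
serializes a euro sign to ISO-8859-1, an encoding that has no euro, using
the standard library's ElementTree. The serializer falls back to a numeric
character reference, and parsing the bytes back yields the euro again.

```python
import xml.etree.ElementTree as ET

# Hypothetical element holding a euro sign (U+20AC).
elem = ET.Element("price")
elem.text = "\u20ac 10"

# ISO-8859-1 cannot encode U+20AC, so ElementTree writes it as a
# numeric character reference instead of a raw byte.
latin1_bytes = ET.tostring(elem, encoding="ISO-8859-1")

# Parsing the ISO-8859-1 bytes resolves the reference back to U+20AC.
round_tripped = ET.fromstring(latin1_bytes).text

print(latin1_bytes)
print(round_tripped == elem.text)
```

The file encoding limits which characters can appear directly as bytes; it
does not limit which characters a reference such as &#8364; can name.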

