[XSL-LIST Mailing List Archive Home] [By Thread] [By Date]

Re: [xsl] IXSLProcessor - input in UTF-8?


Subject: Re: [xsl] IXSLProcessor - input in UTF-8?
From: andrew welch <andrew.j.welch@xxxxxxxxx>
Date: Tue, 13 Sep 2005 16:18:07 +0100

On 9/13/05, Gabriel <gabriel.klappenbach@xxxxxxxxx> wrote:
> The input XML of course defines the encoding as UTF-8, and really is
encoded
> in UTF-8, but IXSLProcessor still reads it as ISO-8859-1. So it ignores the
> declaration in the XML file.
>
> The transform is taking place server side.
>
> So are you saying the IXSProcessor is supposed to be able to read the
> encoding specified in the XML file?

Yes - If the encoding is specified in the xml prologue in the file
then the xml parser has to read the file in that encoding - if it
doesn't then its non-conforming.

You may be doing by accident what I was trying to achieve the other
day - parse the file in a particular encoding ignoring the encoding
given in the prolog.  How are you reading the XML into the transform?
If it's in a non-xml aware way then you are likely reading the bytes
in the platform encoding and then the characters as XML.

In Java this is a FileInputStream -> InputStreamReader.  The
FileInputStream doesn't know about XML and so reads the file in
whatever encoding you tell it to.

If you post some code someone might know the equivalent in .Net


Current Thread
Keywords
xml