
Re: [xsl] Recognising Unicode in a CSV to XML transform


Subject: Re: [xsl] Recognising Unicode in a CSV to XML transform
From: "Andrew Welch" <andrew.j.welch@xxxxxxxxx>
Date: Fri, 30 May 2008 09:28:59 +0100

> I was viewing the result in the display window of the Kernow v1.6
> processor.

Ahh, I could probably do something there... In Java, where an encoding
isn't specified it uses the platform default encoding, which on
Windows is the single-byte "cp-1252"... so when you give it multi-byte
UTF-8 you get to see each byte of the multi-byte sequence rather than
the character that sequence represents (Ã is the character that cp-1252
maps to the byte 0xC3, which in UTF-8 is a lead byte indicating a
multi-byte character, which is why you often see it).
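
As a rough illustration of that effect (a sketch only, not Kernow's actual
code; the class and method names are made up for the example), the following
encodes é as UTF-8 and then decodes the two resulting bytes with the
single-byte Windows charset. Code points are printed rather than the
characters themselves, so the console's own encoding doesn't muddy the result:

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.stream.Collectors;

// Hypothetical demo class, not part of Kernow or Saxon.
class MojibakeDemo {
    // Encode the text as UTF-8, then decode those bytes as windows-1252,
    // which is what happens when UTF-8 output is read with the Windows default.
    static String misdecode(String text) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, Charset.forName("windows-1252"));
    }

    // Render each code point as U+XXXX, separated by spaces.
    static String codePoints(String s) {
        return s.codePoints()
                .mapToObj(cp -> String.format("U+%04X", cp))
                .collect(Collectors.joining(" "));
    }

    public static void main(String[] args) {
        String original = "\u00e9"; // e with acute accent, bytes C3 A9 in UTF-8
        // prints "U+00E9 -> U+00C3 U+00A9"
        System.out.println(codePoints(original) + " -> " + codePoints(misdecode(original)));
    }
}
```

One character goes in and two come out: U+00C3 (Ã) for the lead byte and
U+00A9 (©) for the continuation byte.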

In the meantime you can set the platform encoding using the "file.encoding"
switch:

-Dfile.encoding=UTF-8

For Kernow this means editing the run.bat to add that switch:

javaw -Dfile.encoding=UTF-8 -Xmx256m -cp kernow.jar.......
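
For Java code you control yourself, the other way round the problem is to
name the encoding at the point where bytes become characters, so the platform
default never comes into it. A minimal sketch, assuming the input really is
UTF-8 (the class and method names here are invented for illustration):

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of Kernow or Saxon.
class ExplicitEncodingDemo {
    // Read the first line of a stream as UTF-8, whatever the platform default is.
    static String readFirstLineUtf8(InputStream in) {
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(in, StandardCharsets.UTF_8))) {
            return reader.readLine();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Shows what the JVM falls back to when no encoding is given.
        System.out.println("default charset: " + Charset.defaultCharset());

        byte[] utf8 = "caf\u00e9".getBytes(StandardCharsets.UTF_8);
        String line = readFirstLineUtf8(new ByteArrayInputStream(utf8));
        // Four characters, not five: the two-byte sequence decoded to one character.
        System.out.println("characters read: " + line.length());
    }
}
```

Running it with and without -Dfile.encoding=UTF-8 is a quick way to check
which default the JVM has actually picked up.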

--
Andrew Welch
http://andrewjwelch.com
Kernow: http://kernowforsaxon.sf.net/

