[XSL-LIST Mailing List Archive Home] [By Thread] [By Date]

Re: [xsl] text extraction


Subject: Re: [xsl] text extraction
From: David Carlisle <davidc@xxxxxxxxx>
Date: Thu, 12 Oct 2006 17:02:32 +0100

> Note that ISO-8859-1 is an order of magnitude smaller then UTF-8, so you 
> may end up with missing or replaced characters (not sure what they will 
> be replaced with though, when they don't exist) in the output stream.

XSLT will not drop or replace characters.

If the character is in a place where a character reference may be used,
the character will be serialised as a character reference (& # 1234 ;) 
If it is at a place where a character ref can not be used, (eg an
element name, or in a comment, or anywhere in the text output method,
then that's an error) The processor may avoid these errors by instead
using utf8 or iutf16 (which can encode all characters).

David


Current Thread
Keywords