[XSL-LIST Mailing List Archive Home] [By Thread] [By Date]

RE: [xsl] Unicode character blocks in strings


Subject: RE: [xsl] Unicode character blocks in strings
From: tom tom <tomxsllist@xxxxxxxxxxx>
Date: Thu, 28 May 2009 15:00:22 +0100

Very simple, thankyou. Is there a way to return the Unicode character
block (eg CJKUnifiedIdeographs) a string / character belongs to? Or
will I have to manually build something up using the
string-to-codepoints() function?

----------------------------------------
> From: mike@xxxxxxxxxxxx
> To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
> Date: Tue, 26 May 2009 14:16:56 +0100
> Subject: RE: [xsl] Unicode character blocks in strings
>
>
> Try:
>
> 
> 
> 
> 
> 
> 
> 
> 
>
> Regards,
>
> Michael Kay
> http://www.saxonica.com/
> http://twitter.com/michaelhkay
>
>> -----Original Message-----
>> From: tom tom [mailto:tomxsllist@xxxxxxxxxxx]
>> Sent: 26 May 2009 14:08
>> To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
>> Subject: [xsl] Unicode character blocks in strings
>>
>>
>> I have a string containing a mix of Chinese and Latin
>> characters, eg *|.Z'J%R+,H1N1,y7PZL.
>> I wish to return a nodeset containing the following kind of structure:
>>
>>
>>
>> *|.Z'J%R+,
>> H1N1
>> ,y7PZL
>>
>>
>> Where H1N1 falls into the BasicLatin unicode character block
>> and the other two strings can be categorized as CJKUnifiedIdeographs.
>>
>> Can anyone suggest the cleanest way to do this using XSLT 2?
>>
>> Tom
>>
>> _________________________________________________________________
>> View your Twitter and Flickr updates from one place !V Learn more!
>> http://clk.atdmt.com/UKM/go/137984870/direct/01/
>

_________________________________________________________________
Share your photos with Windows Live Photos !V Free.
http://clk.atdmt.com/UKM/go/134665338/direct/01/


Current Thread
Keywords