Re: [xsl] Support for lookaround regexp in XSLT -- any time soon?
Subject: Re: [xsl] Support for lookaround regexp in XSLT -- any time soon?
From: "Imsieke, Gerrit, le-tex" <gerrit.imsieke@xxxxxxxxx>
Date: Fri, 01 Mar 2013 12:02:38 +0100
On 01.03.2013 11:40, Michael Kay wrote:
> (b) they wanted to exclude anything that didn't make sense in an international Unicode context (so things like word boundaries were immediately suspect)
If they had been concerned about what is and what is not a word constituent in a particular language, they wouldn't have included \w and \W in http://www.w3.org/TR/xmlschema-2/#cces
\w is locale-independently defined as:
[#x0000-#x10FFFF]-[\p{P}\p{Z}\p{C}] (all characters except the set of "punctuation", "separator" and "other" characters)
So I think \b, defined as a \w-\W boundary, a \W-\w boundary, the start anchor, or the end anchor, is also perfectly well-defined and works as expected in most circumstances.
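
To make the definition concrete, here is a minimal Python sketch (not part of any XSLT processor) that classifies characters by the \w definition quoted above and lists the offsets where a \b built on it would match. The helper names is_word_char and word_boundaries are made up for illustration, and treating the positions before the first and after the last character as \W is an assumption borrowed from the usual Perl-style convention.

```python
import unicodedata


def is_word_char(ch):
    # XSD-style \w: every character except those whose Unicode general
    # category starts with P (punctuation), Z (separator) or C (other).
    return unicodedata.category(ch)[0] not in "PZC"


def word_boundaries(text):
    # Assumption: the positions before the first and after the last
    # character behave like \W, so an edge is a boundary only when the
    # adjacent character is a \w character.
    flags = [False] + [is_word_char(c) for c in text] + [False]
    # Offset i is a boundary when exactly one of its neighbours is \w.
    return [i for i in range(len(text) + 1) if flags[i] != flags[i + 1]]


if __name__ == "__main__":
    print(word_boundaries("ab, cd!"))
    print(is_word_char("_"), is_word_char("\u6771"))
```

Because the classification depends only on Unicode general categories, it is the same in every locale. One consequence that differs from Perl: the underscore has category Pc (connector punctuation), so it is not a \w character under this definition.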
Gerrit