Need to tokenize words that are ALL CAPS
Posted: Sat Feb 28, 2015 1:33 am
I need to tokenize words that are ALL CAPS in certain text() nodes.
I can find them using the regex expression '\b(?=[A-Z])[A-Z ]+(?=\W)' except xslt doesn't support boundary identifiers like \b (word boundary)
This regex will find the two word groups 'WE DOING' and 'BOB' in the string below.
How are WE DOING today suPER BOB.
Notice it will not find, 'How' or 'suPER', which is correct because I only want ALL CAP words.
Anybody know of another regex that will work in analyze-string that would do this without using \b?
thanks,
Scott
PS: Why the **** doesn't XSLT support standard regex identities that PHP, Java, UNIX, and every other regex engine supports?
I can find them using the regex expression '\b(?=[A-Z])[A-Z ]+(?=\W)' except xslt doesn't support boundary identifiers like \b (word boundary)
This regex will find the two word groups 'WE DOING' and 'BOB' in the string below.
How are WE DOING today suPER BOB.
Notice it will not find, 'How' or 'suPER', which is correct because I only want ALL CAP words.
Anybody know of another regex that will work in analyze-string that would do this without using \b?
thanks,
Scott
PS: Why the **** doesn't XSLT support standard regex identities that PHP, Java, UNIX, and every other regex engine supports?