Unicode find/replace in regular expressions
Posted: Thu May 25, 2006 10:23 pm
Using the "regular expression" option in oXygen 7.1 Find/Replace, I can find a character using Perl \uNNNN syntax, but I cannot use that syntax to replace. For example, to search for é (accented 'e', U+00e9) it works if I put in the "Text to find" field, but if I want to replace the character with è, U+00e8, it does not work to put in the "Replace with" field. Instead, the replacement is literal 'u00e8'.
Is this a bug, or am I misunderstanding how Perl regex syntax works?
(I realize that I can use an XML numeric character entity as a workaround, or for that matter I can just input the UTF-8 character from my keyboard, but I have some use cases where it would be more convenient to use \u syntax.)
Code: Select all
\u00e9
Code: Select all
\u00e8
Is this a bug, or am I misunderstanding how Perl regex syntax works?
(I realize that I can use an XML numeric character entity as a workaround, or for that matter I can just input the UTF-8 character from my keyboard, but I have some use cases where it would be more convenient to use \u syntax.)