[oXygen-user] How to get the location of "invalid UTF-8 characters"?
Gibbons, Dave (US - Oregon)
David.Gibbons at Fiserv.com
Tue Feb 2 12:23:48 CST 2021
We have to search and replace high-bit characters sometimes. Here's what we do:
Find/Replace in Files
Enable the Regular expression option
Use this code:
[^\x00-\x7F]
If that gets garbled in posting, it's
left square bracket
caret
backslash
x00-
backslash
x7F
right square bracket
On 31.01.2021 00:30, Roger L Costello wrote:
> Hi Folks,
>
> I have a large XHTML file. Apparently, somewhere in the file is an invalid Unicode character because when I drag-and-drop the file into Oxygen I get the error message:
>
> Cannot open the specified file: Got a character encoding exception. File encoding (UTF-8) does not support all characters from the current file.
>
> The problem is this: Where is the invalid UTF-8 character?
>
> Is there a way to get Oxygen to tell me where -- what character position -- it detected the invalid Unicode character?
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.oxygenxml.com/pipermail/oxygen-user/attachments/20210202/f19a7f93/attachment.html>
More information about the oXygen-user
mailing list