How to find special characters?

fschmitt

Post by fschmitt » Mon Jul 22, 2019 8:12 pm

One of the documents I'm editing in Oxygen 21.1 triggers the "Special characters detected" warning message. I would like to search for those characters to check whether they are "legitimate" document content or the result of a transformation problem (it's a converted docx, so it can contain a variety of ugly content... :roll: ).

So, is there a way (e.g. a regex search) to detect all characters / Unicode control codes that may trigger the "Special characters detected" message?

The only "foreign" characters I was able to identify in the document were some Greek letters, but since Greek doesn't require bidirectional text layout, I doubt that they are responsible for triggering the message.

Radu

Re: How to find special characters?

Post by Radu » Tue Jul 23, 2019 10:02 am

Hi,

I'm afraid the application does not yet have a way to report which complex characters triggered the message. The warning usually appears when the document contains characters that combine, so that the font renders a single symbol for several characters. In that case, for example, moving the cursor with the arrow keys needs special handling so that it jumps over the combining characters as if they were one symbol.
Enabling the support for complex characters usually slows down opening and editing the document.
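
As a rough way to hunt for candidate characters outside Oxygen, a small script along the following lines may help. It is only a heuristic sketch: the exact set of characters that triggers the warning is not stated in this thread, so the categories checked here (combining marks, right-to-left or bidi control characters, invisible format characters, and characters outside the Basic Multilingual Plane) are assumptions, and the function names are made up for illustration.

```python
import unicodedata

# Bidi classes for right-to-left letters and explicit directional controls.
# Assumption: these are plausible triggers; Oxygen's real list may differ.
RTL_BIDI_CLASSES = {
    "R", "AL", "RLE", "RLO", "RLI", "LRE", "LRO", "LRI", "FSI", "PDF", "PDI",
}


def classify(ch):
    """Return a short reason if ch looks like a complex-layout character, else None."""
    if unicodedata.combining(ch) != 0 or unicodedata.category(ch) in ("Mn", "Mc", "Me"):
        return "combining mark"
    if unicodedata.bidirectional(ch) in RTL_BIDI_CLASSES:
        return "right-to-left or bidi control"
    if unicodedata.category(ch) == "Cf":
        return "format character"
    if ord(ch) > 0xFFFF:
        return "outside the BMP"
    return None


def find_suspect_chars(text):
    """Return (line, column, code point, name, reason) tuples, 1-based positions."""
    results = []
    for line_no, line in enumerate(text.split("\n"), start=1):
        for col_no, ch in enumerate(line, start=1):
            reason = classify(ch)
            if reason:
                name = unicodedata.name(ch, "<unnamed>")
                results.append((line_no, col_no, f"U+{ord(ch):04X}", name, reason))
    return results
```

To try it on a file, read the document as text (for example with `open(path, encoding="utf-8").read()`, assuming the file is UTF-8) and print each tuple returned by `find_suspect_chars`. Plain Greek letters are not reported by this check.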

There is an Oxygen GitHub project containing lots of sample plugins which you can download as a zip:

https://github.com/oxygenxml/wsaccess-j ... le-plugins

I just uploaded a plugin folder there called "determineComplexLayoutChars", which can be copied to the "OXYGEN_INSTALL_DIR\plugins" folder. After you start Oxygen, the plugin will add a new contextual menu action when an XML document is opened in the Text editing mode. This new "Determine Complex Layout Chars" action should run a detection and then report all detected characters in the results view.

Regards,
Radu
Radu Coravu
<oXygen/> XML Editor
http://www.oxygenxml.com

fschmitt

Re: How to find special characters?

Post by fschmitt » Tue Jul 23, 2019 6:31 pm

Thanks a lot @Radu - the plugin works great and I was able to solve the issue with your help :D
