Encoding of "oelig"
Posted: Thu Feb 20, 2014 1:32 pm
Hello!
I discovered that the "oelig" character "œ" has two definitions in the character map (and same for upper case):

Their shapes are slightly different, but it's not easy to recognize them at first glance.
The obvious reason for that situation is that they were included in the "Western European Character Set" before UTF-8 became the standard.
The same thing exists for other characters, e.g. "Š", "Ÿ" and probably some other ones I didn't notice.
Is there (or could there be) a way to warn users and prevent them for confusing these characters?
I discovered that the "oelig" character "œ" has two definitions in the character map (and same for upper case):

Their shapes are slightly different, but it's not easy to recognize them at first glance.
The obvious reason for that situation is that they were included in the "Western European Character Set" before UTF-8 became the standard.
The same thing exists for other characters, e.g. "Š", "Ÿ" and probably some other ones I didn't notice.
Is there (or could there be) a way to warn users and prevent them for confusing these characters?