[oXygen-user] How to type an UTF8 symbol in text as well as in author mode

Radu Coravu radu_coravu at sync.ro
Mon Feb 19 23:51:16 CST 2018


My presentation from XML Prague this year also covers this topic: the
conversion between bytes on disk and characters.

https://www.youtube.com/watch?v=JDOEMQD32Ss

Regards,
Radu

Radu Coravu
<oXygen/>  XML Editor, Schema Editor and XSLT Editor/Debugger
http://www.oxygenxml.com

On 2/19/2018 5:28 PM, Eliot Kimber wrote:
> The (hex.) column is the UTF-8 encoding of the character, that is, the
> sequence of bytes.
>
>
>
> The actual Unicode character number is the value in the first column,
> e.g., \u2190.
>
>
>
> So you should be able to type 2190 and get the character you want.
>
>
>
> Unicode is the character set and the character numbers (code points) are
> independent of how the characters are encoded.
>
>
>
> The encoding is how the characters are translated to bytes when written
> as a byte sequence.
>
>
>
> The Unicode standard defines a number of encodings, including UTF-8 and
> UTF-16.
>
>
>
> So there are not “UTF-8 characters”, only UTF-8 encodings of Unicode
> characters.
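To make the distinction concrete, here is a minimal Python sketch (Python is used only for illustration; it has nothing to do with how Oxygen works internally). It takes the single code point U+2190 from the table and shows that the bytes depend entirely on the encoding chosen:

```python
# One code point, several byte sequences: the encoding decides the bytes.
arrow = "\u2190"  # LEFTWARDS ARROW, code point U+2190

code_point = ord(arrow)                  # the character number, encoding-independent
utf8_bytes = arrow.encode("utf-8")       # three bytes in UTF-8
utf16_bytes = arrow.encode("utf-16-be")  # two bytes in big-endian UTF-16

print(f"U+{code_point:04X}")
print("UTF-8: ", " ".join(f"{b:02x}" for b in utf8_bytes))
print("UTF-16:", " ".join(f"{b:02x}" for b in utf16_bytes))

# Decoding with the matching encoding recovers the same character.
assert utf8_bytes.decode("utf-8") == utf16_bytes.decode("utf-16-be") == arrow
```

The UTF-8 line matches the "(hex.)" column of the table, while the code point matches its first column.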
>
>
>
> The UTF-8 encoding was designed so that it is identical to ASCII for the
> first 128 characters (U+0000 to U+007F), each of which takes one byte.
> Characters from U+0080 to U+07FF take two bytes, characters from U+0800
> to U+FFFF (such as these arrows) take three, and anything above U+FFFF
> takes four.
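As a cross-check on the byte counts: UTF-8 uses one byte up to U+007F, two up to U+07FF, three up to U+FFFF, and four beyond that. The sketch below encodes that rule in a hypothetical helper, `utf8_length`, and compares it against Python's own encoder for a few code points mentioned in this thread:

```python
def utf8_length(code_point: int) -> int:
    """Return how many bytes UTF-8 uses for one code point."""
    if code_point < 0x80:      # ASCII range
        return 1
    if code_point < 0x800:     # e.g. U+00A0 NO-BREAK SPACE
        return 2
    if code_point < 0x10000:   # rest of the first plane, e.g. U+2190
        return 3
    return 4                   # supplementary planes, e.g. U+1F926


for cp in (0x41, 0xA0, 0x2190, 0x1F926):
    # Compare the rule with what the real encoder produces.
    assert utf8_length(cp) == len(chr(cp).encode("utf-8"))
    print(f"U+{cp:04X}: {utf8_length(cp)} byte(s)")
```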
>
>
>
> Cheers,
>
>
>
> E.
>
> --
>
> Eliot Kimber
>
> http://contrext.com
>
>
>
>
>
> *From: *oXygen-user <oxygen-user-bounces at oxygenxml.com> on behalf of
> Bernhard Kleine <bernhard.kleine at gmx.net>
> *Date: *Monday, February 19, 2018 at 9:17 AM
> *To: *<oxygen-user at oxygenxml.com>
> *Subject: *Re: [oXygen-user] How to type an UTF8 symbol in text as well
> as in author mode
>
>
>
> The UTF-8 table at
> http://www.utf8-zeichentabelle.de/unicode-utf8-table.pl?start=8592
> shows these first four rows:
>
> Unicode code point   Character   UTF-8 (hex.)   Name
> U+2190               ←           e2 86 90       LEFTWARDS ARROW
> U+2191               ↑           e2 86 91       UPWARDS ARROW
> U+2192               →           e2 86 92       RIGHTWARDS ARROW
> U+2193               ↓           e2 86 93       DOWNWARDS ARROW
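The four rows can be reproduced with a short Python sketch, which may help show where each column comes from. The function name `table_rows` is made up for this example; the character names come from Python's bundled Unicode database:

```python
import unicodedata


def table_rows(start: int, count: int) -> list[tuple[str, str, str, str]]:
    """Build (code point, character, UTF-8 hex, name) rows for a range."""
    rows = []
    for cp in range(start, start + count):
        ch = chr(cp)
        # The "(hex.)" column is just the UTF-8 bytes printed in hex.
        utf8_hex = " ".join(f"{b:02x}" for b in ch.encode("utf-8"))
        rows.append((f"U+{cp:04X}", ch, utf8_hex, unicodedata.name(ch)))
    return rows


for row in table_rows(0x2190, 4):
    print("  ".join(row))
```

Only the first column is a value you would type as a code point; the third column is derived from it by the encoding.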
>
> When I tried to convert a UTF-8 hex value in a simple document, using
> Ctrl-Shift-X, I got this error:
>
> (not a valid hexadecimal sequence to change)
>
> I also tried the 0x1F926 from Ben's example below, with the same error.
> What am I doing wrong?
>
> These arrows would be a good example since they will be used.
>
> Regards
>
> Bernhard
>
>
>
> On 19.02.2018 at 10:03, Oxygen XML Editor Support (Radu Coravu) wrote:
>
>     Hi,
>
>     Thanks for the reminder Ben.
>     Indeed I forgot about this feature in Oxygen:
>
>     https://www.oxygenxml.com/doc/versions/19.1/ug-editor/topics/text-mode-actions.html#text-mode-actions__convert-hex-sequence
>
>     which basically allows you to type away the hex digits in Oxygen and
>     then invoke the special "Convert Hexadecimal Sequence to Character"
>     action.
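As a rough illustration of what such a conversion involves (a sketch only, not Oxygen's actual implementation, and whether Oxygen accepts a "0x" or "U+" prefix is not something this example claims), the hypothetical helper `hex_to_char` below treats the typed digits as a code point. It also shows why the bytes from the "(hex.)" column need a different treatment:

```python
def hex_to_char(digits: str) -> str:
    """Interpret hex digits as a Unicode code point and return the character."""
    text = digits.strip()
    # Accepting these prefixes is an assumption made for this sketch.
    if text.lower().startswith(("0x", "u+")):
        text = text[2:]
    return chr(int(text, 16))


print(hex_to_char("2190"))     # code point from the first table column
print(hex_to_char("0x1F926"))  # a character outside the first plane

# "e2 86 90" is not a code point but a UTF-8 byte sequence;
# it has to be decoded rather than read as one number.
print(bytes.fromhex("e2 86 90").decode("utf-8"))
```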
>
>     Regards,
>     Radu
>
>     Radu Coravu
>     <oXygen/>  XML Editor, Schema Editor and XSLT Editor/Debugger
>     http://www.oxygenxml.com
>
>     On 2/19/2018 10:56 AM, Ben McGinnes wrote:
>
>         On Mon, Feb 19, 2018 at 09:33:28AM +0200, Oxygen XML Editor
>         Support (Radu Coravu)  wrote:
>
>             Hi Bernhard,
>
>             It seems that for "nbsp", which has the decimal equivalent
>             "160", you would need to hold "ALT" and then type "0160"; that
>             leading "0" seems to be important. The same probably goes for
>             all other characters: type their decimal equivalent, but it
>             needs to be four typed digits.
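A likely reason the leading "0" matters: on Windows, Alt codes typed without it are looked up in the legacy OEM code page, while codes typed with it use the ANSI code page. The sketch below models that with a hypothetical `alt_code` function, assuming code page 437 for OEM and Windows-1252 for ANSI (both depend on the system locale):

```python
def alt_code(digits: str, oem: str = "cp437", ansi: str = "cp1252") -> str:
    """Model a Windows Alt+digits sequence for values below 256."""
    value = int(digits)
    # A leading zero selects the ANSI code page, otherwise the OEM one.
    codec = ansi if digits.startswith("0") else oem
    return bytes([value]).decode(codec)


print(repr(alt_code("0160")))  # NO-BREAK SPACE, U+00A0
print(repr(alt_code("160")))   # a different character in the OEM code page
```

In Windows-1252 the byte 160 is the no-break space, which is why "0160" gives the expected result there.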
>
>
>         Oh, how quickly we forget certain things.  :)
>
>         oXygen has had the ability to enter Unicode characters in the first
>         plane by their four-character hexadecimal code point value since
>         version 17.1.  I can't recall what the default hotkey is for
>         invoking it because I changed mine (back) to F8 as soon as I
>         installed that version.  I believe I've still got the plugin you
>         guys provided me during my trial period for 17.0.
>
>         Anyway, if Bernhard is happy with using hex instead of int, that's
>         the solution instead of the Windows alt sequences (or the Mac
>         alt/option sequences either, for that matter).
>
>         ....
>
>
>
>         bash-4.4$ unum.pl 0x1f926
>            Octal  Decimal      Hex        HTML    Character   Unicode
>          0374446   129318  0x1F926   🤦    "🤦"         FACE PALM
>         bash-4.4$
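The numeric columns in that output are all the same code point written in different bases. A small Python sketch with a hypothetical `describe` function shows the relationship (the name lookup relies on Python's bundled Unicode database, so an older Python may not know newer characters):

```python
import unicodedata


def describe(code_point: int) -> str:
    """Show one code point in octal, decimal and hex, plus its Unicode name."""
    name = unicodedata.name(chr(code_point), "<unknown name>")
    return (
        f"octal 0{code_point:o}  decimal {code_point}  "
        f"hex 0x{code_point:X}  {name}"
    )


print(describe(0x1F926))
print(describe(0x2190))
```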
>
>         Obviously some of us can see that character properly and some
>         can't, but you all know which it is.
>
>
>         Regards,
>         Ben
>
>
>
>         _______________________________________________
>         oXygen-user mailing list
>         oXygen-user at oxygenxml.com<mailto:oXygen-user at oxygenxml.com>
>         https://www.oxygenxml.com/mailman/listinfo/oxygen-user
>
>
>
>
> --
>
> spitzhalde9
>
> D-79853 lenzkirch
>
> bernhard.kleine at gmx.net<mailto:bernhard.kleine at gmx.net>
>
> www.b-kleine.com<http://www.b-kleine.com>,
> www.urseetal.net<http://www.urseetal.net>
>
> -
>
> Thunderbird with Enigmail
>
> GPG key: D5257409
>
> fingerprint:
>
> 08 B7 F8 70 22 7A FC C1 15 49 CA A6 C7 6F A0 2E D5 25 74 09
>
>
>
>
>



