


Unicode defines a codespace of 1,112,064 assignable code points within the U+0000 to U+10FFFF range.Įach of the assignable codepoints, once assigned by the Unicode consortium, maps to a specific character, emoji or control symbol. Unicode codepoints and encodingĪ Unicode code point is a number ranging from 0x0 to 0x10FFFF, usually represented using U+0000 to U+10FFFF syntax. Special care must be taken to consider the length of strings when generating message entities as the number of UTF-16 code units, even if the message itself must be encoded using UTF-8.Įxample implementations: tdlib, MadelineProto. Telegram supports styled text using message entities.Ī client that wants to send styled messages would simply have to integrate a Markdown/ HTML parser, and generate an array of message entities by iterating through the parsed tags.
