The direct answer to your question is U+10FFFD, which is a private-use character from Supplementary Private Use Area-B. U+10FFFE and U+10FFFF are not allowed because they are noncharacters: code points that the Unicode Standard permanently reserves and does not intend for open interchange. Thanks Jonathan for actually answering the question.
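A quick way to check this, as a minimal sketch using only Python's standard `unicodedata` module (the variable names are illustrative):

```python
import sys
import unicodedata

last_private_use = chr(0x10FFFD)
last_code_point = chr(sys.maxunicode)  # U+10FFFF on Python 3.3+

# General category "Co" means private use; "Cn" means not assigned
# (noncharacters such as U+10FFFE and U+10FFFF are reported as "Cn").
print(hex(sys.maxunicode))                     # 0x10ffff
print(unicodedata.category(last_private_use))  # Co
print(unicodedata.category(chr(0x10FFFE)))     # Cn
print(unicodedata.category(last_code_point))   # Cn
```

Anything above U+10FFFF is outside the Unicode code space, so `chr(0x110000)` raises `ValueError`.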
What character is 0x9D?
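The Python "latin-1" codec translates it to U+009D, which is the C1 control character "Operating System Command" (OSC). As printable text that makes little sense: the character has no visible glyph, so when it is displayed you typically get a box containing [009D].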
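The Latin-1 behaviour can be reproduced in a few lines. This is a small sketch with the standard library only; the `"<no name>"` fallback string is just a placeholder chosen for the example:

```python
import unicodedata

raw = bytes([0x9D])
ch = raw.decode("latin-1")  # Latin-1 maps byte N straight to code point N

code_point = ord(ch)                      # 0x9D
category = unicodedata.category(ch)       # "Cc" marks a control character
name = unicodedata.name(ch, "<no name>")  # control characters have no formal name

print(f"U+{code_point:04X}", category, name)  # U+009D Cc <no name>
```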
Which character is Ã?
A with tilde (majuscule: Ã, minuscule: ã) is a letter of the Latin alphabet formed by addition of the tilde diacritic over the letter A. It is used in Portuguese, Guaraní, Kashubian, Taa, Aromanian, and Vietnamese.
Why does É become Ã?
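The reason lies in the UTF-8 representation. Code points up to 127 (0x7F) are represented with 1 byte only, and that byte equals the ASCII value. Code points from 128 to 2047 (0x7FF) take 2 bytes. "é" is U+00E9 (233), so it is coded on 2 bytes, and its UTF-8 representation is 11000011 10101001 (0xC3 0xA9). If those bytes are then read as Latin-1 or Windows-1252, each byte is shown as its own character: 0xC3 appears as Ã and 0xA9 as ©, giving "Ã©". The uppercase É (U+00C9) is encoded as 0xC3 0x89, so it also starts with Ã.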
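To see both steps (correct encoding, then wrong decoding), here is a short sketch. It assumes the wrong decoder is Windows-1252, which is a common case but not the only possible one:

```python
encoded = "\u00e9".encode("utf-8")               # "é" as UTF-8 bytes
hex_bytes = " ".join(f"{b:02X}" for b in encoded)
bits = " ".join(f"{b:08b}" for b in encoded)

# Decoding the same bytes with the wrong codec turns each byte into its own character.
mojibake_lower = encoded.decode("cp1252")
mojibake_upper = "\u00c9".encode("utf-8").decode("cp1252")  # same mistake for "É"

print(hex_bytes)       # C3 A9
print(bits)            # 11000011 10101001
print(mojibake_lower)  # Ã©
print(mojibake_upper)  # Ã‰
```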
What is the meaning of à ⠀?
It is a character encoding issue. Whoever is sending the mail is using a character set that does not match the one your mail client uses to display the message. Open the View menu (Alt+V), choose Character Encoding, and select UTF-8 (Unicode); the text should then display correctly.
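If the garbled text has already been saved, it can sometimes be repaired in code by reversing the wrong decoding step. The helper below is a hypothetical sketch, not a general-purpose fixer: `repair_mojibake` is a name made up for this example, and it assumes the text was UTF-8 that got decoded as Windows-1252.

```python
def repair_mojibake(text: str, wrong_codec: str = "cp1252") -> str:
    """Undo text that was UTF-8 but was decoded with `wrong_codec`."""
    try:
        # Turn the characters back into the original bytes, then decode them properly.
        return text.encode(wrong_codec).decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        # Not recoverable this way (or not garbled at all): leave it unchanged.
        return text


garbled = "caf\u00c3\u00a9"  # displays as "cafÃ©"
print(repair_mojibake(garbled))  # café
```

Text that was never garbled falls through the `except` branch and comes back unchanged, so the helper is reasonably safe to apply to mixed input.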
What type of encoding is UTF-8?
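Unicode. UTF-8 is one of the encoding forms defined by the Unicode Standard; it stores each code point in one to four bytes.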
What is 0xC3?
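From Wikipedia, the first byte is called the leading byte and the second the trailing (continuation) byte. 0xC3 is a leading byte: its top bits 110 signal a two-byte sequence, and its remaining five bits (00011) are the high bits of the code point. The trailing byte 0xA9 starts with 10 and supplies the low six bits (101001). Together they give 00011101001, which is 0xE9, the Unicode value for é. May 4, 2012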
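The bit arithmetic for this two-byte sequence can be written out by hand. The sketch below covers only the two-byte case and does no validation beyond two sanity checks:

```python
lead, trail = 0xC3, 0xA9

# A two-byte lead looks like 110xxxxx; a continuation byte looks like 10xxxxxx.
assert lead >> 5 == 0b110
assert trail >> 6 == 0b10

high_bits = lead & 0b0001_1111   # 00011  (payload of the leading byte)
low_bits = trail & 0b0011_1111   # 101001 (payload of the trailing byte)
code_point = (high_bits << 6) | low_bits

print(hex(code_point), chr(code_point))  # 0xe9 é
```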
What char is 0xA0?
Name:                  No-Break Space (NBSP)
---------------------- --------------------------------
Character is Mirrored: No
GCGID:                 SP300000
HTML Entity:           &nbsp;
UTF-8 Encoding:        0xC2 0xA0
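These properties can be confirmed with Python's standard library; a brief sketch:

```python
import html
import unicodedata

nbsp = "\u00a0"

print(unicodedata.name(nbsp))           # NO-BREAK SPACE
print(nbsp.encode("utf-8"))             # b'\xc2\xa0'
print(html.unescape("&nbsp;") == nbsp)  # True: the HTML entity decodes to U+00A0
print(nbsp == " ")                      # False: it is not the ASCII space (0x20)
```

The last line is the usual source of confusion: a no-break space looks like a space but does not compare equal to one.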
What character is 0x81?
ISO Name:              High Octet Preset (HOP)
---------------------- -------------------------
Combining Class:       Not Reordered (0)
Character is Mirrored: No
HTML Entity:           &#129; (numeric reference; no named entity exists)
UTF-8 Encoding:        0xC2 0x81
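How byte 0x81 is interpreted depends on the codec. The sketch below contrasts Latin-1 with Python's `cp1252` codec, which leaves 0x81 unmapped; the `cp1252_maps_0x81` flag is just an illustrative name:

```python
ch = "\u0081"

print(ch.encode("utf-8"))               # b'\xc2\x81'
print(b"\x81".decode("latin-1") == ch)  # True: Latin-1 maps 0x81 to U+0081

# Python's cp1252 codec has no mapping for byte 0x81, so decoding fails.
try:
    b"\x81".decode("cp1252")
    cp1252_maps_0x81 = True
except UnicodeDecodeError:
    cp1252_maps_0x81 = False

print(cp1252_maps_0x81)                 # False
```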
What character is E2 80 93?
Character          –
------------------ --------
Character name     EN DASH
Hex code point     2013
Decimal code point 8211
Hex UTF-8 bytes    E2 80 93
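The table values can be reproduced from the three bytes. This sketch decodes them with the built-in codec and also shows the three-byte bit layout (1110xxxx 10xxxxxx 10xxxxxx) by hand:

```python
import unicodedata

raw = bytes.fromhex("E2 80 93")
dash = raw.decode("utf-8")

# Manual decode: 4 payload bits from the lead byte, 6 from each continuation byte.
manual = ((raw[0] & 0x0F) << 12) | ((raw[1] & 0x3F) << 6) | (raw[2] & 0x3F)

print(unicodedata.name(dash))  # EN DASH
print(f"{ord(dash):04X}")      # 2013
print(ord(dash))               # 8211
print(manual == ord(dash))     # True
```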
What kind of character is â?
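What is this character ( Â ) and how do I remove it with PHP? It is showing up in strings pulled from webpages. It shows up where there was previously an empty space in the original string on the original site. This is the actual character that is stored in my database. Sep 6, 2017
A stray Â in front of what looks like a space is usually a no-break space (U+00A0) whose UTF-8 bytes, 0xC2 0xA0, were read as Latin-1: 0xC2 is displayed as Â and 0xA0 as a no-break space.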
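The question asks about PHP; the sketch below uses Python instead, only to illustrate the mechanism. It assumes the stored string came from UTF-8 bytes that were decoded as Latin-1, and the sample text is made up:

```python
original = "price:\u00a0100"  # contains a real no-break space

# Simulate the bug: UTF-8 bytes decoded as Latin-1.
stored = original.encode("utf-8").decode("latin-1")
print(repr(stored))    # 'price:Â\xa0100'

# Reverse the wrong decoding, then swap the no-break space for a plain space.
repaired = stored.encode("latin-1").decode("utf-8")
cleaned = repaired.replace("\u00a0", " ")
print(repr(cleaned))   # 'price: 100'
```

Reversing the decoding is generally safer than deleting Â outright, since deletion would also remove legitimate Â characters from the text.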
Is UTF-8 a character?
No. UTF-8 is not a character; it is a variable-width character encoding used for electronic communication. It is defined by the Unicode Standard, and its name is derived from Unicode (or Universal Coded Character Set) Transformation Format 8-bit.
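"Variable-width" means that different characters take different numbers of bytes. A small sketch, with sample characters chosen for this example:

```python
samples = {
    "A": "A",                    # U+0041, ASCII range
    "e-acute": "\u00e9",         # U+00E9
    "en dash": "\u2013",         # U+2013
    "emoji": "\U0001F600",       # U+1F600, outside the Basic Multilingual Plane
}

# Map each label to the number of bytes its character needs in UTF-8.
widths = {label: len(ch.encode("utf-8")) for label, ch in samples.items()}
print(widths)  # {'A': 1, 'e-acute': 2, 'en dash': 3, 'emoji': 4}
```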