Related articles:
Unicode
Byte order mark
Unicode and e-mail
Mojibake
GB 18030
Code point
Variable-width encoding
UTF-16/UCS-2
Unicode and HTML
Character encoding
Comparison of Unicode encodings
UTF-32/UCS-4
Universal Character Set
Bush hid the facts
Extended ASCII
Code page
ASCII
Character encodings in HTML
UTF-1
ISO/IEC 8859-1
XML
ISO/IEC 8859
Shebang (Unix)
Iconv
Plan 9 from Bell Labs
Rob Pike
Ken Thompson
N
HTML
Asterisk
Key terms:
byte
encoding
unicode
ascii
bits
code points
invalid
bom
character set
utf
ascii characters
unicode standard
byte sequence
unicode character
rfc
character encoding
api
decoding
first byte
browsers
hex
iec
two bytes
disadvantages
byte order
parser
unicode code points
universal character set
cyrillic
one byte
byte stream
multilingual
byte order mark
surrogate
bell labs
four bytes
unicode text
code unit
mapping of unicode character planes
concatenated
simplistic
overlong
binary value
sorting
basic multilingual plane
prosser
hexadecimal
encoded using
Search external links cited by footnotes on Wikipedia page UTF-8:
|
|