UTF-8 and Unicode Standards :: What is UTF-8?
UTF-8 stands for Unicode
Transformation
Format-8. It is an octet (8-bit)
lossless encoding of Unicode characters.
UTF-8 encodes each Unicode character as a variable number of 1 to 4
octets, where the number of octets depends on the integer value assigned
to the Unicode character. It is an efficient encoding of Unicode
documents that use mostly US-ASCII characters because it […]