- published: 20 Sep 2013
- views: 349170
Unicode is a computing industry standard for the consistent encoding, representation and handling of text expressed in most of the world's writing systems. Developed in conjunction with the Universal Character Set standard and published in book form as The Unicode Standard, the latest version of Unicode consists of a repertoire of more than 110,000 characters covering 100 scripts, a set of code charts for visual reference, an encoding methodology and set of standard character encodings, an enumeration of character properties such as upper and lower case, a set of reference data computer files, and a number of related items, such as character properties, rules for normalization, decomposition, collation, rendering, and bidirectional display order (for the correct display of text containing both right-to-left scripts, such as Arabic and Hebrew, and left-to-right scripts). As of 2012, the most recent version is Unicode 6.1.
Unicode's success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software. The standard has been implemented in many recent technologies, including XML, the Java programming language, the Microsoft .NET Framework, and modern operating systems.
Characters, Symbols and the Unicode Miracle - Computerphile
Introduction to UTF-8 and Unicode
unicode
Lecture 12/12: ASCII and Unicode
Decode unicode - the world's writing systems: Johannes Bergerhausen at TEDxVienna
What is Unicode?
Characters in a computer - Unicode Tutorial (UTF-32 & UTF-16)(2/3)
Travis Fischer, Esther Nam: Character encoding and Unicode in Python - PyCon 2014
CppCon 2014: James McNellis "Unicode in C++"
Unicode and character encoding
Unicode - Decoded
Unicode - Mars Needs Women
Mathias Bynens: JavaScript ♥ Unicode
Emoji and the Levitating Businessman - Computerphile