UTF-16

The first 216 Unicode code points. The stripe of solid gray near the bottom are the surrogate halves used by UTF-16 (the white region below the stripe is the Private Use Area)

Character encoding capable of encoding all 1,112,064 valid character code points of Unicode (in fact this number of code points is dictated by the design of UTF-16).

- UTF-16
The first 216 Unicode code points. The stripe of solid gray near the bottom are the surrogate halves used by UTF-16 (the white region below the stripe is the Private Use Area)

19 related topics

Alpha

Punched tape with the word "Wikipedia" encoded in ASCII. Presence and absence of a hole represents 1 and 0, respectively; for example, "W" is encoded as "1010111".

Variable-width encoding

Type of character encoding scheme in which codes of differing lengths are used to encode a character set for representation, usually in a computer.

Type of character encoding scheme in which codes of differing lengths are used to encode a character set for representation, usually in a computer.

Punched tape with the word "Wikipedia" encoded in ASCII. Presence and absence of a hole represents 1 and 0, respectively; for example, "W" is encoded as "1010111".

The Unicode standard has two variable-width encodings: UTF-8 and UTF-16 (it also has a fixed-width encoding, UTF-32).

Punched tape with the word "Wikipedia" encoded in ASCII. Presence and absence of a hole represents 1 and 0, respectively; for example, "W" is encoded as "1010111".

CCSID

16-bit number that represents a particular encoding of a specific code page.

16-bit number that represents a particular encoding of a specific code page.

Punched tape with the word "Wikipedia" encoded in ASCII. Presence and absence of a hole represents 1 and 0, respectively; for example, "W" is encoded as "1010111".

For example, Unicode is a code page that has several encoding (so called "transformation") forms, like UTF-8, UTF-16 and UTF-32, but which may or may not actually be accompanied by a CCSID number to indicate that this encoding is being used.

Text file of The Human Side of Animals by Royal Dixon, displayed by the command in an xterm window

Plain text

Loose term for data that represent only characters of readable material but not its graphical representation nor other objects (floating-point numbers, images, etc.).

Loose term for data that represent only characters of readable material but not its graphical representation nor other objects (floating-point numbers, images, etc.).

Text file of The Human Side of Animals by Royal Dixon, displayed by the command in an xterm window

As Unicode-based encodings such as UTF-8 and UTF-16 become more common, that usage may be shrinking.

A screenshot of Manjaro running the Cinnamon desktop environment, Firefox accessing Wikipedia which uses MediaWiki, LibreOffice Writer, Vim, GNOME Calculator, VLC and Nemo file manager, all of which are open-source software.

International Components for Unicode

Open-source project of mature C/C++ and Java libraries for Unicode support, software internationalization, and software globalization.

Open-source project of mature C/C++ and Java libraries for Unicode support, software internationalization, and software globalization.

A screenshot of Manjaro running the Cinnamon desktop environment, Firefox accessing Wikipedia which uses MediaWiki, LibreOffice Writer, Vim, GNOME Calculator, VLC and Nemo file manager, all of which are open-source software.

ICU has historically used UTF-16, and still does only for Java; while for C/C++ UTF-8 is supported, including the correct handling of "illegal UTF-8".

Swift (programming language)

General-purpose, multi-paradigm, compiled programming language developed by Apple Inc. and the open-source community.

General-purpose, multi-paradigm, compiled programming language developed by Apple Inc. and the open-source community.

This included the NSString string library (using Unicode, UTF-8 in Swift 5, changed from UTF-16), the NSArray and NSDictionary collection classes, and others.

Overview of the ISO 9660 directory structure

ISO 9660

File system for optical disc media.

File system for optical disc media.

Overview of the ISO 9660 directory structure

Joliet accomplishes this by supplying an additional set of filenames that are encoded in UCS-2BE (UTF-16BE in practice since Windows 2000).

PHP

General-purpose scripting language geared toward web development.

General-purpose scripting language geared toward web development.

This is an example of PHP code for the WordPress content management system.
The elePHPant, PHP mascot
A "Hello World" application in PHP 7.4 running on its built-in development server
Example output of the phpinfo function in PHP 7.1
A broad overview of the LAMP software bundle, displayed here together with Squid
Dynamic web page: example of server-side scripting (PHP and MySQL)

In 2005, a project headed by Andrei Zmievski was initiated to bring native Unicode support throughout PHP, by embedding the International Components for Unicode (ICU) library, and representing text strings as UTF-16 internally.

Main Menu of IBM i 7.1, shown inside a TN5250 client

IBM i

Operating system developed by IBM for IBM Power Systems.

Operating system developed by IBM for IBM Power Systems.

Main Menu of IBM i 7.1, shown inside a TN5250 client
Main Menu of IBM i 7.1, shown inside a TN5250 client
IBM i5/OS logo
Original IBM i logo
Diagram showing the architectural layers of the IBM i operating system, and their relationship to hardware and user applications
IBM i during initial program load of the SLIC
Main Menu of SSP 7.5, running on top of the Advanced 36 Machine environment

IBM i uses EBCDIC as the default character encoding, but also provides support for ASCII, UCS-2 and UTF-16.

Mobile phone tower

GSM 03.38

Character encoding used in GSM networks for SMS , CB (Cell Broadcast) and USSD (Unstructured Supplementary Service Data).

Character encoding used in GSM networks for SMS , CB (Cell Broadcast) and USSD (Unstructured Supplementary Service Data).

Mobile phone tower

However, since modern programming environments do not provide encoders or decoders for UCS-2, some cell phones (e.g. iPhones) use UTF-16 instead of UCS-2.