ISO/IEC 8859-11

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

ISO/IEC 8859-11:2001, Information technology — 8-bit single-byte coded graphic character sets — Part 11: Latin/Thai alphabet, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 2001. It is informally referred to as Latin/Thai. It is nearly identical to the national Thai standard TIS-620 (1990). The sole difference is that ISO/IEC 8859-11 allocates non-breaking space to code 0xA0, while TIS-620 leaves it undefined. (In practice, this small distinction is usually ignored.)

ISO-8859-11 is not a main registered IANA charset name despite following the normal pattern for IANA charsets based on the ISO 8859 series. However, it is defined as an alias[1] of the close equivalent TIS-620 (which lacks the non-breaking space), and which can without problems be used for ISO/IEC 8859-11, since the no-break space has a code which was unallocated in TIS-620. Microsoft has assigned code page 28601 a.k.a. Windows-28601 to ISO-8859-11 in Windows.[2] A draft had the Thai letters in different spots.[3]

As with all varieties of ISO/IEC 8859, the lower 128 codes are equivalent to ASCII. The additional characters, apart from no-break space, are found in Unicode in the same order, only shifted from 0xA1 to U+0E01 and so forth.

The Microsoft Windows code page 874 as well as the code page used in the Thai version of the Apple Macintosh, MacThai, are extensions of TIS-620 — incompatible with each other, however.

Character set[edit]

  Letter   Number   Punctuation   Symbol   Other  Undefined

ISO/IEC 8859-11
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
0_
0
1_
16
2_
32
SP
0020
!
0021
"
0022
#
0023
$
0024
%
0025
&
0026
'
0027
(
0028
)
0029
*
002A
+
002B
,
002C
-
002D
.
002E
/
002F
3_
48
0
0030
1
0031
2
0032
3
0033
4
0034
5
0035
6
0036
7
0037
8
0038
9
0039
:
003A
;
003B
<
003C
=
003D
>
003E
?
003F
4_
64
@
0040
A
0041
B
0042
C
0043
D
0044
E
0045
F
0046
G
0047
H
0048
I
0049
J
004A
K
004B
L
004C
M
004D
N
004E
O
004F
5_
80
P
0050
Q
0051
R
0052
S
0053
T
0054
U
0055
V
0056
W
0057
X
0058
Y
0059
Z
005A
[
005B
\
005C
]
005D
^
005E
_
005F
6_
96
`
0060
a
0061
b
0062
c
0063
d
0064
e
0065
f
0066
g
0067
h
0068
i
0069
j
006A
k
006B
l
006C
m
006D
n
006E
o
006F
7_
112
p
0070
q
0071
r
0072
s
0073
t
0074
u
0075
v
0076
w
0077
x
0078
y
0079
z
007A
{
007B
|
007C
}
007D
~
007E
8_
128
9_
144
A_
160
NBSP
00A0

0E01

0E02

0E03

0E04

0E05

0E06

0E07

0E08

0E09

0E0A

0E0B

0E0C

0E0D

0E0E

0E0F
B_
176

0E10

0E11

0E12

0E13

0E14

0E15

0E16

0E17

0E18

0E19

0E1A

0E1B

0E1C

0E1D

0E1E

0E1F
C_
192

0E20

0E21

0E22

0E23

0E24

0E25

0E26

0E27

0E28

0E29

0E2A

0E2B

0E2C

0E2D

0E2E

0E2F
D_
208

0E30
◌ั
0E31

0E32

0E33
◌ิ
0E34
◌ี
0E35
◌ึ
0E36
◌ื
0E37
◌ุ
0E38
◌ู
0E39
◌ฺ
0E3A
฿
0E3F
E_
224

0E40

0E41

0E42

0E43

0E44

0E45

0E46
◌็
0E47
◌่
0E48
◌้
0E49
◌๊
0E4A
◌๋
0E4B
◌์
0E4C
◌ํ
0E4D
◌๎
0E4E

0E4F
F_
240

0E50

0E51

0E52

0E53

0E54

0E55

0E56

0E57

0E58

0E59

0E5A

0E5B

Code values D1, D4-DA, E7-EE are for combining characters.

Code page 874 (IBM)[edit]

Code page 874, which is also known as CP874 and IBM874, differs from ISO/IEC 8859-11 only eight symbols shown boxed in the following table:

Code page 874 (difference only)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F[4]
D_
208

0E30
◌ั
0E31

0E32

0E33
◌ิ
0E34
◌ี
0E35
◌ึ
0E36
◌ื
0E37
◌ุ
0E38
◌ู
0E39
◌ฺ
0E3A
◌้︀
0E49 FE00
◌๊︀
0E4A FE00
◌๋︀
0E4B FE00
◌์︀
0E4C FE00
฿
0E3F
E_
224

0E40

0E41

0E42

0E43

0E44

0E45

0E46
◌็
0E47
◌่
0E48
◌้
0E49
◌๊
0E4A
◌๋
0E4B
◌์
0E4C
◌ํ
0E4D
◌๎
0E4E

0E4F
F_
240

0E50

0E51

0E52

0E53

0E54

0E55

0E56

0E57

0E58

0E59

0E5A

0E5B
¢
00A2
¬
00AC
¦
00A6
RSP
00A0

Code page 1161[edit]

Code page 1161, which is also known as CP1161 and IBM1161, is a variant of Code page 874. The only difference is the euro sign (€) in position DEhex (222).[5]

Code page 1162 (IBM) / 874 (Microsoft)[edit]

Code page 1162, which is also known as windows-874, CP1162, IBM1162, MS874, x-windows-874, and x-IBM874, used by Microsoft Windows differs from ISO/IEC 8859-11 by only nine symbols as shown in the following table:

Code page 1162 (Code page 874 in Microsoft Windows) (difference only)[6][7]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
8_
128

20AC
       
2026
                   
9_
144
 
2018

2019

201C

201D

2022

2013

2014
               

See also[edit]

References[edit]

  1. ^ "IANA Character Sets".
  2. ^ "js-codepage, Getting codepages".
  3. ^ Everson, Michael. "Proposed ISO 8859-11".
  4. ^ (PDF) ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP00874.pdf. Missing or empty |title= (help)
  5. ^ (PDF) ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP01161.pdf. Missing or empty |title= (help)
  6. ^ (PDF) ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP01162.pdf. Missing or empty |title= (help)
  7. ^ Windows-874, MSDN.

External links[edit]