Knowledge

Code point

Source 📝

2048: 2037: 413:
By the early 1980s, the software industry was starting to recognise the need for a solution to the problems involved with using multiple character encoding standards. Some particularly innovative work was begun at Xerox. The Xerox Star workstation used a multi-byte encoding that allowed it to support
195:
In Unicode, code points are part of Unicode's solution to a difficult conundrum faced by character encoding developers in the 1980s. If they added more bits per character to accommodate larger character sets, that design decision would also constitute an unacceptable waste of then-scarce computing
375:
On a computer, abstract characters are encoded internally as numbers. To create a complete character encoding, it is necessary to define the list of all characters to be encoded and to establish systematic rules for how the numbers represent the characters. The range of integers used to code the
52:
Code points are used in a multitude of formal information processing and telecommunication standards. For example ITU-T Recommendation T.35 contains a set of country codes for telecommunications equipment (originally fax machines) which allow equipment to indicate its country of manufacture or
200:
users (who constituted the vast majority of computer users at the time), since those extra bits would always be zeroed out for such users. The code point avoids this problem by breaking the old idea of a direct one-to-one correspondence between characters and particular sequences of bits.
376:
abstract characters is called the codespace. A particular integer in this set is called a code point. When an abstract character is mapped or assigned to a particular code point in the codespace, it is then referred to as an encodedcharacter.
42:, where the position has been assigned a meaning. The table may be one dimensional (a column), two dimensional (like cells in a spreadsheet), three dimensional (sheets in a workbook), etc... in any number of dimensions. 176:
character is not a graphical glyph but a unit of textual data. However, code points may also be left reserved for future assignment (most of the Unicode code space is unassigned), or given other designated functions.
126:(the basic multilingual plane, and 16 supplementary planes), each with 65,536 (= 2) code points. Thus the total size of the Unicode code space is 17 × 65,536 = 1,114,112. 180:
The distinction between a code point and the corresponding abstract character is not pronounced in Unicode but is evident for many other encoding schemes, where numerous
45:
Technically, a code point is a unique position in a quantized n-dimensional space, where the position has been assigned a semantic meaning. The table has
940: 595: 427: 882: 389: 2005: 495: 1990: 348: 283: 2010: 800: 785: 192:
The concept of a code point dates to the earliest standards for digital information processing and digital telecommunications.
245: 1029: 708: 1024: 558: 857: 488: 77:, or formatting. The set of all possible code points within a given encoding/character set make up that encoding's 862: 777: 678: 165: 46: 1339: 1058: 1159: 973: 957: 920: 767: 703: 523: 1898: 1768: 1083: 693: 53:
operation. In T.35, Argentina is represented by the code point 0x07, Canada by 0x20, Gambia by 0x41, etc.
2077: 2017: 1134: 945: 745: 481: 225: 1697: 1184: 1019: 1014: 663: 568: 1702: 1078: 397: 1622: 899: 608: 713: 1499: 2052: 1953: 1873: 1309: 1214: 563: 551: 161: 160:
encoding, different code points are encoded as sequences from one to four bytes long, forming a
1642: 1389: 1244: 1179: 894: 762: 668: 627: 220: 1838: 1592: 1587: 1464: 877: 642: 270:"T.35 : Procedure for the allocation of ITU-T defined codes for non-standard facilities" 215: 169: 66: 1544: 1823: 1737: 1657: 1379: 1344: 1224: 246:
https://www.etsi.org/deliver/etsi_ts/101700_101799/101773/01.02.01_60/ts_101773v010201p.pdf
73:—usually a letter, digit, punctuation mark, or whitespace—but sometimes represent symbols, 359: 294: 8: 1868: 1778: 1722: 1088: 1068: 867: 852: 757: 683: 673: 210: 1863: 310:
Format: Invisible but affects neighboring characters; includes line/paragraph separators
1969: 1888: 1858: 1828: 1808: 1444: 1424: 1174: 740: 617: 613: 518: 432: 62: 1938: 1848: 1833: 1672: 1637: 1449: 1289: 1114: 925: 637: 578: 149: 74: 1364: 2041: 2000: 1893: 1843: 1732: 1692: 1617: 1607: 1597: 1469: 1454: 1359: 1334: 1209: 1189: 1049: 935: 688: 647: 437: 39: 1995: 1948: 1933: 1793: 1758: 1753: 1687: 1677: 1627: 1494: 1484: 1479: 1429: 1399: 1269: 1259: 1219: 1119: 1034: 1009: 698: 603: 573: 269: 123: 1204: 257: 1903: 1883: 1803: 1783: 1773: 1682: 1529: 1459: 1434: 1414: 1369: 1354: 1329: 1279: 1254: 1199: 1154: 99: 469:
Codepoints.net, a site dedicated to all things characters, letters and Unicode
2071: 1923: 1908: 1788: 1727: 1612: 1554: 1549: 1514: 1489: 1439: 1324: 1314: 1164: 1109: 952: 750: 546: 153: 1918: 1652: 1647: 1602: 1504: 1419: 1409: 1319: 1294: 1234: 1149: 1129: 1124: 1104: 983: 930: 197: 528: 1974: 1813: 1798: 1712: 1582: 1559: 1524: 1394: 1374: 1349: 1169: 632: 622: 323: 90: 904: 1853: 1304: 1194: 830: 20: 1717: 1632: 1564: 1299: 1073: 993: 988: 889: 872: 181: 136: 1943: 1913: 1763: 1748: 1743: 1534: 1284: 1264: 1229: 1139: 978: 795: 70: 1384: 65:, where a code point is a numerical value that maps to a specific 1667: 1662: 1539: 1474: 1404: 1144: 504: 111: 16:
Numerical value representing a character in a coded character set
49:(whole) and positive positions (1, 2, 3, 4, but not fractions). 1928: 1707: 1519: 1509: 1239: 825: 820: 790: 414:
a single character set with potentially millions of characters.
69:. In character encoding code points usually represent a single 2022: 1878: 1818: 1274: 1249: 815: 810: 805: 157: 141: 85: 428:"Unicode Technical Standard #10 UNICODE COLLATION ALGORITHM" 168:
for details. Code points are normally assigned to abstract
145: 473: 468: 349:"The Unicode® Standard Version 11.0 – Core Specification" 284:"The Unicode® Standard Version 11.0 – Core Specification" 134:
For Unicode, the particular sequence of bits is called a
425: 122:. The Unicode code space is divided into seventeen 1048: 2069: 114:comprises 1,114,112 code points in the range 0 489: 258:https://datatracker.ietf.org/doc/html/rfc4190 84:For example, the character encoding scheme 1991:Cultural, political, and religious symbols 496: 482: 426:Mark Davis; Ken Whistler (23 March 2001). 358:. 30 June 2018. p. 22. Archived from 293:. 30 June 2018. p. 23. Archived from 387: 144:encoding, any code point is encoded as 4- 102:comprises 256 code points in the range 0 88:comprises 128 code points in the range 0 56: 524:ISO/IEC 10646 (Universal Character Set) 2070: 1047: 477: 394:NRSI: Computers & Writing Systems 1025:International Components for Unicode 974:Common Locale Data Repository (CLDR) 321: 184:may exist for a single code space. 13: 2006:Mathematical operators and symbols 14: 2089: 462: 388:Constable, Peter (13 June 2001). 238: 61:Code points are commonly used in 2047: 2046: 2036: 2035: 2018:Phonetic symbols (including IPA) 166:comparison of Unicode encodings 419: 381: 341: 315: 276: 262: 250: 38:is a particular position in a 1: 958:International Ideographs Core 768:International Ideographs Core 709:Alias names and abbreviations 244:ETSI TS 101 773 (section 4), 231: 129: 1180:CJK Unified Ideographs (Han) 1030:People involved with Unicode 390:"Understanding Unicode™ - I" 7: 503: 324:"Glossary of Unicode Terms" 226:Unicode collation algorithm 204: 10: 2094: 1020:Ideographic Research Group 1015:ConScript Unicode Registry 187: 18: 2031: 1983: 1962: 1573: 1097: 1057: 1043: 1002: 966: 913: 900:Regional indicator symbol 843: 776: 733: 726: 656: 609:Combining grapheme joiner 594: 587: 537: 511: 2053:Category: Unicode blocks 858:Compatibility characters 19:Not to be confused with 778:Comparison of encodings 704:Halfwidth and fullwidth 559:Universal Character Set 453:6.2 Large Weight Values 162:self-synchronizing code 1703:Inscriptional Parthian 1390:Nyiakeng Puachue Hmong 1052:and symbols in Unicode 669:CJK Unified Ideographs 221:Text-based (computing) 1839:Old Persian cuneiform 1698:Inscriptional Pahlavi 1593:Ancient North Arabian 1588:Anatolian hieroglyphs 878:Precomposed character 714:Whitespace characters 643:Zero-width non-joiner 256:RFC4190 (section 1), 216:Replacement character 57:In character encoding 1658:Egyptian hieroglyphs 863:Duplicate characters 679:Duplicate characters 403:on 16 September 2010 365:on 19 September 2018 300:on 19 September 2018 1723:Khitan small script 1160:Canadian Aboriginal 895:Variation sequences 853:Combining character 763:Variation sequences 674:Combining character 211:Combining character 2078:Character encoding 1963:Notational scripts 1914:Tagalog (Baybayin) 1623:Caucasian Albanian 946:numeric references 921:Domain names (IDN) 741:Bidirectional text 618:Right-to-left mark 614:Left-to-right mark 569:Character property 519:Unicode Consortium 433:Unicode Consortium 356:Unicode Consortium 291:Unicode Consortium 75:control characters 63:character encoding 2065: 2064: 2061: 2060: 2042:Category: Unicode 1079:Punctuation marks 1061:inherited scripts 967:Related standards 941:entity references 839: 838: 722: 721: 638:Zero-width joiner 443:on 25 August 2001 2085: 2050: 2049: 2039: 2038: 2001:Control Pictures 1954:Zanabazar Square 1693:Imperial Aramaic 1576:historic scripts 1045: 1044: 905:Emoji skin color 731: 730: 648:Zero-width space 592: 591: 579:Private Use Area 564:Character charts 498: 491: 484: 475: 474: 456: 455: 450: 448: 442: 436:. Archived from 423: 417: 416: 410: 408: 402: 396:. Archived from 385: 379: 378: 372: 370: 364: 353: 345: 339: 338: 336: 334: 319: 313: 312: 307: 305: 299: 288: 280: 274: 273: 266: 260: 254: 248: 242: 2093: 2092: 2088: 2087: 2086: 2084: 2083: 2082: 2068: 2067: 2066: 2057: 2027: 2011:List by subject 1984:Symbols, emojis 1979: 1958: 1874:Psalter Pahlavi 1575: 1569: 1430:Pracalit (Newa) 1245:Hanifi Rohingya 1093: 1069:Combining marks 1060: 1053: 1039: 1035:Han unification 998: 962: 909: 845: 835: 772: 718: 652: 596:Special purpose 583: 533: 507: 502: 465: 460: 459: 446: 444: 440: 424: 420: 406: 404: 400: 386: 382: 368: 366: 362: 351: 347: 346: 342: 332: 330: 320: 316: 303: 301: 297: 286: 282: 281: 277: 268: 267: 263: 255: 251: 243: 239: 234: 207: 190: 156:, while in the 132: 121: 117: 109: 105: 97: 93: 59: 24: 17: 12: 11: 5: 2091: 2081: 2080: 2063: 2062: 2059: 2058: 2056: 2055: 2044: 2032: 2029: 2028: 2026: 2025: 2020: 2015: 2014: 2013: 2003: 1998: 1993: 1987: 1985: 1981: 1980: 1978: 1977: 1972: 1966: 1964: 1960: 1959: 1957: 1956: 1951: 1946: 1941: 1936: 1931: 1926: 1921: 1916: 1911: 1906: 1901: 1896: 1891: 1886: 1881: 1876: 1871: 1866: 1861: 1856: 1851: 1846: 1841: 1836: 1831: 1826: 1821: 1816: 1811: 1806: 1801: 1796: 1791: 1786: 1781: 1776: 1771: 1766: 1761: 1756: 1751: 1746: 1741: 1735: 1730: 1725: 1720: 1715: 1710: 1705: 1700: 1695: 1690: 1685: 1680: 1675: 1670: 1665: 1660: 1655: 1650: 1645: 1640: 1635: 1630: 1625: 1620: 1615: 1610: 1605: 1600: 1595: 1590: 1585: 1579: 1577: 1571: 1570: 1568: 1567: 1562: 1557: 1552: 1547: 1542: 1537: 1532: 1527: 1522: 1517: 1512: 1507: 1502: 1497: 1492: 1487: 1482: 1477: 1472: 1467: 1465:Sorang Sompeng 1462: 1457: 1452: 1447: 1442: 1437: 1432: 1427: 1422: 1417: 1412: 1407: 1402: 1397: 1392: 1387: 1382: 1377: 1372: 1367: 1362: 1357: 1355:Miao (Pollard) 1352: 1347: 1342: 1337: 1332: 1327: 1322: 1317: 1312: 1307: 1302: 1297: 1292: 1287: 1282: 1277: 1272: 1267: 1262: 1257: 1252: 1247: 1242: 1237: 1232: 1227: 1222: 1217: 1212: 1207: 1202: 1197: 1192: 1187: 1182: 1177: 1172: 1167: 1162: 1157: 1152: 1147: 1142: 1137: 1132: 1127: 1122: 1117: 1112: 1107: 1101: 1099: 1098:Modern scripts 1095: 1094: 1092: 1091: 1086: 1081: 1076: 1071: 1065: 1063: 1055: 1054: 1041: 1040: 1038: 1037: 1032: 1027: 1022: 1017: 1012: 1006: 1004: 1003:Related topics 1000: 999: 997: 996: 991: 986: 981: 976: 970: 968: 964: 963: 961: 960: 955: 950: 949: 948: 943: 933: 928: 923: 917: 915: 911: 910: 908: 907: 902: 897: 892: 887: 886: 885: 875: 870: 865: 860: 855: 849: 847: 841: 840: 837: 836: 834: 833: 828: 823: 818: 813: 808: 803: 798: 793: 788: 782: 780: 774: 773: 771: 770: 765: 760: 755: 754: 753: 743: 737: 735: 728: 724: 723: 720: 719: 717: 716: 711: 706: 701: 696: 691: 686: 681: 676: 671: 666: 660: 658: 654: 653: 651: 650: 645: 640: 635: 630: 625: 620: 611: 606: 600: 598: 589: 585: 584: 582: 581: 576: 571: 566: 561: 556: 555: 554: 543: 541: 535: 534: 532: 531: 526: 521: 515: 513: 509: 508: 501: 500: 493: 486: 478: 472: 471: 464: 463:External links 461: 458: 457: 418: 380: 340: 314: 275: 261: 249: 236: 235: 233: 230: 229: 228: 223: 218: 213: 206: 203: 196:resources for 189: 186: 154:binary numbers 131: 128: 119: 115: 107: 103: 100:Extended ASCII 95: 89: 58: 55: 15: 9: 6: 4: 3: 2: 2090: 2079: 2076: 2075: 2073: 2054: 2045: 2043: 2034: 2033: 2030: 2024: 2021: 2019: 2016: 2012: 2009: 2008: 2007: 2004: 2002: 1999: 1997: 1994: 1992: 1989: 1988: 1986: 1982: 1976: 1973: 1971: 1968: 1967: 1965: 1961: 1955: 1952: 1950: 1947: 1945: 1942: 1940: 1937: 1935: 1934:Tulu Tigalari 1932: 1930: 1927: 1925: 1922: 1920: 1917: 1915: 1912: 1910: 1909:Sylheti Nagri 1907: 1905: 1902: 1900: 1899:South Arabian 1897: 1895: 1892: 1890: 1887: 1885: 1882: 1880: 1877: 1875: 1872: 1870: 1867: 1865: 1862: 1860: 1857: 1855: 1852: 1850: 1847: 1845: 1842: 1840: 1837: 1835: 1832: 1830: 1827: 1825: 1824:Old Hungarian 1822: 1820: 1817: 1815: 1812: 1810: 1807: 1805: 1802: 1800: 1797: 1795: 1792: 1790: 1787: 1785: 1782: 1780: 1777: 1775: 1772: 1770: 1767: 1765: 1762: 1760: 1757: 1755: 1752: 1750: 1747: 1745: 1742: 1739: 1736: 1734: 1731: 1729: 1726: 1724: 1721: 1719: 1716: 1714: 1711: 1709: 1706: 1704: 1701: 1699: 1696: 1694: 1691: 1689: 1686: 1684: 1681: 1679: 1676: 1674: 1671: 1669: 1666: 1664: 1661: 1659: 1656: 1654: 1651: 1649: 1646: 1644: 1641: 1639: 1636: 1634: 1631: 1629: 1626: 1624: 1621: 1619: 1616: 1614: 1611: 1609: 1606: 1604: 1601: 1599: 1596: 1594: 1591: 1589: 1586: 1584: 1581: 1580: 1578: 1572: 1566: 1563: 1561: 1558: 1556: 1553: 1551: 1548: 1546: 1543: 1541: 1538: 1536: 1533: 1531: 1528: 1526: 1523: 1521: 1518: 1516: 1513: 1511: 1508: 1506: 1503: 1501: 1498: 1496: 1493: 1491: 1488: 1486: 1483: 1481: 1478: 1476: 1473: 1471: 1468: 1466: 1463: 1461: 1458: 1456: 1453: 1451: 1448: 1446: 1443: 1441: 1438: 1436: 1433: 1431: 1428: 1426: 1423: 1421: 1418: 1416: 1413: 1411: 1408: 1406: 1403: 1401: 1398: 1396: 1393: 1391: 1388: 1386: 1383: 1381: 1378: 1376: 1373: 1371: 1368: 1366: 1363: 1361: 1358: 1356: 1353: 1351: 1348: 1346: 1345:Mende Kikakui 1343: 1341: 1340:Masaram Gondi 1338: 1336: 1333: 1331: 1328: 1326: 1325:Lisu (Fraser) 1323: 1321: 1318: 1316: 1313: 1311: 1308: 1306: 1303: 1301: 1298: 1296: 1293: 1291: 1288: 1286: 1283: 1281: 1278: 1276: 1273: 1271: 1268: 1266: 1263: 1261: 1258: 1256: 1253: 1251: 1248: 1246: 1243: 1241: 1238: 1236: 1233: 1231: 1228: 1226: 1225:Gunjala Gondi 1223: 1221: 1218: 1216: 1213: 1211: 1208: 1206: 1203: 1201: 1198: 1196: 1193: 1191: 1188: 1186: 1183: 1181: 1178: 1176: 1173: 1171: 1168: 1166: 1163: 1161: 1158: 1156: 1153: 1151: 1148: 1146: 1143: 1141: 1138: 1136: 1133: 1131: 1128: 1126: 1123: 1121: 1118: 1116: 1113: 1111: 1108: 1106: 1103: 1102: 1100: 1096: 1090: 1087: 1085: 1082: 1080: 1077: 1075: 1072: 1070: 1067: 1066: 1064: 1062: 1056: 1051: 1046: 1042: 1036: 1033: 1031: 1028: 1026: 1023: 1021: 1018: 1016: 1013: 1011: 1008: 1007: 1005: 1001: 995: 992: 990: 987: 985: 982: 980: 977: 975: 972: 971: 969: 965: 959: 956: 954: 951: 947: 944: 942: 939: 938: 937: 934: 932: 929: 927: 924: 922: 919: 918: 916: 912: 906: 903: 901: 898: 896: 893: 891: 888: 884: 881: 880: 879: 876: 874: 871: 869: 866: 864: 861: 859: 856: 854: 851: 850: 848: 842: 832: 829: 827: 824: 822: 819: 817: 814: 812: 809: 807: 804: 802: 799: 797: 794: 792: 789: 787: 784: 783: 781: 779: 775: 769: 766: 764: 761: 759: 756: 752: 751:ISO/IEC 14651 749: 748: 747: 744: 742: 739: 738: 736: 732: 729: 725: 715: 712: 710: 707: 705: 702: 700: 697: 695: 692: 690: 687: 685: 682: 680: 677: 675: 672: 670: 667: 665: 662: 661: 659: 655: 649: 646: 644: 641: 639: 636: 634: 631: 629: 626: 624: 621: 619: 615: 612: 610: 607: 605: 602: 601: 599: 597: 593: 590: 586: 580: 577: 575: 572: 570: 567: 565: 562: 560: 557: 553: 550: 549: 548: 545: 544: 542: 540: 536: 530: 527: 525: 522: 520: 517: 516: 514: 510: 506: 499: 494: 492: 487: 485: 480: 479: 476: 470: 467: 466: 454: 439: 435: 434: 429: 422: 415: 399: 395: 391: 384: 377: 361: 357: 350: 344: 329: 325: 318: 311: 296: 292: 285: 279: 271: 265: 259: 253: 247: 241: 237: 227: 224: 222: 219: 217: 214: 212: 209: 208: 202: 199: 193: 185: 183: 178: 175: 171: 167: 163: 159: 155: 151: 147: 143: 139: 138: 127: 125: 113: 101: 92: 87: 82: 80: 76: 72: 68: 64: 54: 50: 48: 43: 41: 37: 36:code position 33: 29: 22: 1789:Meetei Mayek 1740:(Chorasmian) 1643:Cypro-Minoan 1420:Pahawh Hmong 1235:Gurung Khema 984:ISO/IEC 8859 826:UTF-32/UCS-4 821:UTF-16/UCS-2 628:Variant form 538: 452: 445:. Retrieved 438:the original 431: 421: 412: 405:. Retrieved 398:the original 393: 383: 374: 367:. Retrieved 360:the original 355: 343: 331:. Retrieved 327: 317: 309: 302:. Retrieved 295:the original 290: 278: 264: 252: 240: 198:Latin script 194: 191: 179: 173: 135: 133: 83: 78: 60: 51: 44: 35: 31: 27: 25: 1975:SignWriting 1844:Old Sogdian 1814:Nandinagari 1738:Khwarezmian 1648:Dives Akuru 1574:Ancient and 1560:Warang Citi 1425:Pau Cin Hau 1380:New Tai Lue 1375:Nag Mundari 1350:Medefaidrin 1059:Common and 868:Equivalence 846:code points 844:On pairs of 758:Equivalence 633:Word joiner 623:Soft hyphen 539:Code points 447:25 December 407:25 December 369:25 December 328:unicode.org 304:25 December 1869:Phoenician 1854:Old Uyghur 1849:Old Turkic 1834:Old Permic 1829:Old Italic 1779:Manichaean 1673:Glagolitic 1450:Saurashtra 1195:Devanagari 1074:Diacritics 831:UTF-EBCDIC 734:Algorithms 727:Processing 664:Characters 588:Characters 232:References 182:code pages 170:characters 140:– for the 130:In Unicode 28:code point 21:point code 1864:ʼPhags-pa 1859:Palmyrene 1809:Nabataean 1733:Khudawadi 1718:Kharosthi 1633:Cuneiform 1608:Bhaiksuki 1603:Bassa Vah 1470:Sundanese 1445:Samaritan 1360:Mongolian 1335:Malayalam 1300:Kirat Rai 1010:Anomalies 994:ISO 15924 989:DIN 91379 890:Z-variant 873:Homoglyph 746:Collation 322:Unicode. 137:code unit 118:to 10FFFF 79:codespace 67:character 32:codepoint 2072:Category 1996:Currency 1970:Duployan 1944:Vithkuqi 1939:Ugaritic 1794:Meroitic 1764:Mahajani 1749:Linear B 1744:Linear A 1535:Tifinagh 1500:Tai Viet 1495:Tai Tham 1485:Tagbanwa 1400:Ol Chiki 1290:Kayah Li 1285:Katakana 1270:Javanese 1265:Hiragana 1255:Hanunuoo 1230:Gurmukhi 1220:Gujarati 1210:Georgian 1185:Cyrillic 1175:Cherokee 1140:Bopomofo 1120:Balinese 1115:Armenian 979:GB 18030 796:Punycode 684:Numerals 616: / 529:Versions 333:20 March 205:See also 174:abstract 71:grapheme 47:discrete 1904:Soyombo 1894:Sogdian 1889:Siddham 1884:Sharada 1804:Multani 1784:Marchen 1774:Mandaic 1769:Makasar 1683:Grantha 1668:Elymaic 1663:Elbasan 1638:Cypriot 1598:Avestan 1540:Tirhuta 1530:Tibetan 1475:Sunuwar 1460:Sinhala 1455:Shavian 1435:Ranjana 1415:Osmanya 1405:Ol Onal 1330:Lontara 1280:Kannada 1190:Deseret 1155:Burmese 1145:Braille 1135:Bengali 1089:Numbers 1050:Scripts 699:Symbols 689:Scripts 512:Unicode 505:Unicode 188:History 112:Unicode 2051:  2040:  1949:Yezidi 1929:Todhri 1924:Tangut 1759:Lydian 1754:Lycian 1728:Khojki 1708:Kaithi 1688:Hatran 1678:Gothic 1628:Coptic 1618:Carian 1613:Brāhmī 1555:Wancho 1520:Thaana 1515:Telugu 1510:Tangsa 1490:Tai Le 1480:Syriac 1440:Rejang 1315:Lepcha 1260:Hebrew 1240:Hangul 1165:Chakma 1110:Arabic 1084:Spaces 791:CESU-8 786:BOCU-1 694:Spaces 441:(html) 401:(html) 164:. See 124:planes 110:, and 2023:Emoji 1919:Takri 1879:Runic 1819:Ogham 1653:Dogra 1505:Tamil 1410:Osage 1385:Nüshu 1320:Limbu 1310:Latin 1295:Khmer 1275:Kanji 1250:Hanja 1215:Greek 1205:Geʽez 1200:Garay 1150:Buhid 1130:Batak 1125:Bamum 1105:Adlam 953:Input 931:Fonts 926:Email 914:Usage 816:UTF-8 811:UTF-7 806:UTF-1 657:Lists 574:Plane 547:Block 363:(PDF) 352:(PDF) 298:(PDF) 287:(PDF) 172:. An 158:UTF-8 150:octet 142:UCS-4 106:to FF 94:to 7F 86:ASCII 40:table 1799:Modi 1713:Kawi 1583:Ahom 1545:Toto 1525:Thai 1395:Odia 1370:N'Ko 1170:Cham 936:HTML 883:list 801:SCSU 552:List 449:2018 409:2018 371:2018 335:2023 306:2018 146:byte 1550:Vai 1365:Mru 1305:Lao 604:BOM 120:hex 116:hex 108:hex 104:hex 96:hex 91:hex 34:or 2074:: 1565:Yi 451:. 430:. 411:. 392:. 373:. 354:. 326:. 308:. 289:. 152:) 98:, 81:. 30:, 26:A 497:e 490:t 483:v 337:. 272:. 148:( 23:.

Index

point code
table
discrete
character encoding
character
grapheme
control characters
ASCII
hex
Extended ASCII
Unicode
planes
code unit
UCS-4
byte
octet
binary numbers
UTF-8
self-synchronizing code
comparison of Unicode encodings
characters
code pages
Latin script
Combining character
Replacement character
Text-based (computing)
Unicode collation algorithm
https://www.etsi.org/deliver/etsi_ts/101700_101799/101773/01.02.01_60/ts_101773v010201p.pdf
https://datatracker.ietf.org/doc/html/rfc4190
"T.35 : Procedure for the allocation of ITU-T defined codes for non-standard facilities"

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.