Binary Ordered Compression for Unicode

Binary Ordered Compression for Unicode (BOCU) is a MIME-compatible Unicode compression scheme. BOCU-1 combines the wide applicability of UTF-8 with the compactness of the Standard Compression Scheme for Unicode (SCSU). This Unicode encoding is designed to be useful for compressing short strings, and it maintains code point order. BOCU-1 is specified in a Unicode Technical Note.

For comparison, SCSU was adopted as a standard Unicode compression scheme with a byte/code point ratio similar to language-specific code pages. SCSU has not been widely adopted, as it is not suitable for MIME "text" media types. For example, SCSU cannot be used directly in emails and similar protocols. SCSU also requires a complicated encoder design for good performance. For larger amounts of Unicode text, bzip2 and other industry standard algorithms usually compact the data more efficiently.

Both SCSU and BOCU-1 are IANA registered charsets.

Details

All numbers in this section are hexadecimal, and all ranges are inclusive.

Code points from U+0000 through U+0020 are encoded in BOCU-1 as the corresponding byte value. All other code points (that is, U+0021 through U+D7FF and U+E000 through U+10FFFF) are encoded as a difference between the code point and a normalized version of the most recently encoded code point that was not an ASCII space (U+0020). The initial state is U+0040. The normalization mapping is as follows:

| Code range | Normalized code point | Notes |
|---|---|---|
| U+3040 to U+309F | U+3070 | Hiragana |
| U+4E00 to U+9FA5 | U+7711 | Unihan |
| U+AC00 to U+D7A3 | U+C1D1 | Hangul |
| U+0020 | encoder state kept as is | Space |
| U+hhhh00 to U+hhhh7F (excluding ranges above) | U+hhhh40 | middle of 128 |
| U+hhhh80 to U+hhhhFF (excluding ranges above) | U+hhhhC0 | middle of 128 |

The difference between the current code point and the normalized previous code point is encoded as follows:

| Difference range | Byte sequence range (see below) |
|---|---|
| -10FF9F to -2DD0D | 21 F0 58 D9 to 21 FF FF FF |
| -2DD0C to -2912 | 22 01 01 to 24 FF FF |
| -2911 to -41 | 25 01 to 4F FF |
| -40 to 3F | 50 to CF |
| 40 to 2910 | D0 01 to FA FF |
| 2911 to 2DD0B | FB 01 01 to FD FF FF |
| 2DD0C to 10FFBF | FE 01 01 01 to FE 19 B4 54 |

Each byte range is lexicographically ordered with the following thirteen byte values excluded: 00 07 08 09 0A 0B 0C 0D 0E 0F 1A 1B 20. For example, the byte sequence FC 06 FF, coding for a difference of 1156B, is immediately followed by the byte sequence FC 10 01, coding for a difference of 1156C.

Any ASCII input from U+0000 to U+007F, excluding space U+0020, resets the encoder to U+0040. Because these values include the line end code points U+000D and U+000A, which are encoded as is (0D 0A), the encoder is in a known state at the beginning of each line. The corruption of a single byte therefore affects at most one line. For comparison, the corruption of a single byte in UTF-8 affects at most one code point, while for SCSU it can affect the entire document.

BOCU-1 offers similar robustness for input texts that do not contain the above-mentioned values by means of the special reset code FF. When a decoder finds this octet, it resets its state to U+0040, as for a line end. The use of FF reset bytes is not recommended in the BOCU-1 specification, because it conflicts with other BOCU-1 design goals, notably the binary order.

The optional use of a signature U+FEFF at the beginning of BOCU-1 encoded texts, i.e. the BOCU-1 byte sequence FB EE 28, changes the initial state U+0040 to U+FEC0. In other words, the signature cannot simply be stripped as in most other Unicode encoding schemes. Adding a reset byte after the signature (FB EE 28 FF) could avoid this effect, but the BOCU-1 specification does not recommend this practice.

In theory, UTF-1 and UTF-8 could encode the original UCS-4 set with 31 bits up to 7FFFFFFF. BOCU-1 and UTF-16 can encode the modern Unicode set from U+0000 to U+10FFFF. Excluding the thirteen protected code points encoded as single octets, BOCU-1 can use 256 − 13 = 243 octets in multi-byte encodings. BOCU-1 needs at most four bytes, consisting of a lead byte and one to three trail bytes. The trail bytes encode a remaining "modulo 243" (base 243) difference; the lead byte determines the number of trail bytes and an initial difference. Note that the reset byte FF is not protected and can occur as a trail byte.

Patent

Prior to 16 November 2022, the general BOCU algorithm was covered by United States Patent #6,737,994, which also mentions the specific BOCU-1 implementation. This patent has now expired. IBM, which employed both of the inventors of BOCU-1 at the time it was created, stated in the Unicode Technical Note that implementers of a "fully compliant version of BOCU-1" had to contact IBM to request a royalty-free license. BOCU-1 is the only Unicode compression scheme described on the Unicode Web site that is known to have been encumbered with intellectual property restrictions.

By contrast, IBM also filed for a patent on UTF-EBCDIC, but it chose in that case to make the documentation and encoding scheme "freely available to anyone concerned towards making the transformation format as part of the UCS standards", instead of requiring implementers to request a license.

References

1. Markus Scherer, Mark Davis (2006-02-04). "UTN #6: BOCU-1". Retrieved 2008-05-18.
2. Ewell, Doug (2004-01-30). "UTN #14: A survey of Unicode compression" (PDF). Retrieved 2008-06-13.
3. IANA registration record for SCSU
4. IANA registration record for BOCU-1
5. Davis; et al. (2004-05-18). "United States Patent #6,737,994, "Binary-ordered compression for unicode"". Retrieved 2022-12-28.
6. Markus Scherer, Mark Davis (2006-02-04). "UTN #6: BOCU-1". Retrieved 2014-02-05.
7. V.S. Umamaheswaran (2002-04-16). "UTR #16: UTF-EBCDIC". Retrieved 2008-11-16.

See also

UTF-1 contains a comparison of the UTF-1, UTF-8, and BOCU-1 designs
International Components for Unicode: A library that can convert between BOCU-1 and other Unicode encodings
3955:1057 3950:1056 3945:1055 3940:1054 3935:1053 3930:1052 3804:DKOI 3760:1270 3755:1258 3750:1257 3745:1256 3740:1255 3735:1254 3730:1253 3725:1252 3720:1251 3715:1250 3705:1169 3662:1133 3657:1124 3652:1046 3647:1019 3642:1018 3637:1017 3632:1016 3627:1015 3622:1014 3617:1013 3612:1012 3607:1010 3602:1009 3597:1008 3592:1006 3499:3846 3494:1127 3489:1118 3484:1117 3479:1116 3474:1115 3469:1098 3464:1044 3459:1043 3454:1042 3449:1040 3444:1034 3208:Sámi 2894:Big5 2873:6862 2868:6438 2863:5428 2858:5427 2788:Sámi 2663:sets 2629:and 2247:Modi 2161:Kawi 2031:Ahom 1993:Toto 1973:Thai 1843:Odia 1818:N'Ko 1618:Cham 1384:HTML 1331:list 1249:SCSU 1000:List 705:0xFF 635:and 597:0xFF 589:0xFF 582:SCSU 565:and 448:2911 426:2910 273:hhhh 260:hhhh 252:hhhh 227:hhhh 219:hhhh 105:and 72:IANA 35:MIME 31:BOCU 4383:KOI 4300:OT1 4295:OMS 4290:OML 4285:LY1 4271:TeX 4084:MSX 4044:GEM 4000:CDC 3818:VTx 3814:DEC 3700:950 3694:GBK 3690:936 3685:932 3587:922 3582:921 3577:915 3572:912 3567:896 3562:895 3544:MIK 3439:951 3434:950 3429:949 3424:942 3419:936 3414:932 3409:904 3404:903 3399:899 3394:897 3389:869 3384:868 3379:867 3374:866 3369:865 3364:864 3359:863 3354:862 3349:861 3344:860 3339:859 3334:858 3329:857 3324:856 3319:855 3314:853 3309:852 3304:851 3299:850 3294:778 3289:777 3284:776 3279:775 3274:773 3269:770 3264:737 3259:720 3254:708 3249:668 3244:437 1998:Vai 1813:Mru 1753:Lao 1052:BOM 727:IBM 687:243 675:256 659:to 620:to 549:to 500:to 482:to 465:to 450:to 436:to 424:to 413:to 404:to 402:-40 390:to 380:-41 378:to 361:to 346:to 326:to 308:to 256:to 223:to 183:to 162:to 141:to 93:to 61:zip 4559:: 4348:HZ 2013:Yi 681:13 603:. 542:. 
511:54 508:B4 505:19 502:FE 498:01 495:01 492:01 489:FE 473:FF 470:FF 467:FD 463:01 460:01 457:FB 441:FF 438:FA 434:01 431:D0 422:40 415:CF 411:50 406:3F 395:FF 392:4F 388:01 385:25 369:FF 366:FF 363:24 359:01 356:01 353:22 337:FF 334:FF 331:FF 328:21 324:D9 321:58 318:F0 315:21 275:C0 271:U+ 262:FF 258:U+ 254:80 250:U+ 229:7F 225:U+ 221:00 217:U+ 63:, 4027:/ 3820:) 3696:) 3692:( 2633:/ 2539:e 2532:t 2525:v 945:e 938:t 931:v 891:. 870:. 845:. 799:. 775:. 684:= 572:( 29:( 23:.


Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.