Knowledge

Extended ASCII

Source 📝

2959: 20: 309: 108: 212:, and some computing. Early teleprinters were electromechanical, having no microprocessor and just enough electromechanical memory to function. They fully processed one character at a time, returning to an idle state immediately afterward; this meant that any control sequences had to be only one character long, and thus a large number of codes needed to be reserved for such controls. They were typewriter-derived 283:
fields, or packing 8 characters into 7 bytes.) This would allow ASCII to be used unchanged and provide 128 more characters. Many manufacturers devised 8-bit character sets consisting of ASCII plus up to 128 of the unused codes: encodings which covered all the more used Western European (and Latin American) languages, such as Danish, Dutch, French, German, Portuguese, Spanish, Swedish and more could be made.
295:(semi-readable resulting text, often users learned how to manually decode it). There were eventually attempts at cooperation or coordination by national and international standards bodies in the late 1990s, but manufacturer-proprietary sets remained the most popular by far, primarily because the international standards excluded characters popular in or peculiar to specific cultures. 271:"??<" and "??>" to represent "{" and "}". Languages with dissimilar basic alphabets could use transliteration, such as replacing all the Latin letters with the closest match Cyrillic letters (resulting in odd but somewhat readable text when English was printed in Cyrillic or vice versa). Schemes were also devised so that two letters could be overprinted (often with the 637:(complete nonsense). Because many Internet standards use ISO 8859-1, and because Microsoft Windows (using the code page 1252 superset of ISO 8859-1) is the dominant operating system for personal computers today, unannounced use of ISO 8859-1 is quite commonplace, and may generally be assumed unless there are indications otherwise. 263:. Modified variants of 7-bit ASCII appeared promptly, trading some lesser-used symbols for highly desired symbols or letters, such as replacing "#" with "£" on UK Teletypes, "\" with "¥" in Japan or "₩" in Korea, etc. At least 29 variant sets resulted. 12 code points were modified by at least one modified set, leaving only 243:, and far too small for universal use. Many more letters and symbols are desirable, useful, or required to directly represent letters of alphabets other than English, more kinds of punctuation and spacing, more mathematical operators and symbols (× ÷ ⋅ ≠ ≥ ≈ π etc.), some unique symbols used by some programming languages, 443:
and assigned numbers to both those they themselves invented as well as many invented and used by other manufacturers. Accordingly, character sets are very often indicated by their IBM code page number. In ASCII-compatible code pages, the lower 128 characters maintained their standard ASCII values,
632:
Because the full English alphabet and the most-used characters in English are included in the seven-bit code points of ASCII, which are common to all encodings (even most proprietary encodings), English-language text is less damaged by interpreting it with the wrong encoding, but text in other
282:
in the 1970s, it became obvious that computers and software could handle text that uses 256-character sets at almost no additional cost in programming, and no additional cost for storage. (Assuming that the unused 8th bit of each byte was not reused in some way, such as error checking, Boolean
227:
and one space), which include the English alphabet (uppercase and lowercase), digits, and 31 punctuation marks and symbols: all of the symbols on a standard US typewriter plus a few selected for programming tasks. Some popular peripherals only implemented a 64-printing-character subset:
617:
The meaning of each extended code point can be different in every encoding. In order to correctly interpret and display text data (sequences of characters) that includes extended codes, hardware and software that reads or receives the text must use the
452:, which included accented characters needed for French, German, and a few other European languages, as well as some graphical line-drawing characters. The larger character set made it possible to create documents in a combination of languages such as 286:
128 additional characters is still not enough to cover all purposes, all languages, or even all European languages, so the emergence of many proprietary and national ASCII-derived 8-bit character sets was inevitable. Translating between these sets
629:, asking the user, letting the user select or override, and/or defaulting to last selection. When text is transferred between computers that use different operating systems, software, and encodings, applying the wrong encoding can be commonplace. 254:
The biggest problem for computer users around the world was other alphabets. ASCII's English alphabet almost accommodates European languages, if accented letters are replaced by non-accented letters or two-character approximations such as
592:
Microsoft intended to use ISO 8859 standards in Windows, but soon replaced the unused C1 control characters with additional characters, making the proprietary Windows-1252 character set, which is sometimes mislabeled as
426:
characters (0x80 through 0xBF) that implemented low-resolution block graphics. (Each block-graphic character displayed as a 2x3 grid of pixels, with each block pixel effectively controlled by one of the lower 6 bits.)
577:
with the high-order bit 'set', are reserved by ISO for control use and unused for printable characters (they are also reserved in Unicode). This convention was almost universally ignored by other extended ASCII sets.
45:
character set, plus up to 128 additional characters. There is no formal definition of "extended ASCII", and even use of the term is sometimes criticized, because it can be mistakenly interpreted to mean that the
67:("ISO Latin 1") – which supports most Western European languages – is best known in the West. There are many other extended ASCII encodings (more than 220 DOS and Windows 625:
Software can use a fixed encoding selection, or it can select from a palette of encodings by defaulting, checking the computer's nation and language settings, reading a declaration in the text,
232:
could not transmit "a" through "z" or five less-common symbols ("`", "{", "|", "}", and "~"). and when they received such characters they instead printed "A" through "Z" (forced
2577: 609:, and letters missing from French and Finnish. This became the most-used extended ASCII in the world, and often is used on the web even when 8859-1 is specified. 546:(also called "ISO Latin 1") which contains characters sufficient for the most common Western European languages. Other standards in the 8859 group included 1422: 1259: 622:
extended ASCII encoding that applies to it. Applying the wrong encoding causes irrational substitution of many or all extended characters in the text.
2368: 216:, and could only print a fixed set of glyphs, which were cast into a metal type element or elements; this also encouraged a minimum set of glyphs. 275:
control between them) to produce accented letters. Users were not comfortable with any of these compromises and they were often poorly supported.
1367: 539: 382:
around 1978/1979 for use with their workstations, terminals and printers. This later evolved into the widely used regular 8-bit character sets
63:
was the first international standard to formalise a (limited) expansion of the ASCII character set: of the many language variants it encoded,
1442: 956: 90:, and supporting multiple extended ASCII character sets required software to be written in ways that made it much easier to support the 219:
Seven-bit ASCII improved over prior five- and six-bit codes. Of the 2=128 codes, 33 were used for controls, and 95 carefully selected
879:
When a browser detects ISO-8859-1 it normally defaults to Windows-1252, because Windows-1252 has 32 more international characters.
824: 2669: 2423: 899: 565:
One notable way in which the ISO standards differ from some vendor-specific extended ASCII is that the 32 character positions 80
75:("the other" major character code) likewise developed many extended variants (more than 186 EBCDIC codepages) over the decades. 2659: 714: 56:
standard to include more characters, or that the term identifies a single unambiguous encoding, neither of which is the case.
2408: 1362: 737: 268: 267:. Programming languages however had assigned meaning to many of the replaced characters, work-arounds were devised such as C 172: 47: 2542: 670: 144: 653: 798: 2936: 2447: 2250: 994: 151: 2492: 2108: 2103: 1606: 1437: 984: 949: 513: 348: 191: 1526: 2744: 2679: 2433: 2413: 326: 125: 158: 2497: 1090: 330: 129: 2611: 2582: 2232: 497: 2674: 2562: 2522: 942: 542:(ISO) published a set of standards for eight-bit ASCII extensions, ISO 8859. The most popular of these was 491: 140: 2916: 2527: 2457: 2443: 2428: 2332: 2245: 2217: 2183: 501: 759: 2890: 2835: 2756: 2537: 2193: 2188: 1541: 680: 2532: 2597: 2552: 2388: 1937: 1641: 1586: 1551: 820: 2112: 1621: 1601: 1596: 1536: 1531: 1040: 437:
and later produced variations for different languages and cultures. IBM called such character sets
2487: 239:
The ASCII character set is barely large enough for US English use and lacks many glyphs common in
2981: 2962: 2946: 2873: 2868: 2830: 2801: 2766: 2198: 1932: 1631: 1516: 641: 504:, which had fewer characters but more letter and diacritic combinations. It was supported by the 444:
and different pages (or sets of characters) could be made available in the upper 128 characters.
397: 319: 118: 291:) is complex (especially if a character is not in both sets); and was often not done, producing 2557: 2547: 2403: 2393: 1927: 1636: 1081: 1068: 1004: 2734: 2572: 2507: 2383: 1922: 1076: 1952: 165: 2895: 2567: 2327: 1947: 794:
Rationale for American National Standard for Information Systems - Programming Language - C
415: 87: 8: 2850: 2477: 1962: 1847: 1837: 1832: 220: 86:
which supports thousands of characters. However, extended ASCII remains important in the
2931: 2779: 2592: 2587: 2512: 1511: 1485: 1009: 965: 856: 379: 378:
started to add European characters to their extended 7-bit / 8-bit ASCII character set
365: 38: 2921: 2860: 2840: 2502: 2482: 2462: 2090: 1566: 1546: 1058: 891: 626: 574: 509: 487: 229: 706: 2878: 2452: 2418: 2128: 1957: 763: 453: 79: 788: 2926: 2845: 1576: 1571: 1561: 1506: 1191: 1181: 1176: 1171: 1166: 1161: 1156: 559: 457: 375: 2986: 2378: 2373: 2363: 2358: 2353: 2348: 2312: 2307: 2300: 2295: 2290: 2285: 2280: 2275: 2270: 2265: 2260: 2255: 2123: 2080: 2075: 2070: 2065: 2060: 2055: 2050: 2045: 2040: 2035: 2030: 2025: 2020: 2015: 2010: 1917: 1912: 1907: 1902: 1897: 1892: 1887: 1882: 1877: 1872: 1867: 1862: 1646: 1231: 1151: 1146: 1141: 1136: 1131: 1126: 1121: 1116: 1111: 979: 845: 598: 475: 465: 213: 52: 1626: 792: 2975: 2698: 2118: 2005: 2000: 1995: 1990: 1985: 1980: 1857: 1852: 1842: 1827: 1822: 1817: 1812: 1807: 1802: 1797: 1792: 1787: 1782: 1777: 1772: 1767: 1762: 1757: 1752: 1747: 1742: 1737: 1732: 1727: 1722: 1717: 1712: 1707: 1702: 1697: 1692: 1687: 1682: 1677: 1672: 1667: 1662: 1581: 1556: 1521: 1480: 1226: 469: 461: 449: 400: 2718: 2713: 2708: 2703: 2438: 2178: 2173: 2168: 2163: 2158: 2153: 2148: 2143: 2138: 2133: 1616: 1611: 1591: 1475: 1467: 1100: 675: 587: 551: 533: 483: 423: 369: 767: 361:
Various proprietary modifications and extensions of ASCII appeared on non-
1033: 1016: 288: 264: 240: 205: 930:
A short page on ASCII, with the OEM 8-bit chart and the ANSI 8-bit chart
2883: 2791: 2644: 2322: 1417: 1387: 1382: 1377: 1372: 1337: 1221: 1216: 1206: 1201: 999: 989: 929: 555: 547: 543: 521: 439: 387: 383: 333: in this section. Unsourced material may be challenged and removed. 209: 132: in this section. Unsourced material may be challenged and removed. 64: 19: 2824: 934: 870: 403:
added many graphic symbols to their non-standard ASCII (Respectively,
2771: 2749: 2654: 2467: 1496: 1427: 1407: 1402: 1327: 1322: 665: 606: 272: 707:"Re: Cygwin Termcap information involving extended ascii charicters" 308: 107: 2941: 2796: 2761: 2739: 2649: 2472: 1412: 1397: 1357: 1352: 1347: 1332: 1291: 1286: 1281: 1276: 1271: 1266: 1063: 1053: 1049: 1023: 634: 512:. This later became the basis for other character sets such as the 292: 248: 244: 233: 68: 60: 2811: 2607: 2517: 2398: 1972: 1342: 1317: 1307: 1045: 602: 517: 448:
computers built for the North American market, for example, used
408: 404: 83: 2816: 2806: 2784: 2664: 2639: 2634: 2317: 2208: 2098: 1457: 1447: 1432: 1249: 895: 479: 434: 419: 362: 72: 28: 652:, require the character encoding of content to be tagged with 2911: 2629: 2624: 2619: 2236: 1942: 1452: 1392: 1254: 1028: 925:
Roman Czyborra's Unicode and extended ASCII information pages
685: 505: 393: 224: 91: 42: 236:) and five other mostly-similar symbols ("@", "", and "^"). 2222: 1312: 649: 645: 279: 433:
introduced eight-bit extended ASCII codes on the original
2689: 445: 430: 278:
When computers and peripherals standardized on eight-bit
924: 846:"C1 Controls and Latin-1 Supplement | Range: 0080–00FF" 478:
introduced their own eight-bit extended ASCII codes in
704: 411:, based on the original ASCII standard of 1963). 2973: 2891:Unicode control, format and separator characters 16:Nickname for 8-bit ASCII-derived character sets 898:. 27 January 2015. sec. 5.2 Names and labels. 540:International Organization for Standardization 950: 757: 957: 943: 735: 612: 818: 550:for Eastern European languages using the 349:Learn how and when to remove this message 298: 192:Learn how and when to remove this message 738:"Print Extended ASCII Codes in sql*plus" 597:. The added characters included "curly" 464:), but not, for example, in English and 18: 964: 41:that include (most of) the original 96 2974: 601:and other typographical elements like 938: 884: 656:-assigned character set identifiers. 460:(though French computers usually use 48:American National Standards Institute 902:from the original on 4 February 2015 705:Benjamin Riefenstahl (26 Feb 2001). 671:Digraphs and trigraphs (programming) 331:adding citations to reliable sources 302: 204:ASCII was designed in the 1960s for 130:adding citations to reliable sources 101: 390:(as well as a number of variants). 13: 2301:Norwegian and Danish (alternative) 853:The Unicode Standard, Version 15.1 760:"vim: how to type extended-ascii?" 14: 2998: 918: 717:from the original on 11 July 2013 514:Lotus International Character Set 2958: 2957: 573:, which correspond to the ASCII 307: 251:, box-drawing characters, etc. 106: 2745:Digital encoding of APL symbols 2680:Comparison of Unicode encodings 1198:Proposed but not approved 827:from the original on 2017-07-29 801:from the original on 2018-09-29 758:Mark J. Reed (March 28, 2004). 581: 318:needs additional citations for 117:needs additional citations for 863: 838: 812: 781: 751: 729: 698: 372:, especially in universities. 1: 691: 498:Digital Equipment Corporation 789:"2.2.1.1 Trigraph sequences" 7: 2917:Character encodings in HTML 2251:National Replacement (NRCS) 2218:Japanese language in EBCDIC 821:"Graphic Tips & Tricks" 736:S. Wolicki (Mar 23, 2012). 659: 527: 502:Multinational Character Set 10: 3003: 681:List of Unicode characters 585: 531: 97: 94:encoding method later on. 2955: 2904: 2859: 2727: 2688: 2606: 2341: 2231: 2207: 2089: 1971: 1655: 1494: 1466: 1300: 1242: 1099: 972: 633:languages can display as 269:three-character sequences 2947:Variable-length encoding 2728:Miscellaneous code pages 1486:Extended Unix Code / EUC 1177:-15 (New Western Europe) 973:Early telecommunications 642:communications protocols 558:for languages using the 492:Postscript character set 2874:C0 and C1 control codes 819:Goldklang, Ira (2015). 613:Character set confusion 422:home computer added 64 50:(ANSI) had updated its 1122:-3 (Maltese/Esperanto) 1073:World System Teletext 299:Proprietary extensions 31: 23:Output of the program 2896:Whitespace characters 2573:Ventura International 871:"HTML Character Sets" 22: 2291:Norwegian and Danish 500:(DEC) developed the 490:also introduced the 416:TRS-80 character set 327:improve this article 265:82 "invariant" codes 221:printable characters 126:improve this article 88:history of computing 2851:Unified Hangul Code 2523:PostScript Standard 2246:Multinational (MCS) 1117:-2 (Central Europe) 1112:-1 (Western Europe) 966:Character encodings 644:, most importantly 366:mainframe computers 39:character encodings 37:is a repertoire of 2932:Hardware code page 2692:typesetting system 2528:PostScript Latin 1 2184:Cyrillic + Finnish 2091:Windows code pages 1973:IBM AIX code pages 1301:National standards 1232:Ukrainian Cyrillic 857:Unicode Consortium 627:analyzing the text 575:control characters 510:computer terminals 380:HP Roman Extension 32: 2969: 2968: 2922:Charset detection 2861:Control character 2543:Sharp calculators 2414:Casio calculators 2342:Platform specific 2194:Cyrillic + German 2189:Cyrillic + French 1607:Maltese/Esperanto 1243:Bibliographic use 1127:-4 (North Europe) 1059:T.51/ISO/IEC 6937 1017:Baudot and Murray 488:Apple LaserWriter 359: 358: 351: 230:Teletype Model 33 202: 201: 194: 176: 80:operating systems 59:The ISO standard 2994: 2961: 2960: 2453:DG International 2328:Special Graphics 2129:Extended Latin-8 1527:Central European 1517:Barents Cyrillic 1222:Barents Cyrillic 1192:-12 (Devanagari) 1188:Abandoned parts 959: 952: 945: 936: 935: 912: 911: 909: 907: 888: 882: 881: 867: 861: 860: 850: 842: 836: 835: 833: 832: 816: 810: 809: 807: 806: 785: 779: 778: 776: 774: 755: 749: 748: 746: 744: 733: 727: 726: 724: 722: 713:(Mailing list). 702: 468:(which required 354: 347: 343: 340: 334: 311: 303: 262: 258: 197: 190: 186: 183: 177: 175: 141:"Extended ASCII" 134: 110: 102: 55: 26: 3002: 3001: 2997: 2996: 2995: 2993: 2992: 2991: 2972: 2971: 2970: 2965: 2951: 2927:Han unification 2900: 2855: 2723: 2684: 2602: 2424:Compucolor 8001 2337: 2333:Technical (TCS) 2256:French Canadian 2227: 2203: 2199:Polytonic Greek 2085: 1967: 1651: 1637:Turkic Cyrillic 1552:Font X (Kermit) 1547:Farsi (Persian) 1499: 1490: 1462: 1296: 1238: 1108:Approved parts 1095: 968: 963: 921: 916: 915: 905: 903: 890: 889: 885: 869: 868: 864: 848: 844: 843: 839: 830: 828: 817: 813: 804: 802: 787: 786: 782: 772: 770: 756: 752: 742: 740: 734: 730: 720: 718: 703: 699: 694: 662: 615: 599:quotation marks 590: 584: 572: 568: 560:Cyrillic script 536: 530: 376:Hewlett-Packard 355: 344: 338: 335: 324: 312: 301: 260: 256: 214:impact printers 198: 187: 181: 178: 135: 133: 123: 111: 100: 51: 24: 17: 12: 11: 5: 3000: 2990: 2989: 2984: 2982:Character sets 2967: 2966: 2963:Character sets 2956: 2953: 2952: 2950: 2949: 2944: 2939: 2934: 2929: 2924: 2919: 2914: 2908: 2906: 2905:Related topics 2902: 2901: 2899: 2898: 2893: 2888: 2887: 2886: 2881: 2871: 2869:Morse prosigns 2865: 2863: 2857: 2856: 2854: 2853: 2848: 2843: 2838: 2833: 2828: 2821: 2820: 2819: 2814: 2809: 2799: 2794: 2789: 2788: 2787: 2782: 2774: 2769: 2764: 2759: 2754: 2753: 2752: 2742: 2737: 2731: 2729: 2725: 2724: 2722: 2721: 2716: 2711: 2706: 2701: 2695: 2693: 2686: 2685: 2683: 2682: 2677: 2672: 2667: 2662: 2657: 2652: 2647: 2642: 2637: 2632: 2627: 2622: 2616: 2614: 2604: 2603: 2601: 2600: 2595: 2590: 2585: 2580: 2575: 2570: 2565: 2563:TI calculators 2560: 2555: 2550: 2545: 2540: 2535: 2530: 2525: 2520: 2515: 2510: 2505: 2500: 2495: 2490: 2485: 2480: 2475: 2470: 2465: 2460: 2455: 2450: 2441: 2436: 2431: 2426: 2421: 2416: 2411: 2406: 2401: 2396: 2391: 2386: 2381: 2376: 2371: 2366: 2361: 2356: 2351: 2345: 2343: 2339: 2338: 2336: 2335: 2330: 2325: 2320: 2315: 2310: 2305: 2304: 2303: 2298: 2293: 2288: 2283: 2278: 2273: 2271:United Kingdom 2268: 2263: 2258: 2248: 2242: 2240: 2229: 2228: 2226: 2225: 2220: 2214: 2212: 2205: 2204: 2202: 2201: 2196: 2191: 2186: 2181: 2176: 2171: 2166: 2161: 2156: 2151: 2146: 2141: 2136: 2131: 2126: 2121: 2116: 2106: 2101: 2095: 2093: 2087: 2086: 2084: 2083: 2078: 2073: 2068: 2063: 2058: 2053: 2048: 2043: 2038: 2033: 2028: 2023: 2018: 2013: 2008: 2003: 1998: 1993: 1988: 1983: 1977: 1975: 1969: 1968: 1966: 1965: 1960: 1955: 1950: 1945: 1940: 1935: 1930: 1925: 1920: 1915: 1910: 1905: 1900: 1895: 1890: 1885: 1880: 1875: 1870: 1865: 1860: 1855: 1850: 1845: 1840: 1835: 1830: 1825: 1820: 1815: 1810: 1805: 1800: 1795: 1790: 1785: 1780: 1775: 1770: 1765: 1760: 1755: 1750: 1745: 1740: 1735: 1730: 1725: 1720: 1715: 1710: 1705: 1700: 1695: 1690: 1685: 1680: 1675: 1670: 1665: 1659: 1657: 1656:DOS code pages 1653: 1652: 1650: 1649: 1644: 1639: 1634: 1629: 1624: 1619: 1614: 1609: 1604: 1602:Latin (Kermit) 1599: 1594: 1589: 1584: 1579: 1574: 1569: 1564: 1559: 1554: 1549: 1544: 1539: 1534: 1529: 1524: 1519: 1514: 1509: 1503: 1501: 1492: 1491: 1489: 1488: 1483: 1478: 1472: 1470: 1464: 1463: 1461: 1460: 1455: 1450: 1445: 1440: 1435: 1430: 1425: 1420: 1415: 1410: 1405: 1400: 1395: 1390: 1385: 1380: 1375: 1370: 1365: 1360: 1355: 1350: 1345: 1340: 1335: 1330: 1325: 1320: 1315: 1310: 1304: 1302: 1298: 1297: 1295: 1294: 1289: 1284: 1279: 1274: 1269: 1264: 1263: 1262: 1257: 1246: 1244: 1240: 1239: 1237: 1236: 1235: 1234: 1229: 1224: 1219: 1211: 1210: 1209: 1204: 1202:KOI-8 Cyrillic 1196: 1195: 1194: 1186: 1185: 1184: 1182:-16 (Romanian) 1179: 1174: 1169: 1164: 1159: 1154: 1149: 1144: 1139: 1134: 1129: 1124: 1119: 1114: 1105: 1103: 1097: 1096: 1094: 1093: 1088: 1087: 1086: 1085: 1084: 1079: 1071: 1066: 1061: 1043: 1038: 1037: 1036: 1026: 1021: 1020: 1019: 1014: 1013: 1012: 1007: 1002: 997: 987: 980:Telegraph code 976: 974: 970: 969: 962: 961: 954: 947: 939: 933: 932: 927: 920: 919:External links 917: 914: 913: 883: 862: 837: 811: 780: 750: 728: 696: 695: 693: 690: 689: 688: 683: 678: 673: 668: 661: 658: 614: 611: 586:Main article: 583: 580: 570: 566: 562:, and others. 532:Main article: 529: 526: 508:and later DEC 476:Apple Computer 401:home computers 357: 356: 315: 313: 306: 300: 297: 200: 199: 114: 112: 105: 99: 96: 53:ANSI X3.4-1986 35:Extended ASCII 15: 9: 6: 4: 3: 2: 2999: 2988: 2985: 2983: 2980: 2979: 2977: 2964: 2954: 2948: 2945: 2943: 2940: 2938: 2935: 2933: 2930: 2928: 2925: 2923: 2920: 2918: 2915: 2913: 2910: 2909: 2907: 2903: 2897: 2894: 2892: 2889: 2885: 2882: 2880: 2877: 2876: 2875: 2872: 2870: 2867: 2866: 2864: 2862: 2858: 2852: 2849: 2847: 2844: 2842: 2839: 2837: 2834: 2832: 2829: 2827: 2826: 2822: 2818: 2815: 2813: 2810: 2808: 2805: 2804: 2803: 2800: 2798: 2795: 2793: 2790: 2786: 2783: 2781: 2778: 2777: 2775: 2773: 2770: 2768: 2765: 2763: 2760: 2758: 2755: 2751: 2748: 2747: 2746: 2743: 2741: 2738: 2736: 2733: 2732: 2730: 2726: 2720: 2717: 2715: 2712: 2710: 2707: 2705: 2702: 2700: 2697: 2696: 2694: 2691: 2687: 2681: 2678: 2676: 2673: 2671: 2668: 2666: 2663: 2661: 2658: 2656: 2653: 2651: 2648: 2646: 2643: 2641: 2638: 2636: 2633: 2631: 2628: 2626: 2623: 2621: 2618: 2617: 2615: 2613: 2612:ISO/IEC 10646 2609: 2605: 2599: 2596: 2594: 2591: 2589: 2586: 2584: 2581: 2579: 2576: 2574: 2571: 2569: 2566: 2564: 2561: 2559: 2556: 2554: 2551: 2549: 2546: 2544: 2541: 2539: 2536: 2534: 2531: 2529: 2526: 2524: 2521: 2519: 2516: 2514: 2511: 2509: 2506: 2504: 2501: 2499: 2496: 2494: 2491: 2489: 2486: 2484: 2481: 2479: 2476: 2474: 2471: 2469: 2466: 2464: 2461: 2459: 2456: 2454: 2451: 2449: 2445: 2442: 2440: 2437: 2435: 2432: 2430: 2429:Compucolor II 2427: 2425: 2422: 2420: 2417: 2415: 2412: 2410: 2407: 2405: 2402: 2400: 2397: 2395: 2392: 2390: 2387: 2385: 2384:Acorn RISC OS 2382: 2380: 2377: 2375: 2372: 2370: 2367: 2365: 2362: 2360: 2357: 2355: 2352: 2350: 2347: 2346: 2344: 2340: 2334: 2331: 2329: 2326: 2324: 2321: 2319: 2316: 2314: 2313:8-bit Turkish 2311: 2309: 2306: 2302: 2299: 2297: 2294: 2292: 2289: 2287: 2284: 2282: 2279: 2277: 2274: 2272: 2269: 2267: 2264: 2262: 2259: 2257: 2254: 2253: 2252: 2249: 2247: 2244: 2243: 2241: 2238: 2234: 2230: 2224: 2221: 2219: 2216: 2215: 2213: 2210: 2206: 2200: 2197: 2195: 2192: 2190: 2187: 2185: 2182: 2180: 2177: 2175: 2172: 2170: 2167: 2165: 2162: 2160: 2157: 2155: 2152: 2150: 2147: 2145: 2142: 2140: 2137: 2135: 2132: 2130: 2127: 2125: 2122: 2120: 2117: 2114: 2110: 2107: 2105: 2102: 2100: 2097: 2096: 2094: 2092: 2088: 2082: 2079: 2077: 2074: 2072: 2069: 2067: 2064: 2062: 2059: 2057: 2054: 2052: 2049: 2047: 2044: 2042: 2039: 2037: 2034: 2032: 2029: 2027: 2024: 2022: 2019: 2017: 2014: 2012: 2009: 2007: 2004: 2002: 1999: 1997: 1994: 1992: 1989: 1987: 1984: 1982: 1979: 1978: 1976: 1974: 1970: 1964: 1961: 1959: 1956: 1954: 1951: 1949: 1946: 1944: 1941: 1939: 1936: 1934: 1931: 1929: 1926: 1924: 1921: 1919: 1916: 1914: 1911: 1909: 1906: 1904: 1901: 1899: 1896: 1894: 1891: 1889: 1886: 1884: 1881: 1879: 1876: 1874: 1871: 1869: 1866: 1864: 1861: 1859: 1856: 1854: 1851: 1849: 1846: 1844: 1841: 1839: 1836: 1834: 1831: 1829: 1826: 1824: 1821: 1819: 1816: 1814: 1811: 1809: 1806: 1804: 1801: 1799: 1796: 1794: 1791: 1789: 1786: 1784: 1781: 1779: 1776: 1774: 1771: 1769: 1766: 1764: 1761: 1759: 1756: 1754: 1751: 1749: 1746: 1744: 1741: 1739: 1736: 1734: 1731: 1729: 1726: 1724: 1721: 1719: 1716: 1714: 1711: 1709: 1706: 1704: 1701: 1699: 1696: 1694: 1691: 1689: 1686: 1684: 1681: 1679: 1676: 1674: 1671: 1669: 1666: 1664: 1661: 1660: 1658: 1654: 1648: 1645: 1643: 1640: 1638: 1635: 1633: 1630: 1628: 1625: 1623: 1620: 1618: 1615: 1613: 1610: 1608: 1605: 1603: 1600: 1598: 1595: 1593: 1590: 1588: 1585: 1583: 1580: 1578: 1575: 1573: 1570: 1568: 1565: 1563: 1560: 1558: 1555: 1553: 1550: 1548: 1545: 1543: 1540: 1538: 1535: 1533: 1530: 1528: 1525: 1523: 1520: 1518: 1515: 1513: 1510: 1508: 1505: 1504: 1502: 1498: 1493: 1487: 1484: 1482: 1481:ISO/IEC 10367 1479: 1477: 1474: 1473: 1471: 1469: 1465: 1459: 1456: 1454: 1451: 1449: 1446: 1444: 1441: 1439: 1436: 1434: 1431: 1429: 1426: 1424: 1421: 1419: 1416: 1414: 1411: 1409: 1406: 1404: 1401: 1399: 1396: 1394: 1391: 1389: 1386: 1384: 1381: 1379: 1376: 1374: 1371: 1369: 1366: 1364: 1361: 1359: 1356: 1354: 1351: 1349: 1346: 1344: 1341: 1339: 1336: 1334: 1331: 1329: 1326: 1324: 1321: 1319: 1316: 1314: 1311: 1309: 1306: 1305: 1303: 1299: 1293: 1290: 1288: 1285: 1283: 1280: 1278: 1275: 1273: 1270: 1268: 1265: 1261: 1258: 1256: 1253: 1252: 1251: 1248: 1247: 1245: 1241: 1233: 1230: 1228: 1225: 1223: 1220: 1218: 1215: 1214: 1212: 1208: 1205: 1203: 1200: 1199: 1197: 1193: 1190: 1189: 1187: 1183: 1180: 1178: 1175: 1173: 1170: 1168: 1165: 1163: 1160: 1158: 1155: 1153: 1150: 1148: 1145: 1143: 1140: 1138: 1135: 1133: 1132:-5 (Cyrillic) 1130: 1128: 1125: 1123: 1120: 1118: 1115: 1113: 1110: 1109: 1107: 1106: 1104: 1102: 1098: 1092: 1089: 1083: 1080: 1078: 1075: 1074: 1072: 1070: 1067: 1065: 1062: 1060: 1057: 1056: 1055: 1051: 1047: 1044: 1042: 1039: 1035: 1032: 1031: 1030: 1027: 1025: 1022: 1018: 1015: 1011: 1008: 1006: 1003: 1001: 998: 996: 993: 992: 991: 988: 986: 983: 982: 981: 978: 977: 975: 971: 967: 960: 955: 953: 948: 946: 941: 940: 937: 931: 928: 926: 923: 922: 901: 897: 893: 887: 880: 876: 872: 866: 858: 854: 847: 841: 826: 822: 815: 800: 796: 795: 790: 784: 769: 765: 761: 754: 739: 732: 716: 712: 708: 701: 697: 687: 684: 682: 679: 677: 674: 672: 669: 667: 664: 663: 657: 655: 651: 647: 643: 638: 636: 630: 628: 623: 621: 610: 608: 604: 600: 596: 589: 579: 576: 563: 561: 557: 553: 549: 545: 541: 538:In 1987, the 535: 525: 523: 519: 515: 511: 507: 503: 499: 495: 493: 489: 485: 481: 477: 473: 471: 470:code page 737 467: 463: 462:code page 850 459: 455: 451: 450:code page 437 447: 442: 441: 436: 432: 428: 425: 421: 417: 412: 410: 406: 402: 399: 395: 391: 389: 385: 381: 377: 373: 371: 370:minicomputers 367: 364: 353: 350: 342: 332: 328: 322: 321: 316:This section 314: 310: 305: 304: 296: 294: 290: 284: 281: 276: 274: 270: 266: 252: 250: 246: 242: 237: 235: 231: 226: 222: 217: 215: 211: 207: 196: 193: 185: 174: 171: 167: 164: 160: 157: 153: 150: 146: 143: –  142: 138: 137:Find sources: 131: 127: 121: 120: 115:This section 113: 109: 104: 103: 95: 93: 89: 85: 81: 76: 74: 70: 66: 62: 57: 54: 49: 44: 40: 36: 30: 21: 2879:ISO/IEC 6429 2836:Stanford/ITS 2823: 2757:ARIB STD-B24 2538:Sega SC-3000 2439:DEC RADIX 50 1476:ISO/IEC 8859 1468:ISO/IEC 2022 1213:Adaptations 1172:-14 (Celtic) 1167:-13 (Baltic) 1157:-10 (Nordic) 1152:-9 (Turkish) 1101:ISO/IEC 8859 904:. Retrieved 886: 878: 874: 865: 852: 840: 829:. Retrieved 814: 803:. Retrieved 793: 783: 771:. Retrieved 768:comp.editors 753: 741:. Retrieved 731: 719:. Retrieved 710: 700: 676:Input method 639: 631: 624: 619: 616: 594: 591: 588:Windows-1252 582:Windows-1252 564: 552:Latin script 537: 534:ISO/IEC 8859 496: 484:Mac OS Roman 474: 438: 429: 424:semigraphics 413: 392: 374: 360: 345: 336: 325:Please help 320:verification 317: 285: 277: 253: 238: 218: 206:teleprinters 203: 188: 179: 169: 162: 155: 148: 136: 124:Please help 119:verification 116: 77: 58: 34: 33: 2598:ZX Spectrum 2553:Sinclair QL 2389:Amstrad CPC 2308:8-bit Greek 2235:terminals ( 1948:Iran System 1500:("scripts") 1147:-8 (Hebrew) 1137:-6 (Arabic) 1034:ISO/IEC 646 289:transcoding 241:typesetting 78:All modern 2976:Categories 2884:JIS X 0211 2792:ISO-IR-169 2645:UTF-EBCDIC 2211:code pages 1938:CSX+ Indic 1542:Devanagari 1497:Code pages 1418:LST 1590-4 1388:JIS X 0213 1383:JIS X 0212 1378:JIS X 0208 1373:JIS X 0201 1338:GOST 10859 1260:CCCII/EACC 1162:-11 (Thai) 1142:-7 (Greek) 1077:background 1000:Wabun/Kana 906:4 February 892:"Encoding" 875:W3 Schools 831:2017-07-29 805:2019-02-08 721:2 December 692:References 556:ISO 8859-5 548:ISO 8859-2 544:ISO 8859-1 522:ISO 8859-1 482:, such as 440:code pages 388:HP Roman-9 384:HP Roman-8 210:telegraphy 182:March 2016 152:newspapers 65:ISO 8859-1 2937:MICR code 2772:IEC-P27-1 2750:ISO-IR-68 2655:DIN 91379 2533:SAM Coupé 2468:GSM 03.38 2458:Galaksija 1953:Kamenický 1933:CSX Indic 1642:Ukrainian 1428:Shift JIS 1408:KS X 1002 1403:KS X 1001 1328:DIN 66003 1323:CNS 11643 1091:Transcode 1069:ITU T.101 995:Non-Latin 764:Newsgroup 666:ASCII art 607:euro sign 398:Commodore 339:June 2020 273:backspace 249:logograms 245:ideograms 69:codepages 2942:Mojibake 2797:ISO 2033 2762:Fieldata 2740:ASMO 449 2650:GB 18030 2610: / 2558:Teletext 2548:Sharp MZ 2478:HP FOCAL 2473:HP Roman 2404:Atari ST 2394:Apple II 1928:CS Indic 1622:Romanian 1597:Keyboard 1577:Gurmukhi 1572:Gujarati 1562:Georgian 1537:Cyrillic 1532:Croatian 1507:Armenian 1413:LST 1564 1398:KPS 9566 1358:GB 18030 1353:GB 12052 1348:GB 12345 1333:ELOT 927 1267:ISO 5426 1227:Estonian 1064:ITU T.61 1054:Teletext 1050:Videotex 1024:Fieldata 1010:Cyrillic 900:Archived 825:Archived 799:Archived 715:Archived 660:See also 635:mojibake 620:specific 528:ISO 8859 516:(LICS), 418:for the 293:mojibake 234:all caps 61:ISO 8859 2831:SEASCII 2825:Mojikyō 2812:KOI8-RU 2735:ABICOMP 2608:Unicode 2518:PETSCII 2508:NEC APC 2444:DEC MCS 2399:ATASCII 2296:Swedish 2281:Finnish 2266:Spanish 1958:Mazovia 1923:ABICOMP 1632:Turkish 1587:Iceland 1495:Mac OS 1438:TIS-620 1343:GB 2312 1318:BraSCII 1308:ArmSCII 1046:Teletex 1005:Chinese 773:May 17, 766::  743:May 17, 603:em dash 518:ECMA-94 454:English 409:PETSCII 405:ATASCII 166:scholar 98:History 84:Unicode 2841:Symbol 2817:KOI8-U 2807:KOI8-R 2675:TACE16 2665:CESU-8 2660:BOCU-1 2640:UTF-32 2635:UTF-16 2578:WISCII 2568:TRS-80 2488:SQUOZE 2483:HP RPL 2323:Hebrew 2318:SI 960 2286:French 2209:EBCDIC 2099:CER-GS 1582:Hebrew 1557:Gaelic 1522:Celtic 1512:Arabic 1458:YUSCII 1448:VISCII 1433:SI 960 1423:PASCII 1272:5426-2 1250:MARC-8 985:Needle 896:WHATWG 711:cygwin 605:, the 486:. The 480:Mac OS 458:French 435:IBM PC 420:TRS-80 363:EBCDIC 225:glyphs 168:  161:  154:  147:  139:  73:EBCDIC 29:Cygwin 2987:ASCII 2912:CCSID 2785:8-bit 2780:7-bit 2776:INIS 2630:UTF-8 2625:UTF-7 2620:UTF-1 2498:LMBCS 2434:CP/M+ 2276:Dutch 2261:Swiss 1943:CWI-2 1647:VT100 1617:Roman 1612:Ogham 1592:Inuit 1567:Greek 1453:VSCII 1443:TSCII 1393:KOI-7 1368:ISCII 1363:HKSCS 1255:ANSEL 1217:Welsh 1041:BCDIC 1029:ASCII 990:Morse 849:(PDF) 686:KOI-8 640:Many 569:to 9F 506:VT220 466:Greek 394:Atari 280:bytes 173:JSTOR 159:books 92:UTF-8 43:ASCII 25:ascii 2846:TRON 2699:Cork 2670:SCSU 2593:ZX81 2588:ZX80 2583:XCCS 2513:NeXT 2493:LICS 2448:NRCS 2409:BICS 2379:1058 2374:1057 2369:1056 2364:1055 2359:1054 2354:1053 2349:1052 2223:DKOI 2179:1270 2174:1258 2169:1257 2164:1256 2159:1255 2154:1254 2149:1253 2144:1252 2139:1251 2134:1250 2124:1169 2081:1133 2076:1124 2071:1046 2066:1019 2061:1018 2056:1017 2051:1016 2046:1015 2041:1014 2036:1013 2031:1012 2026:1010 2021:1009 2016:1008 2011:1006 1918:3846 1913:1127 1908:1118 1903:1117 1898:1116 1893:1115 1888:1098 1883:1044 1878:1043 1873:1042 1868:1040 1863:1034 1627:Sámi 1313:Big5 1292:6862 1287:6438 1282:5428 1277:5427 1207:Sámi 1082:sets 1048:and 908:2015 775:2022 745:2022 723:2012 654:IANA 650:HTTP 648:and 646:SMTP 595:ANSI 554:and 520:and 456:and 414:The 407:and 396:and 386:and 368:and 259:for 223:(94 208:and 145:news 82:use 2802:KOI 2719:OT1 2714:OMS 2709:OML 2704:LY1 2690:TeX 2503:MSX 2463:GEM 2419:CDC 2237:VTx 2233:DEC 2119:950 2113:GBK 2109:936 2104:932 2006:922 2001:921 1996:915 1991:912 1986:896 1981:895 1963:MIK 1858:951 1853:950 1848:949 1843:942 1838:936 1833:932 1828:904 1823:903 1818:899 1813:897 1808:869 1803:868 1798:867 1793:866 1788:865 1783:864 1778:863 1773:862 1768:861 1763:860 1758:859 1753:858 1748:857 1743:856 1738:855 1733:853 1728:852 1723:851 1718:850 1713:778 1708:777 1703:776 1698:775 1693:773 1688:770 1683:737 1678:720 1673:708 1668:668 1663:437 472:). 446:DOS 431:IBM 329:by 128:by 71:). 27:in 2978:: 2767:HZ 894:. 877:. 873:. 855:. 851:. 823:. 797:. 791:. 762:. 709:. 571:16 567:16 524:. 494:. 257:ss 247:, 2446:/ 2239:) 2115:) 2111:( 1052:/ 958:e 951:t 944:v 910:. 859:. 834:. 808:. 777:. 747:. 725:. 352:) 346:( 341:) 337:( 323:. 287:( 261:ß 195:) 189:( 184:) 180:( 170:· 163:· 156:· 149:· 122:.

Index


Cygwin
character encodings
ASCII
American National Standards Institute
ANSI X3.4-1986
ISO 8859
ISO 8859-1
codepages
EBCDIC
operating systems
Unicode
history of computing
UTF-8

verification
improve this article
adding citations to reliable sources
"Extended ASCII"
news
newspapers
books
scholar
JSTOR
Learn how and when to remove this message
teleprinters
telegraphy
impact printers
printable characters
glyphs

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.