3rdpageSearch Front end to several search engines and portals that allows you to enter queries in various character sets. |
A Brief History of Character Codes A concise history of the development of character encoding in Western and East Asian languages, including ASCII, EBCDIC, Unicode and TRON. |
An Early History of Character Set Standardization Covers the beginnings of the ASCII standards from ASCII-1963 onwards and information on Cyrillic, Japanese, Korean, Thai and Vietnamese encoding systems, including various localized versions of EBCDIC. With tables and links to other resources. |
ASCII and EBCDIC Compared A comparison of two of these two basic encoding systems, with tables. |
Basis Technology: Presentations and Papers A wide range of articles on Unicode, East Asian localization and Internationalization issues. |
Character Set Issues beyond HTML3.2 Internationalization issues beyond HTML3.2 and ISO-8859-1. Includes information on Baltic encodings. |
Characters and Encodings A tutorial on character code issues in digital processing and transfer of text data, on the Internet or otherwise. Includes tables and a detailed listing of control codes. In English and Finnish. |
czyborra.com Information on the ISO 8859 alphabet soup, Cyrillic encoding, PC codepages, East Asian encoding, ASCII and variants, EBCDIC and Unicode - by Roman Czyborra. |
Diffuse Project: Character Set Standards An overview of different character set encoding standards. |
Domain Island: Language Tables Information on GB2312 (Chinese simplified), Big5 (Chinese traditional), Shift-JIS (Japanese), KSC5601 (Korean), Windows-874 (Thai), Windows-1258 (Vietnamese) and ISO 8859 (multilingual) codepages. |
ECMA: Character Code Structure and Extension Techniques Specifies the structure of ECMA-35, for 8-bit codes and 7-bit codes which provide for the coding of character sets, with a detailed PDF document. |
eGrannie: ASCII-EBCDIC chart A side-by-side comparision of ASCII and EBCDIC encoding. |
EKI Letter Database Query character sets, encoding, codepages and Unicode information in an easy-to-use web form. Held at the Institute of the Estonian Language. |
HTML Validation: Using Character Encodings How to validate HTML documents in various character encodings. |
IANA: Character Sets The official names for character sets that may be used in the Internet and referred to in Internet documentation - held at the Internet Assigned Number Authority. |
ISO 639 Language Names The standard names for use in SGML and XML, including a complete list of language name codes. |
MS Windows characters in HTML A review of the HTML authoring problems caused by some special characters which belong to MS Windows character set but not to ISO Latin 1. Includes technical details and substitution tables. In English and Finnish. |
Tutorial: Shady Characters A tutorial that explains HTML character sets, character encodings and character references from Webreference.com. |
World Wide Web Consortium Covers code tables, Unicode, HTML and XML and links to other resources and discusses internationalization and localization issues relating to character sets. |
Xceed Binary Encoding Library A library for Windows developers that allows applications to encode binary data and files into text and vice-versa. |