Unicode explained / Jukka K. Korpela.

  • First edition.
  • Beijing ; Sebastopol, California : O'Reilly [2006]
Physical description
1 online resource (704 p.)
  • 0-596-15331-7
  • Includes index.
  • Reproduction available: Electronic reproduction. Sebastopol, CA : O'Reilly Media, Inc., ©2021. Available in HTML format. Description based on contents viewed 24 June 2021.
  • English
  • Description based on print version record.
  • Table of Contents; Preface; Audience; Assumptions and Approach; Contents of This Book; Self-Assessment Test; Conventions Used in This Book; Using Code Examples; Safari® Enabled; How to Contact Us; Acknowledgments; Part I. Working with Characters; Chapter 1. Characters as Data; Introduction to Characters and Unicode; Why Unicode?; Unicode Can Be Easy; What's in a Character?; Why Do We Need to Know About Characters?; Characters as Units of Text; Characters as abstractions; Variation of appearance or different characters?; Variation in shape turned into a character difference
  • Characters and "abstract characters"; Characters and other units of text; Characters Versus Images; Processing of Characters; Giving Identity to Characters; Definitions of characters in standards; Annotations used to emphasize differences; The representative glyphs; The number and the Unicode name as identifiers; Unicode is more explicit; Spelling of names and the U+nnnn convention; Unicode Definitions of Characters; Definitions of Characters Elsewhere; What's in a Name?; Should We Be Strict About the Meanings of Characters?; Ambiguity Among Characters; How Do I Find My Character?
  • Which Characters Does Each Language Use?; Variation of Writing Systems; Glyphs and Fonts; Allowed Variation of Glyphs; Fonts and Their Properties; Font Variation Versus Characters; Fonts in Implementations; Failures to Display a Character; Font Embedding; Definitions of Character Repertoires; Formally Defined Repertoires; Practical Repertoires; Numbering Characters; Hexadecimal Notation; Numbers as Indexes; Making Use of Character Numbers; Encoding Characters as Octet Sequences; Plain Text and Other Formats for Text; Bytes and Octets; Character Encodings; Single-Octet Encodings
  • Multi-Octet Encodings; The "Character Set" Confusion; Working with Encodings; Selecting the Encoding When Saving; How Encodings Should Be Detected; Setting the Encoding Manually; Sending Unicode Email; Viewing Web Pages in Different Encodings; Common Confusion: Encoding Versus Language; Working with Fonts; Installing Additional Support; Font Support in Web Browsers; Font Substitution: a Solution and a Problem; Printer Fonts; Finding Fonts; Fonts in Web Authoring; The fallback problem; Effects of browser settings; Summaries; Summary of Definitions; Summary of Concept Levels
  • The code page-specific Alt-n method
Related item
  • Electronic books.
  • Illustrated
  • text
  • English


  • Fundamentally, computers just deal with numbers. They store letters and other characters by assigning a number to each one. There are hundreds of different encoding systems for mapping characters to numbers, but Unicode promises a single mapping. Unicode enables a single software product or website to be targeted across multiple platforms, languages, and countries without re-engineering. It's no wonder that industry giants like Apple, Hewlett-Packard, IBM, and Microsoft have all adopted Unicode. Containing everything you need to understand Unicode, this comprehensive reference from
