Contents
What's the Problem?
How it Looks to the Computer
An Early Encoding
The ASCII Character Set
The Latin-1 Character Set
The Latin-2 Character Set
The WinLatin-1 Character Set
Code-Page Chaos
What is Unicode?
Unicode Code Point Assignments
Code Point Notation
Encoding Unicode - UTF-16
Byte Order Mark - BOM
Encoding Unicode - UTF-32
Encoding Unicode - UTF-8
Specifying an Encoding in HTML
Using Unicode Characters in HTML
Perl's World View
Perl Defaults to Binary IO
Specifying an Encoding Layer
Character Semantics for Built-in Functions
Include Unicode Literals in Source
Manually Upgrading Strings to UTF-8
Perl, XML and Unicode
Encoding of HTML Form Submissions
Retrieving Encoded Web Content
References