Encoding Unicode - UTF-16
- Originally < 64,000 characters
- (Basic Multilingual Plane - BMP)
- 16 bits per character were enough
- UTF-16 encodes Unicode with 2 bytes/character
- With endian issues (UTF-16BE vs UTF-16LE)
- Converting ASCII to UTF-16 Unicode doubles file size
- Surrogate pairs to get to other 'planes'