Understanding UTF-8
U+0080 ... need more than one byte
Every byte with MSB set (1xxxxxxx) ...
is part of a multi-byte character
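
The two points above can be checked with a minimal sketch using Python's built-in str.encode. The helper names utf8_length and all_bytes_have_msb are hypothetical, chosen only for this illustration.

```python
def utf8_length(ch: str) -> int:
    """Number of bytes UTF-8 uses to encode the given string."""
    return len(ch.encode("utf-8"))


def all_bytes_have_msb(ch: str) -> bool:
    """True if every encoded byte has its most significant bit set (1xxxxxxx)."""
    return all(b & 0x80 for b in ch.encode("utf-8"))


if __name__ == "__main__":
    # U+007F is the last code point that fits in a single byte.
    for ch in ["\u007f", "\u0080", "\u20ac", "\U0001f600"]:
        encoded = ch.encode("utf-8")
        bits = " ".join(f"{b:08b}" for b in encoded)
        print(f"U+{ord(ch):04X}: {utf8_length(ch)} byte(s): {bits}")
```

In the printed bit patterns, the single-byte case starts with 0, while every byte of a multi-byte sequence starts with 1, which is what lets a decoder tell the two apart by looking at one byte.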