7 lines
336 B
Python
7 lines
336 B
Python
#!/bin/env python
|
|
# -*- coding: UTF-8 -*-
|
|
s = u"møøse"
|
|
assert len(s) == 5
|
|
assert len(s.encode('UTF-8')) == 7
|
|
assert len(s.encode('UTF-16-BE')) == 10 # There are 3 different UTF-16 encodings: LE and BE are little endian and big endian respectively, the third one (without suffix) adds 2 extra leading bytes: the byte-order mark (BOM).
|