https://ianthehenry.com/posts/decoding-utf-8/