Split off the first UTF char of a string

@NobbZ thx for clarifying that. After reading parts of Codepoints vs Grapheme I think I got the difference.

I use that splitting of unicode binaries in the scanner of my xml lib (elixml). So I am more or less copying text. In that case I guess it is ok to work on codepoints. Correct me if I’m wrong.