mwsherman | 4 months ago
A grapheme can be multiple codepoints, with modifiers, joiners, etc.
This is true in all languages; it’s a Unicode thing, not a Go thing. Shameless plug: here is a grapheme tokenizer for Go: https://github.com/clipperhouse/uax29/tree/master/graphemes
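To make that concrete, here is a rough stdlib-only sketch (the `sizes` helper is a made-up name for illustration, and it does not use the tokenizer linked above). It shows two strings that each render as a single grapheme but contain several code points. Go's standard library can count bytes and runes, but it has no built-in notion of grapheme clusters, which is why a UAX #29 segmenter is needed for the last step.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// sizes is a hypothetical helper: it reports the UTF-8 byte length
// and the number of code points (runes) in s. Neither number is the
// grapheme count; the standard library does not compute that.
func sizes(s string) (nBytes, nRunes int) {
	return len(s), utf8.RuneCountInString(s)
}

func main() {
	samples := []string{
		// "e" followed by U+0301 COMBINING ACUTE ACCENT: one grapheme.
		"e\u0301",
		// WOMAN, ZERO WIDTH JOINER, WOMAN, ZERO WIDTH JOINER, GIRL:
		// rendered as a single family emoji where fonts support it.
		"\U0001F469\u200D\U0001F469\u200D\U0001F467",
	}
	for _, s := range samples {
		b, r := sizes(s)
		fmt.Printf("%d bytes, %d code points\n", b, r)
	}
}
```

The first string is 3 bytes and 2 code points; the second is 18 bytes and 5 code points. Both are one user-perceived character, so neither `len` nor ranging over runes tells you what a reader would call "one character".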
HeyImAlex|4 months ago
debugnik|4 months ago
I'm saving this one. Not exactly how I'd explain it, but it's simplified enough to share with my current co-workers without being misleading.
virtualritz|4 months ago
I do not use Go but ran into this when I had to write a Go wrapper for some Rust stuff the other day. I was baffled.