(no title)
niccaluim | 9 years ago
That being said, an FAQ on combining characters points out that Unicode's definition of "character" may not match an end user's, and that it's best to use the word "grapheme" instead for clarity. (And that being said, if the typical end user knows what "grapheme" means, I'll eat my cat.)
So from a practical standpoint, it's best to make sure that any input to rev is in one of the composed normal forms.
(Incidentally, the proper sequence is <base character><combining character>…, not the other way around.)
bhaak|9 years ago
But there are real world characters that don't have precomposed forms (IIRC e.g. indic scripts).
kps|9 years ago