(no title)
hannibal5 | 12 years ago
I know one proprietary library for Unicode that can be used for search (from multiple different sources of UTF strings), indexing etc. and claims to support full Unicode that deals with all things involved, including directionality, surrogates, control chars etc. Their internal representation of stings for string processing is vector of displayed characters objects (not code points). Using UTF-* encoding as internal representation works only for simple string processing for subset of Unicode.
No comments yet.