On Feb 15, 2014 12:43 PM, "Pierre Joye" <
[email protected]> wrote:
On Feb 15, 2014 2:35 AM, "Rasmus Lerdorf" <
[email protected]> wrote:
What we really need is an awesome small and fast Unicode library that
does everything ICU does but faster and in less code while using UTF-8
as its internal storage so we don't have to convert on each and every
operation. There are a ton of non-obvious things beyond simple string
manipulation. String collation alone is massively complicated, for
example.
interesting. There are other but it has to be chosen very carefully :)
https://github.com/josephg/librope claims to be fast and compliant. Added
to my list.
If I'm reading that correctly it does the substring stuff on already converted UTF8 strings? It needs a UTF8 conversion of even a UTF16 string to work ... I think?