Re: bugs.php.net & php 6
John Crenshaw wrote:
Wait, is the default going to be "Unicode" (wide, always 2 bytes per char, i.e. more memory consumption) or "UTF-8" (1 byte for the first 128 characters, more bytes for wider text, mostly unchanged memory consumption)? I thought it was originally a conversion to Unicode, but that was scrapped? Can someone clarify?
I seem to recall that the original plan was along the lines of Windows wide strings? But Unicode is wider than 16 bits, so that approach was simply wrong. Trying to shoehorn things into the wrong structure was just not working and was making the job more difficult.
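
To illustrate the "wider than 16 bits" point, here is a minimal sketch in Python (not PHP, and not anything from the PHP 6 code base). The sample character U+1F600 is just my own pick of a code point above 0xFFFF; any such code point would do.

```python
# A code point above 0xFFFF does not fit in a single 16-bit unit.
ch = "\U0001F600"  # illustrative choice, assumed for this example only

code_point = ord(ch)

# UTF-16 has to use a surrogate pair, i.e. two 16-bit units.
utf16_units = len(ch.encode("utf-16-le")) // 2

# UTF-32 always uses 4 bytes per character.
utf32_bytes = len(ch.encode("utf-32-le"))

# UTF-8 uses a variable number of bytes (4 for this character).
utf8_bytes = len(ch.encode("utf-8"))

print(hex(code_point), utf16_units, utf32_bytes, utf8_bytes)
```

So a fixed "2 bytes per char" string only covers part of Unicode, which is roughly the problem described above.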
Keeping things simple really requires 4 bytes per character, even if one of those bytes is never used, and that does make sense when manipulating strings. Most of the time, however, UTF-8 works happily and only becomes a problem when a multibyte character gets cropped because the processing code does not know about it.
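
The cropping problem can be sketched like this, again in Python rather than PHP. The helper name truncate_utf8 and the sample text are hypothetical, made up for the example. It only shows the general idea of backing off to a character boundary, not how any PHP function does it.

```python
def truncate_utf8(raw: bytes, max_bytes: int) -> bytes:
    """Cut raw UTF-8 to at most max_bytes without splitting a character.

    Hypothetical helper for illustration only.
    """
    if max_bytes >= len(raw):
        return raw
    cut = max(max_bytes, 0)
    # Continuation bytes look like 10xxxxxx. If the first dropped byte is
    # one of those, the cut is inside a character, so step back.
    while cut > 0 and (raw[cut] & 0xC0) == 0x80:
        cut -= 1
    return raw[:cut]


raw = "naïve café".encode("utf-8")  # "ï" and "é" take 2 bytes each

# Byte-oriented cropping that does not know about UTF-8 splits "ï".
naive = raw[:3]
try:
    naive.decode("utf-8")
    naive_ok = True
except UnicodeDecodeError:
    naive_ok = False

# Boundary-aware cropping drops the partial character instead.
safe = truncate_utf8(raw, 3).decode("utf-8")
print(naive_ok, safe)
```

The naive slice leaves half a character behind, while the boundary-aware version returns a shorter but still valid string.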
--
Lester Caine - G8HFL
-----------------------------
Contact - http://lsces.co.uk/wiki/?page=contact
L.S.Caine Electronic Services - http://lsces.co.uk
EnquirySolve - http://enquirysolve.com/
Model Engineers Digital Workshop - http://medw.co.uk//
Firebird - http://www.firebirdsql.org/index.php