On Fri, Mar 14, 2014 at 11:46 AM, Lester Caine <[email protected]> wrote:
> Yasuo Ohgaki wrote:
>
>> I've checked libmbfl AUTHORS in ext/mbstring. There are too many.
>> Switching multibyte filter is easier, I'll use ICU for it. Then there is no
>> obstacle building mbstring by default.
>>
>
> Slight aside but relevant re. regular expressions library ... What is
> wrong with the unicode mode of preg? I'd just been using it without even
> thinking after moving over from ereg.
>
Nothing is wrong with it; PCRE has very good support for UTF-8 (including
character properties and extended grapheme clusters). Can we just deprecate
mb_ereg? It seems totally useless and just confuses people. If you want to
match regular expressions against non-UTF-8 input, just convert it to UTF-8
beforehand (or use a sane encoding right away, you know).
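
A minimal sketch of the convert-then-match approach, assuming the input
arrives as ISO-8859-1 and that the iconv extension is available
(mb_convert_encoding would work the same way). The helper name
match_words_legacy is hypothetical, not something PHP provides:

```php
<?php
// Hypothetical helper for illustration only; not part of PHP or mbstring.
function match_words_legacy(string $subject, string $fromEncoding): array
{
    // Step 1: convert the legacy-encoded input to UTF-8 up front.
    $utf8 = iconv($fromEncoding, 'UTF-8', $subject);
    if ($utf8 === false) {
        throw new RuntimeException("cannot convert from $fromEncoding");
    }

    // Step 2: match with preg in UTF-8 mode. The /u modifier makes PCRE
    // treat pattern and subject as UTF-8; \p{L} is the Unicode "letter"
    // character property.
    if (preg_match_all('/\p{L}+/u', $utf8, $matches) === false) {
        throw new RuntimeException('preg error code ' . preg_last_error());
    }

    // Full-pattern matches, returned as UTF-8 strings.
    return $matches[0];
}

// "café naïve" encoded as ISO-8859-1 (0xE9 = é, 0xEF = ï).
$words = match_words_legacy("caf\xE9 na\xEFve", 'ISO-8859-1');
```

The matches come back as UTF-8, so if the rest of the application still
expects the legacy encoding, the results would need converting back.
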
Nikita