Re: Re: [php6] Unicode support, options?

From: Date: Fri, 14 Mar 2014 11:20:02 +0000
Subject: Re: Re: [php6] Unicode support, options?
References: 1 2 3 4 5 6 7 8 9 10 11 12 13  Groups: php.internals 
Request: Send a blank email to [email protected] to get a copy of this message
On Fri, Mar 14, 2014 at 11:46 AM, Lester Caine <[email protected]> wrote:

> Yasuo Ohgaki wrote:
>
>> I've checked libmbfl AUTHORS in ext/mbstring. There are too many.
>> Switching multibyte filter is easier, I'll use ICU for it. Then there is
>> no
>> obstacle building mbstring by default.
>>
>
> Slight aside but relevant re. regular expressions library ... What is
> wrong with the unicode mode of preg? I'd just been using it without even
> thinking after moving over from ereg.
>

Nothing is wrong with it, PCRE has very good support for UTF-8 (including
character properties and extended grapheme clusters). Can we just deprecate
mb_ereg? It seems totally useless and just confuses people. If you want to
match regular expressions on non-UTF-8 just do a conversion beforehand (or
use a sane encoding right away, you know).

Nikita


Thread (34 messages)

« previous php.internals (#73162) next »