Hi Nikita,
On Fri, Mar 14, 2014 at 8:20 PM, Nikita Popov <[email protected]> wrote:
>
> Nothing is wrong with it, PCRE has very good support for UTF-8 (including
> character properties and extended grapheme clusters). Can we just deprecate
> mb_ereg? It seems totally useless and just confuses people. If you want to
> match regular expressions on non-UTF-8 just do a conversion beforehand (or
> use a sane encoding right away, you know).
Encoding conversion would not work always. i.e. there are number of vendor
specific extensions. Therefore, native encoding regex is required for those
who need to handle chars.
Regards,
--
Yasuo Ohgaki
[email protected]