Re: [php6] Unicode support, options?

From: Ivan Enderlin @ Hoa
Date: Thu, 20 Feb 2014 16:10:25 +0000
Subject: Re: [php6] Unicode support, options?
References: 1
Groups: php.internals

On 20/02/2014 06:54, Pierre Joye wrote:
> hi,

Hello :-),


> Unicode still remains one of the top requested features in PHP.
>
> However, as Rasmus and others stated earlier, it is not a trivial job.
> Some of the key points we need to take care of are:
>
> - UTF-8 storage
> - UTF-8 support for almost all (if not all) existing string APIs
> - Performance
>
> As of today, I have not found any library covering at least two of these
> key points.

[snip]

> I would like to begin discussing our options now. I am not
> asking to get into all the implementation details from a userland point of
> view (like u"some text" or adding new APIs or not) but only to see what
> we can do internally to work with UTF-8 strings.

Just a little note: a u"foobar" syntax would let us switch internally between a lighter and a heavier implementation, and thus would help to cover at least two of the key points described above.

I would also mention the Rust implementation of UTF-8 strings [1, 2]. It is fast, it is safe, and it has a nice, large API. I am not saying I want to see PHP use Rust: I think that would be hard to do (even if it would certainly benefit PHP). But the algorithms they use can be a source of inspiration for us. Maybe we should consider it if we decide to write our own implementation instead of using a third-party library.
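
To make the "UTF-8 storage plus string API" point concrete, here is a minimal sketch in Rust. It assumes the current standard `str` API (`len`, `chars`, `char_indices`), which may differ from the 2014 sources linked below, and the helper names `byte_len`, `char_len` and `substr_chars` are hypothetical, chosen only for illustration. It is not how Rust or PHP implements anything internally; it only shows the kind of byte-versus-code-point bookkeeping a UTF-8 backed string API has to do.

```rust
/// Number of bytes used to store `s` as UTF-8 (constant time).
fn byte_len(s: &str) -> usize {
    s.len()
}

/// Number of Unicode scalar values in `s`.
/// This walks the whole string, because UTF-8 is a variable-width encoding.
fn char_len(s: &str) -> usize {
    s.chars().count()
}

/// Hypothetical substr-like helper working on code-point offsets.
/// It takes `len` code points starting at code point `start`, and returns
/// a slice of the original bytes, so nothing is copied.
fn substr_chars(s: &str, start: usize, len: usize) -> &str {
    // Byte offset of the `start`-th code point, or the end if out of range.
    let begin = s.char_indices().nth(start).map_or(s.len(), |(i, _)| i);
    let rest = &s[begin..];
    // Byte offset (within `rest`) just past the last requested code point.
    let end = rest.char_indices().nth(len).map_or(rest.len(), |(i, _)| i);
    &rest[..end]
}
```

For example, "héllo wörld" takes 13 bytes but contains 11 code points, and `substr_chars("héllo wörld", 1, 4)` yields "éllo". The trade-off is visible here: storage stays compact and slicing is cheap, but anything indexed by code point costs a linear scan, which is exactly where the performance key point comes in.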


Cheers.

[1] https://github.com/mozilla/rust/blob/master/src/libstd/str.rs
[2] http://static.rust-lang.org/doc/master/std/str/index.html

-- 
Ivan Enderlin
Developer of Hoa
http://hoa-project.net/

PhD. student at DISC/Femto-ST (Vesontio) and INRIA (Cassis)
http://disc.univ-fcomte.fr/ and http://www.inria.fr/

Member of HTML and WebApps Working Group of W3C
http://w3.org/
