Re: Unicode strings?

From: Crypto Compress Date: Thu, 13 Mar 2014 11:28:41 +0000

Subject: Re: Unicode strings?

References: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Groups: php.internals

Request: Send a blank email to [email protected] to get a copy of this message

Am 13.03.2014 10:18, schrieb Lester Caine:
Crypto Compress wrote:
Good point, but users should use NFC UTF-8 without BOM for variable/function
names.
It would be documentation issue.

in the languages i know combining diacritics are not common so can't evaluate
how practical it is to type those. Would it be impossible to change code with a
dumb editor?

$café !== $café
0x63 0x61 0x66 0xC3 0xA9
0x63 0x61 0x66 0x65 0xCC 0x81

'cryptocompress' (is that really on your passport :( )

This is exactly the area we need to agree on a plan moving forward.

There are a number of options on the table

1 - Limit variable and other names to 'ASCII' only characters so that case folding can be maintained.

2 - Remove 'case insensitivity' but not just for point 1 reasons.
( I see this as your example being two different strings ;) )

3 - Allow unicode names to be used in places where they currently cause problems.

Not actually using unicode variable names myself, I still don't understand where the problems are with '3' except for the simple comparison case where normalizing and case conversion creates a minefield? People are currently using unicode in these areas and understand many of the restrictions?


My unverified assumption is: The performance impact (cli without opcache) is too huge to get this right. If we do not get this right, there are complainers. How shall we die?

Виталий

Thread (28 messages)

Lester CaineTue, 11 Mar 2014 10:31:30 +0000
Crypto CompressTue, 11 Mar 2014 11:06:18 +0000
Lester CaineTue, 11 Mar 2014 12:27:50 +0000
Andrea FauldsTue, 11 Mar 2014 17:43:13 +0000
Crypto CompressWed, 12 Mar 2014 09:49:57 +0000
Lester CaineWed, 12 Mar 2014 10:16:01 +0000
Crypto CompressWed, 12 Mar 2014 10:27:19 +0000
Crypto CompressWed, 12 Mar 2014 10:33:24 +0000
Pierre JoyeWed, 12 Mar 2014 10:54:33 +0000
Crypto CompressWed, 12 Mar 2014 11:14:18 +0000
Lester CaineWed, 12 Mar 2014 11:49:15 +0000
Crypto CompressWed, 12 Mar 2014 12:00:43 +0000
Lester CaineWed, 12 Mar 2014 12:20:42 +0000
Crypto CompressWed, 12 Mar 2014 12:41:23 +0000
Yasuo OhgakiThu, 13 Mar 2014 01:10:11 +0000
Rasmus LerdorfThu, 13 Mar 2014 00:01:24 +0000
Crypto CompressThu, 13 Mar 2014 01:22:31 +0000
Yasuo OhgakiThu, 13 Mar 2014 01:53:36 +0000
Crypto CompressThu, 13 Mar 2014 08:33:08 +0000
Lester CaineThu, 13 Mar 2014 09:18:06 +0000
Crypto CompressThu, 13 Mar 2014 11:28:41 +0000
Stas MalyshevThu, 13 Mar 2014 19:59:41 +0000
Yasuo OhgakiThu, 13 Mar 2014 20:32:39 +0000
Andrea FauldsThu, 13 Mar 2014 20:36:41 +0000
Yasuo OhgakiThu, 13 Mar 2014 20:45:21 +0000
Lester CaineThu, 13 Mar 2014 09:06:11 +0000
Pierre JoyeTue, 11 Mar 2014 13:12:48 +0000
Yasuo OhgakiTue, 11 Mar 2014 21:55:04 +0000

« previous	php.internals (#73108)	next »

From:	Crypto Compress	Date:	Thu, 13 Mar 2014 11:28:41 +0000
Subject:	Re: Unicode strings?
References:	1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16	Groups:	php.internals
Request:	Send a blank email to [email protected] to get a copy of this message