Re: Unicode strings?

From: Crypto Compress Date: Thu, 13 Mar 2014 08:33:08 +0000

Subject: Re: Unicode strings?

References: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Groups: php.internals

Request: Send a blank email to [email protected] to get a copy of this message

Hi Yasuo,

        That's not a hole in the design. It was quite deliberate and
        it had
        little to do with Unicode at the time. It was a deliberate
        effort to not
        artificially limit identifiers beyond that which the language
        syntax
        naturally prevented. Think <space> ; , { } ( ) etc.


    IMHO it was the right decision to no artificially limit
    identifiers and it is a fair trade-off for case-insensitivity
    without unicode (class ß{} class SS{}).
    With unicode identifiers there is at least one more problem
    through normalization to consider. somewhat simplified: $☀☁ and
     $⛅ (=== in unicode)


Good point, but users should use NFC UTF-8 without BOM for variable/function names.
It would be documentation issue.

in the languages i know combining diacritics are not common so can't evaluate how practical it is to type those. Would it be impossible to change code with a dumb editor?

$café !== $café
0x63 0x61 0x66 0xC3 0xA9
0x63 0x61 0x66 0x65 0xCC 0x81

cryptocompress

Thread (28 messages)

Lester CaineTue, 11 Mar 2014 10:31:30 +0000
Crypto CompressTue, 11 Mar 2014 11:06:18 +0000
Lester CaineTue, 11 Mar 2014 12:27:50 +0000
Andrea FauldsTue, 11 Mar 2014 17:43:13 +0000
Crypto CompressWed, 12 Mar 2014 09:49:57 +0000
Lester CaineWed, 12 Mar 2014 10:16:01 +0000
Crypto CompressWed, 12 Mar 2014 10:27:19 +0000
Crypto CompressWed, 12 Mar 2014 10:33:24 +0000
Pierre JoyeWed, 12 Mar 2014 10:54:33 +0000
Crypto CompressWed, 12 Mar 2014 11:14:18 +0000
Lester CaineWed, 12 Mar 2014 11:49:15 +0000
Crypto CompressWed, 12 Mar 2014 12:00:43 +0000
Lester CaineWed, 12 Mar 2014 12:20:42 +0000
Crypto CompressWed, 12 Mar 2014 12:41:23 +0000
Yasuo OhgakiThu, 13 Mar 2014 01:10:11 +0000
Rasmus LerdorfThu, 13 Mar 2014 00:01:24 +0000
Crypto CompressThu, 13 Mar 2014 01:22:31 +0000
Yasuo OhgakiThu, 13 Mar 2014 01:53:36 +0000
Crypto CompressThu, 13 Mar 2014 08:33:08 +0000
Lester CaineThu, 13 Mar 2014 09:18:06 +0000
Crypto CompressThu, 13 Mar 2014 11:28:41 +0000
Stas MalyshevThu, 13 Mar 2014 19:59:41 +0000
Yasuo OhgakiThu, 13 Mar 2014 20:32:39 +0000
Andrea FauldsThu, 13 Mar 2014 20:36:41 +0000
Yasuo OhgakiThu, 13 Mar 2014 20:45:21 +0000
Lester CaineThu, 13 Mar 2014 09:06:11 +0000
Pierre JoyeTue, 11 Mar 2014 13:12:48 +0000
Yasuo OhgakiTue, 11 Mar 2014 21:55:04 +0000

« previous	php.internals (#73100)	next »

From:	Crypto Compress	Date:	Thu, 13 Mar 2014 08:33:08 +0000
Subject:	Re: Unicode strings?
References:	1 2 3 4 5 6 7 8 9 10 11 12 13 14	Groups:	php.internals
Request:	Send a blank email to [email protected] to get a copy of this message