Re: PHP True Async RFC

From: Larry Garfield Date: Wed, 05 Mar 2025 18:30:07 +0000

Subject: Re: PHP True Async RFC

References: 1 2 3 Groups: php.internals

Request: Send a blank email to [email protected] to get a copy of this message

On Wed, Mar 5, 2025, at 3:37 AM, Edmond Dantes wrote:
> Good day, Larry.
>
>> First off, as others have said, thank you for a thorough and detailed proposal.
> Thanks!
>
>> * A series of free-standing functions.
>> * That only work if the scheduler is active.
>> * The scheduler being active is a run-once global flag.
>> * So code that uses those functions is only useful based on a global state not present in
>> that function.
>> * And a host of other seemingly low-level objects that have a myriad of methods on them
>> that do, um, stuff.
>> * Oh, and a lot of static methods, too, instead of free-standing functions.
>
> Suppose these shortcomings don’t exist, and we have implemented the 
> boldest scenario imaginable. We introduce Structured Concurrency, 
> remove low-level elements, and possibly even get rid of Future. Of 
> course, there are no functions like startScheduler or anything like 
> that.
>
>  1. In this case, how should PHP handle Fiber and all the behavior 
> associated with it? Should Fiber be declared deprecated and removed 
> from the language? What should the flow be?

I'm not sure yet.  I was quite hesitant about Fibers when they went in because they were so
low-level, but the authors were confident that they were enough for user space to iterate quickly
on a toolchain that everyone could use.  That clearly didn't pan out as intended (Revolt
exists, but usage of it is still rare), so here we are with a half-finished API.

Thinking aloud, perhaps we could cause new Fiber to create an automatic async block? 
Or we do deprecate it and discourage its use.  Something to think through, certainly.

>  2. What should be done with I/O functions? Should they remain 
> blocking, with a separate API provided as an extension?

The fact that IO functions become transparently async when appropriate is the best part of the
current RFC.  Please keep that. :-)

>  3. Would it be possible to convince the maintainers of XDEBUG and 
> other extensions to rewrite their code to support the new model? ( *If 
> you're reading this question now, please share your opinion.* )

I cannot speak for Derick.

>  4. If transparent concurrency is introduced for I/O in point 2, what 
> should be done with Revolt + AMPHP? This would break their code. 
> Should an additional function or option be introduced to switch PHP 
> into "legacy mode"?

Also an excellent question, to which I do not yet have an answer.  (See previous point about Fibers
being half-complete.)  I would want to involve Aaron, Christian, and Cees-Jo before trying to make
any suggestions here.

> Structured concurrency is a great thing. However, I’d like to avoid 
> changing the language syntax and make something closer to Go’s 
> semantics. I’ll think about it and add this idea to my TODO.

Well, as noted in the article, structured concurrency done right means *not* having unstructured
concurrency.  Having Go-style async and then building a structured nursery system on top of it means
you cannot have any of the guarantees of the structured approach, because the other one is still
poking out the side and leaking.  We're already stuck with mutable-by-default, global
variables, and other things that prevent us from making helpful assumptions.  Please, let's try
to avoid that for async.  We don't need more gotos.

>> async $context {
>> // $context is an object of AsyncContext, and can be passed around as such.
>> // It is the *only* way to spawn anything async, or interact with the async controls.
>> // If a function doesn't take an AsyncContext param, it cannot control async.  This is good.
>
> This is a very elegant solution. Theoretically.
>
> However, in practice, if you require explicitly passing the context to 
> all functions, it leads to the following consequences:
>
>  1. The semantics of all functions increase by one additional parameter 
> (*Signature bloat*).

No, just those functions/objects that necessarily involve running async control commands.  Most
wouldn't.  They would just silently context switch when they hit an IO operation (which as
noted above is transparently supported, which is what makes this work) and otherwise behave the same.

But if something does actively need to do async stuff, it should have a context to work within. 
It's the same discussion as:

A: "Pass/inject a DB connection to a class that needs it, don't just call a global db()
function."
B: "But then I have to pass it to all these places explicitly!"
A: "That's a sign your SQL is too scattered around the code base. Fix that first and your
problem goes away."

Explicit flow control is how you avoid bugs.  It's also self-documenting, as it's patently
obvious what code expects to run in an async context and which doesn't care.
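A minimal sketch of that distinction, under loud assumptions: the AsyncContext class here is a hypothetical stand-in that simply runs each task immediately, and run(), results(), normalizeName() and normalizeAll() are illustrative names, not anything from the RFC.  The only point is that the signature tells you which function can spawn work and which cannot.

```php
<?php
declare(strict_types=1);

// Hypothetical stand-in for the AsyncContext discussed in this thread.
// It runs each task immediately; a real one would schedule them concurrently.
final class AsyncContext
{
    /** @var list<mixed> */
    private array $results = [];

    public function run(callable $task, mixed ...$args): void
    {
        $this->results[] = $task(...$args);
    }

    /** @return list<mixed> */
    public function results(): array
    {
        return $this->results;
    }
}

// No context parameter: this function cannot start or control async work.
// With transparent IO it could still be suspended at an IO call, unnoticed.
function normalizeName(string $raw): string
{
    return ucfirst(strtolower(trim($raw)));
}

// Takes a context: the signature itself documents that it spawns tasks.
function normalizeAll(AsyncContext $ctx, array $names): void
{
    foreach ($names as $name) {
        $ctx->run(normalizeName(...), $name);
    }
}

$ctx = new AsyncContext();
normalizeAll($ctx, [' larry ', 'EDMOND']);
print_r($ctx->results()); // ['Larry', 'Edmond']
```

Only normalizeAll() needed the extra parameter; the leaf function is unchanged.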

>  2. If an asynchronous call needs to be added to a function, and other 
> functions depend on it, then the semantics of all dependent functions 
> must be changed as well. 

This is no different than DI of any other service.  I have restructured code to handle temporary
contexts before.  (My AttributeUtils and Serde libraries.)  The result was... much better code than
I had before.  I'm glad I made those refactors.

> In this example, there is another aspect: the fact that async execution 
> is explicitly limited to a specific scope. This is essentially the same 
> as startScheduler, and it is one of the options I was considering.
>
> Of course, startScheduler can be replaced with a construction like 
> async(function() { ... }).
> This means that async execution is only active within the closure, and 
> coroutines can only be created inside that closure.
>
> This is one of the semantic solutions that allows removing 
> startScheduler, but at the implementation level, it is exactly the 
> same.
>
> What do you think about this?

That looks mostly like the async block syntax I proposed, spelled differently.  The main difference
is that the body of the wrapped function would need to explicitly use any variables
from scope that it wanted, rather than getting them implicitly.  Whether that's good or bad is
probably subjective.

But it would allow for a syntax like this for the context, which is quite similar to how database
transactions are often done:

$val = async(function(AsyncContext $ctx) use ($stuff, $fn) {
  $result = [];
  foreach ($stuff as $item) {
    $result[] = $ctx->run($fn);
  }

  // We block/wait here until all subtasks are complete, then the async() call returns this value.
  return $result;
});

And of course in both cases you could use a pre-defined callable instead of inlining one.  At this
point I think it's mostly a stylistic difference, function vs block.
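For what it's worth, the function-style version can be roughly approximated today with Fibers.  The following is only a sketch under assumptions: FiberScope plays the role of the AsyncContext above, and async(), run() and drain() are hypothetical names.  Fiber::suspend() stands in for a transparent IO wait, and there is no real event loop, just round-robin resumption until every subtask has finished.

```php
<?php
declare(strict_types=1);

// Hypothetical scope object built on Fiber (PHP 8.1+).
final class FiberScope
{
    /** @var list<Fiber> */
    private array $fibers = [];

    // Start a subtask; it runs until its first suspension point.
    public function run(callable $task, mixed ...$args): Fiber
    {
        $fiber = new Fiber($task);
        $fiber->start(...$args);
        $this->fibers[] = $fiber;
        return $fiber;
    }

    // Resume subtasks round-robin until every one has terminated.
    // count() is re-read each pass so subtasks added mid-drain are included.
    public function drain(): void
    {
        do {
            $pending = false;
            for ($i = 0; $i < count($this->fibers); $i++) {
                $fiber = $this->fibers[$i];
                if ($fiber->isSuspended()) {
                    $fiber->resume();
                }
                $pending = $pending || !$fiber->isTerminated();
            }
        } while ($pending);
    }
}

// The scope exists only for the duration of $body, and async() does not
// return until every subtask started inside it has completed.
function async(callable $body): mixed
{
    $scope = new FiberScope();
    $result = $body($scope);
    $scope->drain();
    return $result;
}

$log = [];
$worker = function (string $name) use (&$log): string {
    $log[] = "$name start";
    Fiber::suspend(); // stand-in for a transparent IO wait
    $log[] = "$name end";
    return strtoupper($name);
};

$tasks = async(function (FiberScope $scope) use ($worker): array {
    return [$scope->run($worker, 'a'), $scope->run($worker, 'b')];
});

// Safe to read results here: async() guarantees both fibers have terminated.
$values = array_map(fn (Fiber $f): mixed => $f->getReturn(), $tasks);
print_r($log);    // a start, b start, a end, b end
print_r($values); // ['A', 'B']
```

Nothing can outlive the async() call, which is the structured guarantee under discussion; here the body hands back Fiber handles and the caller reads the values once the scope has closed.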

>> I'm not convinced that sticking arbitrary key/value pairs into the Context object is
>> wise;
>
> Why not? 
>
>> that's global state by another name
>
>   Static variables inside a function are also global state. Are you 
> against static variables?

Vocally, in fact. :-)

>> But if we must, the above would handle all the inheritance and override stuff quite
>> naturally. Possibly with:
>
>  How will a context with open string keys help preserve service data 
> that the service doesn't want to expose to anyone? The Key() solution 
> is essentially the same as Symbol in JS, which is used for the same 
> purpose. Of course, we could add a coroutine static $var construct to 
> the language syntax. But it's all the same, just syntactic sugar that 
> would require more code to support. 

I cannot speak to JS Symbols as I haven't used them.  I am just vehemently opposed to globals,
no matter how many layers they're wrapped in. :-)  Most uses could be replaced by proper DI or
partial application.
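To illustrate the partial-application option, here is a small sketch; fetchGreeting() and the locale value are made-up examples, not anything from the RFC.  The value that might otherwise be stashed under a key in a shared context object is instead bound once into a closure and passed along as an ordinary callable.

```php
<?php
declare(strict_types=1);

// Hypothetical service function: everything it needs arrives as a parameter.
function fetchGreeting(string $locale, string $user): string
{
    $templates = ['en' => 'Hello, %s', 'fr' => 'Bonjour, %s'];
    return sprintf($templates[$locale] ?? $templates['en'], $user);
}

// Partial application: fix $locale once, at the edge of the program,
// instead of reading it back out of a key/value context later.
$greetInFrench = fn (string $user): string => fetchGreeting('fr', $user);

echo $greetInFrench('Edmond'), PHP_EOL; // Bonjour, Edmond
```

Code that receives $greetInFrench never sees, and cannot overwrite, the locale.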

>> [$in, $out] = Channel::create($buffer_size);
>
> These semantics require the programmer to remember that two variables 
> actually point to the same object. If a function has multiple channels, 
> this makes the code quite verbose. Additionally, such channels are 
> inconvenient to store in lists because their structure becomes more 
> complex.
>
> I would suggest a slightly different solution:
>
> <code php>
> $in = new Channel()->getProducer();
> async myFunction($in->getConsumer());
> </code>
>
> These semantics do not restrict the programmer in usage patterns while 
> still allowing interaction with the channel through a well-defined 
> contract.

I'd go slightly differently if you wanted to go that route:

$ch = new Channel($buffer_size);
$in = $ch->producer();
$out = $ch->consumer();

// You do most interaction with $in and $out.

I could probably work with that as well.

(Or even just $ch->inPipe and $ch->outPipe, now that we have nice property support.)

But the overall point, I think, is avoiding implicit modal logic.  If my code doesn't need to
care if it's in an async world, it doesn't care.  If it does, then I need an explicit
async world to work within, rather than hoping that one implicitly exists.  And I
shouldn't have to think about "who owns this end of this channel".  I just have an in
and out hose I stick stuff into and pull out from, kthxbye.
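A rough sketch of the producer()/consumer() shape, with every name hypothetical (Producer, Consumer, send(), receive() are not from the RFC).  It is deliberately synchronous: where a real channel would suspend the current coroutine, this one just returns false or null, so the example runs without any scheduler.

```php
<?php
declare(strict_types=1);

// Write-only end of the channel.
final class Producer
{
    public function __construct(private SplQueue $queue, private int $capacity) {}

    // Returns false when the buffer is full; a real channel would suspend here.
    public function send(mixed $value): bool
    {
        if ($this->queue->count() >= $this->capacity) {
            return false;
        }
        $this->queue->enqueue($value);
        return true;
    }
}

// Read-only end of the channel.
final class Consumer
{
    public function __construct(private SplQueue $queue) {}

    // Returns null when empty; a real channel would suspend here.
    public function receive(): mixed
    {
        return $this->queue->isEmpty() ? null : $this->queue->dequeue();
    }
}

final class Channel
{
    private SplQueue $queue;

    public function __construct(private int $bufferSize)
    {
        $this->queue = new SplQueue();
    }

    // Both ends share the same queue object but expose only one direction each.
    public function producer(): Producer
    {
        return new Producer($this->queue, $this->bufferSize);
    }

    public function consumer(): Consumer
    {
        return new Consumer($this->queue);
    }
}

$ch = new Channel(2);
$in = $ch->producer();
$out = $ch->consumer();

$in->send('a');
$in->send('b');
$full = $in->send('c');   // false: the buffer of 2 is already full
$first = $out->receive(); // 'a'
```

A function handed only $in cannot read from the channel, and one handed only $out cannot write to it, so neither has to reason about who owns which end.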

> Thanks for the great examples, and a special thanks for the article.
> I also like the definition of context.
>
> Ed

--Larry Garfield
