@@ -16,12 +16,12 @@ summary of the motivation and animated sketch of the design in action.
16
16
* [ Task] ( #task )
17
17
* [ Current task] ( #current-task )
18
18
* [ Context-Local Storage] ( #context-local-storage )
19
- * [ Subtask and Supertask] ( #subtask-and-supertask )
20
19
* [ Structured concurrency] ( #structured-concurrency )
21
20
* [ Streams and Futures] ( #streams-and-futures )
22
21
* [ Waiting] ( #waiting )
23
22
* [ Backpressure] ( #backpressure )
24
23
* [ Returning] ( #returning )
24
+ * [ Borrows] ( #borrows )
25
25
* [ Async ABI] ( #async-abi )
26
26
* [ Async Import ABI] ( #async-import-abi )
27
27
* [ Async Export ABI] ( #async-export-abi )
@@ -251,70 +251,64 @@ reason why "context-local" storage is not called "task-local" storage (where a
251
251
For details, see [ ` context.get ` ] in the AST explainer and [ ` canon_context_get ` ]
252
252
in the Canonical ABI explainer.
253
253
254
- ### Subtask and Supertask
255
-
256
- Each component-to-component call necessarily creates a new task in the callee.
257
- The callee task is a ** subtask** of the calling task (and, conversely, the
258
- calling task is a ** supertask** of the callee task. This sub/super relationship
259
- is immutable and doesn't change over time (until the callee task completes and
260
- is destroyed).
261
-
262
- The Canonical ABI's Python code represents the subtask relationship between a
263
- caller ` Task ` and a callee ` Task ` via the Python [ ` Subtask ` ] class. Whereas a
264
- ` Task ` object is created by each call to [ ` canon_lift ` ] , a ` Subtask ` object is
265
- created by each call to [ ` canon_lower ` ] . This allows ` Subtask ` s to store the
266
- state that enforces the caller side of the Canonical ABI rules.
267
-
268
254
### Structured concurrency
269
255
270
- To realize the above goals of always having a well-defined cross-component
271
- async callstack, the Component Model's Canonical ABI enforces [ Structured
272
- Concurrency] by dynamically requiring that a task waits for all its subtasks to
273
- [ return] ( #returning ) before the task itself is allowed to finish. This means
274
- that a subtask cannot be orphaned and there will always be an async callstack
275
- rooted at an invocation of an export by the host. Moreover, at any one point in
276
- time, the set of tasks active in a linked component graph form a forest of
277
- async call trees which e.g., can be visualized using a traditional flamegraph.
278
-
279
- The Canonical ABI's Python code enforces Structured Concurrency by incrementing
280
- a per-task "` num_subtasks ` " counter when a subtask is created, decrementing
281
- when the subtask [ returns] ( #returning ) , and trapping if ` num_subtasks > 0 ` when
282
- a task attempts to exit.
283
-
284
- There is a subtle nuance to these Structured Concurrency rules deriving from
285
- the fact that subtasks may continue execution after [ returning] ( #returning )
286
- their value to their caller. The ability to execute after returning value is
287
- necessary for being able to do work off the caller's critical path. A concrete
288
- example is an HTTP service that does some logging or billing operations after
289
- finishing an HTTP response, where the HTTP response is the return value of the
290
- [ ` wasi:http/handler.handle ` ] function. Since the ` num_subtasks ` counter is
291
- decremented when a subtask * returns* (as opposed to * exits* ), this means that
292
- subtasks may continue execution even once their supertask has exited. To
293
- maintain Structured Concurrency (for purposes of checking [ reentrance] ,
294
- scheduler prioritization and debugging/observability), we can consider
295
- the supertask to still be alive but in the process of "asynchronously
296
- tail-calling" its still-executing subtasks. (For scenarios where one
297
- component wants to non-cooperatively bound the execution of another
298
- component, a separate "[ blast zone] " feature is necessary in any
299
- case.)
300
-
301
- This async call tree provided by Structured Concurrency interacts naturally
302
- with the ` borrow ` handle type and its associated dynamic rules for preventing
303
- use-after-free. When a caller initially lends an ` own ` ed or ` borrow ` ed handle
304
- to a callee, a "` num_lends ` " counter on the lent handle is incremented when the
305
- subtask starts and decremented when the caller is notified that the subtask has
306
- [ returned] ( #returning ) . If the caller tries to drop a handle while the handle's
307
- ` num_lends ` is greater than zero, it traps. Symmetrically, each ` borrow ` handle
308
- passed to a callee task increments a "` num_borrows ` " counter on the task that
309
- is decremented when the ` borrow ` handle is dropped. With async calls, there can
310
- of course be multiple overlapping async tasks and thus ` borrow ` handles must
311
- remember which particular task's ` num_borrows ` counter to drop. If a task
312
- attempts to return (which, for ` async ` tasks, means calling ` task.return ` ) when
313
- its ` num_borrows ` is greater than zero, it traps. These interlocking rules for
314
- the ` num_lends ` and ` num_borrows ` fields inductively ensure that nested async
315
- call trees that transitively propagate ` borrow ` ed handles maintain the
316
- essential invariant that dropping an ` own ` ed handle never destroys a resource
317
- while there is any ` borrow ` handle anywhere pointing to that resource.
256
+ Calling * into* a component creates a ` Task ` to track ABI state related to the
257
+ * callee* (like "number of outstanding borrows"). Calling * out* of a component
258
+ creates a ` Subtask ` to track ABI state related to the * caller* (like "which
259
+ handles have been lent"). When one component calls another, there is thus a
260
+ ` Subtask ` +` Task ` pair that collectively maintains the overall state of the call
261
+ and enforces that both components uphold their end of the ABI contract. But
262
+ when the host calls into a component, there is only a ` Task ` and,
263
+ symmetrically, when a component calls into the host, there is only a ` Subtask ` .
264
+
265
+ Based on this, the call stack at any point in time when a component calls a
266
+ host-defined import will have a callstack of the general form:
267
+ ```
268
+ [Host caller] <- [Task] <- [Subtask+Task]* <- [Subtask] <- [Host callee]
269
+ ```
270
+ Here, the ` <- ` arrow represents the ` supertask ` relationship that is immutably
271
+ established when first making the call. A paired ` Subtask ` and ` Task ` have the
272
+ same ` supertask ` and can thus be visualized as a single node in the callstack.
273
+
274
+ (These concepts are represented in the Canonical ABI Python code via the
275
+ [ ` Task ` ] and [ ` Subtask ` ] classes.)
276
+
277
+ One semantically-observable use of this async call stack is to distinguish
278
+ between hazardous ** recursive reentrance** , in which a component instance is
279
+ reentered when one of its tasks is already on the callstack, from
280
+ business-as-usual ** sibling reentrance** , in which a component instance is
281
+ freshly reentered when its other tasks are suspended waiting on I/O. Recursive
282
+ reentrance currently always traps, but may be allowed (and indicated to core
283
+ wasm) in an opt-in manner in the [ future] ( #TODO ) .
284
+
285
+ The async call stack is also useful for non-semantic purposes such as providing
286
+ backtraces when debugging, profiling and distributed tracing. While particular
287
+ languages can and do maintain their own async call stacks in core wasm state,
288
+ without the Component Model's async call stack, linkage * between* different
289
+ languages would be lost at component boundaries, leading to a loss of overall
290
+ context in multi-component applications.
291
+
292
+ There is an important nuance to the Component Model's minimal form of
293
+ Structured Concurrency compared to Structured Concurrency support that appears
294
+ in popular source language features/libraries. Often, "Structured Concurrency"
295
+ refers to an invariant that all "child" tasks finish or are cancelled before a
296
+ "parent" task completes. However, the Component Model doesn't force subtasks to
297
+ [ return] ( #returning ) or be cancelled before the supertask returns (this is left
298
+ as an option to particular source langauges to enforce or not). The reason for
299
+ not enforcing a stricter form of Structured Concurrency at the Component
300
+ Model level is that there are important use cases where forcing a supertask to
301
+ stay resident simply to wait for a subtask to finish would waste resources
302
+ without tangible benefit. Instead, we can say that once the core wasm
303
+ implementing a supertask finishes execution, the supertask semantically "tail
304
+ calls" any still-live subtasks, staying technically-alive until they complete,
305
+ but not consuming real resources. Concretely, this means that a supertask that
306
+ finishes executing stays on the callstack of any still-executing subtasks for
307
+ the abovementioned purposes until all transitive subtasks finish.
308
+
309
+ For scenarios where one component wants to * non-cooperatively* put an upper
310
+ bound on execution of a call into another component, a separate "[ blast zone] "
311
+ feature is necessary in any case (due to iloops and traps).
318
312
319
313
### Streams and Futures
320
314
@@ -486,6 +480,28 @@ A task may not call `task.return` unless it is in the "started" state. Once
486
480
finish once it is in the "returned" state. See the [ ` canon_task_return ` ]
487
481
function in the Canonical ABI explainer for more details.
488
482
483
+ ### Borrows
484
+
485
+ Component Model async support is careful to ensure that ` borrow ` ed handles work
486
+ as expected in an asynchronous setting, extending the dynamic enforcement used
487
+ for synchronous code:
488
+
489
+ When a caller initially lends an ` own ` ed or ` borrow ` ed handle to a callee, a
490
+ ` num_lends ` counter on the lent handle is incremented when the subtask starts
491
+ and decremented when the caller is notified that the subtask has
492
+ [ returned] ( #returning ) . If the caller tries to drop a handle while the handle's
493
+ ` num_lends ` is greater than zero, the caller traps. Symmetrically, each
494
+ ` borrow ` handle passed to a callee increments a ` num_borrows ` counter on the
495
+ callee task that is decremented when the ` borrow ` handle is dropped. If a
496
+ callee task attempts to return when its ` num_borrows ` is greater than zero, the
497
+ callee traps.
498
+
499
+ In an asynchronous setting, the only generalization necessary is that, since
500
+ there can be multiple overlapping async tasks executing in a component
501
+ instance, a borrowed handle must track * which* task's ` num_borrow ` s was
502
+ incremented so that the correct counter can be decremented.
503
+
504
+
489
505
## Async ABI
490
506
491
507
At an ABI level, native async in the Component Model defines for every WIT
@@ -962,6 +978,7 @@ comes after:
962
978
[ `Task.enter` ] : CanonicalABI.md#task-state
963
979
[ `Task.wait` ] : CanonicalABI.md#task-state
964
980
[ `Waitable` ] : CanonicalABI.md#waitable-state
981
+ [ `Task` ] : CanonicalABI.md#task-state
965
982
[ `Subtask` ] : CanonicalABI.md#subtask-state
966
983
[ Stream State ] : CanonicalABI.md#stream-state
967
984
[ Future State ] : CanonicalABI.md#future-state
0 commit comments