vm: rework the exception handling #1061

zerbina · 2023-12-21T23:32:21Z

Summary

Implement a different approach to exception handling in the VM, fixing
most of the issues around raise, except, and finally. This makes
the VM's exception handling implementation the most complete out of the
existing backends.

Details

Exception and finally handling was very incomplete, with many non-
simple cases not working (e.g., finally not intercepting breaks,
the current exception not being reset once handled, etc.).

New exception handling implementation:

instead of the safepoint-based approach (opcTry), a separate
instruction-to-exception-handler lookup table is used
each call frame uses its own lookup table; setting the lookup table
is done via the new SetEh instruction
the table stores mappings from instruction position to the position
of special-purpose exception handling (=EH) instructions
when an instruction raises, it is looked up in the table. If no
associated EH instruction is found, the call instruction in the above
frame is looked up, then the call instruction in the frame, etc.
if no EH instruction is found even after unwinding the call stack,
the exception is treated as unhandled and reported to the supervisor
if an EH instruction is found, an internal EH thread is spawned for
the raised exception and the EH instruction(s) are executed
the EH instructions describe which except and finally clauses are
visited and in what order
entering an except or finally clause from an EH thread sets the
current exception (the one returned by getCurrentException) to the
thread's associated exception

This affords for a large amount of flexibility with exception handling.

New finally implementation:

the Finally and FinallyEnd opcodes stay, but they now work
differently
each finally section is associated with a control register
the control register stores the information necessary for knowing
what to do when the end of the section (FinallyEnd) is reached
how execution continues at the end of a finally section depends on
how the Finally instruction was reached:
- if reached by normal control-flow, execution continues at the
  instruction designated by the Finally instruction
- if reached via an Enter instruction, execution continues at the
  instruction following the Enter instruction
- if reached from exception handling, the EH thread is resumed
jumping from outside a finally section to within one is forbidden

The Enter and Leave are two new instructions:

Enter is used for redirecting to finally sections and works as
described above
Leave is used for terminating open EH threads when exiting an
except clause or when exiting a finally clause through
unstructured control-flow (e.g., break)

When a break or return exits one or more try clauses with
attached finally clauses, vmgen emits an Enter instruction
targeting each, prior to the final jump. Similarly, vmgen emits an
Leave instruction for each finally and except clause exited
through unstructured control-flow.

Considered alternatives

Much of the mentioned changes could have also been implemented on-top
of the safe-point mechanism. This, however, would not allow supporting
clean-up-only control-flow (that is, all exception handler being
skipped and only certain finally section being executed), which, while
not used at the moment, could in the future become useful for
implementing panics.

Code generator details

a SetEh instruction is always emitted, even if a code fragment /
procedure has no instruction-to-EH mappings. This is a temporary
limitation
all IndCall and IndCallAsgn instructions (i.e., calls that
weren't lowered into dedicated instruction) are treated as raising
the EH instructions are emitted together with the normal bytecode;
no separate code generation pass is used

Future direction

The MIR (and subsequently the CgNode IR) is planned to use goto-
based control-flow primitives, similar to the ones that the VM uses.
This means that much of the semantics-related decision-making (e.g.,
where to insert Leave instructions) is going to move out of vmgen.

Tests

Multiple tests for things that previously didn't work with the VM are
added. In addition, the knownIssue marker is removed from multiple
exception tests that now succeed.

Summary ======= Implement a different approach to exception handling in the VM, fixing many (all?) of the issues around `raise`, `except`, and `finally`. This makes the VM's exception handling implementation the most complete out of the existing backends. Details ======= Exception and `finally` handling was very incomplete, with many non- simple cases not working (e.g., `finally` not intercepting `break`s, the current exception not being reset once handled, etc.). **New exception handling implementation:** * instead of the safepoint-based approach (`opcTry`), a separate instruction-to-exception-handler lookup table is used * each call frame uses its own lookup table; setting the lookup table is done via the new `SetEh` instruction * the table stores mappings from instruction position to the position of special-purpose exception handling (=EH) instructions * when an instruction raises, it is looked up in the table. If no associated EH instruction is found, the call instruction in the above frame is looked up, then the call instruction in the frame, etc. * if no EH instruction is found even after unwinding the call stack, the exception is treated as unhandled and reported to the supervisor * if an EH instruction is found, an internal EH thread is spawned for the raised exception and the EH instruction(s) are executed * the EH instructions describe which `except` and `finally` clauses are visited and in what order * entering an `except` or `finally` clause from an EH thread sets the current exception (the one returned by `getCurrentException`) to the thread's associated exception This affords for a large amount of flexibility with exception handling. **New `finally` implementation:** * the `Finally` and `FinallyEnd` opcodes stay, but they now work differently * each finally section is associated with a *control register* * the control register stores the information necessary for knowing what to do when the end of the section (`FinallyEnd`) is reached * how execution continues at the end of a finally section depends on how the `Finally` instruction was reached: * if reached by normal control-flow, execution continues at the instruction designated by the `Finally` instruction * if reached via an `Enter` instruction, execution continues at the instruction following the `Enter` instruction * if reached from exception handling, the EH thread is resumed * jumping from outside a finally section to within one is forbidden The `Enter` and `Leave` are two new instructions: * `Enter` is used for redirecting to finally sections and works as described above * `Leave` is used for terminating open EH threads when exiting an `except` clause or when exiting a `finally` clause through unstructured control-flow (e.g., `break`) When a `break` or `return` exits one or more `try` clauses with attached `finally` clauses, `vmgen` emits an `Enter` instruction targeting each, prior to the final jump. Similarly, `vmgen` emits an `Leave` instruction for each `finally` and `except` clause exited through unstructured control-flow. Considered alternatives ----------------------- Much of the mentioned changes could have also been implemented on-top of the safe-point mechanism. This, however, would not allow supporting clean-up-only control-flow (that is, all exception handler being skipped and only certain finally section being executed), which, while not used at the moment, could in the future become useful for implementing panics. Code generator details ---------------------- * a `SetEh` instruction is always emitted, even if a code fragment / procedure, has no instruction-to-EH mappings. This is a temporary limitation * all `IndCall` and `IndCallAsgn` instructions (i.e., calls that weren't lowered into dedicated instruction) are treated as raising * the EH instructions are emitted together with the normal bytecode; no separate code generation pass is used Future direction ---------------- The MIR (and subsequently the `CgNode` IR) is planned to use goto- based control-flow primitives, similar to the ones that the VM uses. This means that much of the semantics-related decision-making (e.g., where to insert `Leave` instructions) is going to move out of `vmgen`.

`vm.setCurrentException` always incremented the provided refs ref- counter, even if the ref was nil (which is a valid case). An `isNotNil` guard is added to prevent the ref-counter being incremented.

PR-local bug: exiting a finally section via `FinallyEnd` didn't continue stack frame unwinding, which would result in execution being aborted, even though there were exception handlers present. `opRaise` is split up into two procedures: `findEh` (find the EH code associated with an instruction) and `resumeEh` (execute the EH thread and unwind). This is so that `FinallyEnd` can use `resumeEh` to unwind correctly.

The tests succeed with the VM now.

The `SetEh` instruction patching was implemented incorrectly, resulting in incorrect operand value when the lower bound is >= 256.

It doesn't work at the moment, but it's also not entirely clear whether it actually should.

`initIntReg` changed to doing the register cleanup itself.

zerbina · 2024-02-04T19:04:58Z

There's is an exception-related situation that none of the backends handle correctly: re-raising the active exception and immediately catching it within a finally clause (refer to tests/exception/thandle_in_finally.nim). The expectations made across the compiler and code generators is that:

if a finally intercepts non-exceptional control-flow and the end of the finally is reached, control-flow resumes at the original target (or next intercepting finally
if a finally intercepts exceptional control-flow and the end of the finally is reached, the exception is further

Handling the active exception within the finally clause makes resuming raising the exception when reaching the end of the finally impossible, since there's no more active exception. The only sensible thing to do in this case would be to "fall" out of the finally, but this goes against expectation number 2, and none of the backends properly supports this at the moment.

My personal opinion is that re-raising outside of an except clause should be statically disallowed, which would resolve the above problem. In addition, a re-raise would no longer be able to raise a ReraiseDefect, since it's guaranteed at compile-time that an exception to re-raise exists.

proc reraise() = raise

try:
  raise CatchableError.newException("")
except:
  reraise()

would stop working too, but I don't think that this would be a bad thing.

saem

The test case coverage expansion is especially awesome.

I have a few minor suggestions, mostly soft.

tests/exception/tfinally6.nim

tests/exception/thandle_in_finally.nim

saem · 2024-02-04T21:32:40Z

tests/exception/thandle_in_finally.nim

+  '''
+"""
+
+# XXX: it's questionable whether this really should work / be allowed


I'm thinking this through myself:

on line 17 we decide to 'cancel the current computation', targeting the handler on line 29

on line 20 we decide to 'retarget the cancellation handler', now on line 21

upon leaving the line 21 cancellation handler, we decide to continue the rest of the computation

Reading through my characterization, I think the 'precise' question is, should a finally be allowed re-parent an existing exception (cancellation)? I believe the answer is "yes, it should" because if not then, the property we'd be after is that finally cannot raise, which seems infeasible/not very useful. I consider this a variant of raising an entirely new exception that overrides the old one, except in this case our 'new exception' 'adopts' the data of the old one.

Hm, I'm not sure I completely follow. A finally could be allowed to raise without having to allow re-targeting an existing exception to a handler within the finally (e.g., by disallowing re-raise outside of except branches.

In your model, would line 28 be executed?

In your model, would line 28 be executed?

That's a great question, I want to say yes. But that would mean we entered the finally through exceptional means, it handled the exception internally, then the treatment would have to be flipped to non-exceptional.

Which seems to defer the determination of how to leave a finally that much more. Where each control flow branch within a finally can change the handling... ooph.

Perhaps a static restriction is a better bet, because we can always lift the restriction later when/if we're willing to do that extra analysis and think through all the implications (extra accumulated experience in the future will also be beneficial).

Which seems to defer the determination of how to leave a finally that much more. Where each control flow branch within a finally can change the handling... ooph.

Yeah, this is the reason why I'd prefer not supporting the re-targeting.

Perhaps a static restriction is a better bet, because we can always lift the restriction later when/if we're willing to do that extra analysis and think through all the implications (extra accumulated experience in the future will also be beneficial).

Okay, in that case I would make a follow up that statically disallows a re-raise outside of except branches.

Awesome, and thanks for working through the reasoning with me.

tests/exception/tleave_except.nim

tests/exception/traise_and_handle_in_except.nim

Co-authored-by: Saem Ghani <[email protected]>

* reword the description to better highlight the uncertainty * change the test's success condition to all three steps * fix the test running forever due the signal handler being entered over and over

zerbina · 2024-02-05T01:13:35Z

I've changed the test description of thandle_in_finally.nim a bit, and also updated the expected sequence to be 1, 2, 3 (for the case that handling the exception within the finally branch should be supported one day).

There's was also the problem of the test running forever (an infinite loop caused by the SIGSEGV signal handler triggering itself), which I fixed by disabling the custom signal handlers (-d:noSignalHandler).

zerbina · 2024-02-05T02:08:31Z

Thank you for the review, Saem.

saem · 2024-02-05T18:13:10Z

/merge

github-actions · 2024-02-05T18:13:35Z

Merge requested by: @saem

Contents after the first section break of the PR description has been removed and preserved below:

To-Do

fix the remaining issue where exception propagation doesn't continue after the last finally in a procedure

enable the exceptions category for the VM target (separate PR)

improve the exception/finally related test suite ~~(possibly by finishing Add language specification for finally clause #256)~~

Notes for Reviewers

a major prerequisite for reworking the MIR control-flow primitives

zerbina added bug Something isn't working enhancement New feature or request compiler/vm Embedded virtual machine labels Dec 21, 2023

zerbina added this to the VM refactoring and improvements milestone Dec 21, 2023

zerbina mentioned this pull request Jan 7, 2024

tests: enable the exceptions category for all targets #1082

Merged

zerbina added 9 commits February 2, 2024 16:19

Merge branch 'devel' into vm-rework-exception-handling

453140c

vm: fix illegal ref-counter increment

d632598

`vm.setCurrentException` always incremented the provided refs ref- counter, even if the ref was nil (which is a valid case). An `isNotNil` guard is added to prevent the ref-counter being incremented.

tests: remove knownIssue marker from some tests

3b8c28a

The tests succeed with the VM now.

vmgen: fix wrong SetEh patching

812bb09

The `SetEh` instruction patching was implemented incorrectly, resulting in incorrect operand value when the lower bound is >= 256.

tests: add tests for the fixed issues

d33acc5

tests: add a test for the "reraise-and-handle-in-finally" case

6e9f7ec

It doesn't work at the moment, but it's also not entirely clear whether it actually should.

Merge branch 'devel' into vm-rework-exception-handling

5830868

post-merge fix-up

8d68357

`initIntReg` changed to doing the register cleanup itself.

saem approved these changes Feb 4, 2024

View reviewed changes

zerbina marked this pull request as draft February 4, 2024 22:36

zerbina and others added 3 commits February 4, 2024 23:37

fix multiple typos in comments

1b51e92

Co-authored-by: Saem Ghani <[email protected]>

tests: add extra assertion

960185a

Co-authored-by: Saem Ghani <[email protected]>

tests: update the thandle_in_finally test

7a0a245

* reword the description to better highlight the uncertainty * change the test's success condition to all three steps * fix the test running forever due the signal handler being entered over and over

zerbina marked this pull request as ready for review February 5, 2024 01:14

chore-runner bot added this pull request to the merge queue Feb 5, 2024

Merged via the queue into nim-works:devel with commit 752029c Feb 5, 2024
40 checks passed

zerbina mentioned this pull request Feb 6, 2024

only allow re-raise in scope of except #1170

Merged

zerbina deleted the vm-rework-exception-handling branch February 8, 2024 17:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vm: rework the exception handling #1061

vm: rework the exception handling #1061

zerbina commented Dec 21, 2023 •

edited by github-actions bot

Loading

zerbina commented Feb 4, 2024

saem left a comment

saem Feb 4, 2024

zerbina Feb 4, 2024

saem Feb 4, 2024 •

edited

Loading

zerbina Feb 5, 2024

saem Feb 5, 2024

zerbina commented Feb 5, 2024

zerbina commented Feb 5, 2024

saem commented Feb 5, 2024

github-actions bot commented Feb 5, 2024

To-Do

Notes for Reviewers

vm: rework the exception handling #1061

vm: rework the exception handling #1061

Conversation

zerbina commented Dec 21, 2023 • edited by github-actions bot Loading

Summary

Details

Considered alternatives

Code generator details

Future direction

Tests

zerbina commented Feb 4, 2024

saem left a comment

Choose a reason for hiding this comment

saem Feb 4, 2024

Choose a reason for hiding this comment

zerbina Feb 4, 2024

Choose a reason for hiding this comment

saem Feb 4, 2024 • edited Loading

Choose a reason for hiding this comment

zerbina Feb 5, 2024

Choose a reason for hiding this comment

saem Feb 5, 2024

Choose a reason for hiding this comment

zerbina commented Feb 5, 2024

zerbina commented Feb 5, 2024

saem commented Feb 5, 2024

github-actions bot commented Feb 5, 2024

To-Do

Notes for Reviewers

zerbina commented Dec 21, 2023 •

edited by github-actions bot

Loading

saem Feb 4, 2024 •

edited

Loading