Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gh-129236: Use stackpointer in free threaded GC #129240

Merged
merged 5 commits into from
Jan 29, 2025

Conversation

colesbury
Copy link
Contributor

@colesbury colesbury commented Jan 23, 2025

The stack pointers in interpreter frames are nearly always valid now, so use them when visiting each thread's frame. For now, don't collect objects with deferred references in the rare case that we see a frame with a NULL stack pointer.

The stack pointers in interpreter frames are nearly always valid now, so
use them when visiting each thread's frame. For now, don't collect
objects with deferred references in the rare case that we see a frame
with a NULL stack pointer.
{
_Py_FOR_EACH_TSTATE_BEGIN(interp, p) {
for (_PyInterpreterFrame *f = p->current_frame; f != NULL; f = f->previous) {
PyObject *executable = PyStackRef_AsPyObjectBorrow(f->f_executable);
if (executable == NULL || !PyCode_Check(executable)) {
if (f->owner >= FRAME_OWNED_BY_INTERPRETER) {
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I also updated this to match the condition in Python/gc.c.

@colesbury
Copy link
Contributor Author

Benchmark: 1% faster

The most extreme results are:

  • gc_traversal: 1.97x faster
  • create_gc_cycles: 1.05x slower

The impactful part of this change is that stack marking (from GH-128807) is much more effective now because we are also looking at non-deferred references. gc_traversal is mostly live objects, so the extra stack marking helps a lot. create_gc_cycles is mostly dead objects, so the extra stack marking doesn't help.

From some separate benchmarking, I think that skipping the extra frame initialization has a much smaller effect (close to 0.1%).


_PyStackRef *top = f->stackpointer;
if (top == NULL) {
// GH-129236: The stackpointer may be NULL. Skip this frame
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we know the cases when stackpointer can be NULL here?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see it can be because of PyStackRef_CLOSE, perhaps add that here too or hint to check gc_visit_thread_stacks_mark_alive?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've updated the comment.

@nascheme
Copy link
Member

This looks okay to me (although I'm not very familiar with how the stack currently works in ceval). Perhaps this comment can be expanded, since it took me a while to convince myself that it should work.

// GH-129236: Hack to avoid collect objects with deferred references
// if we see a frame without a valid stack pointer.

As I understand: the object can have one or more deferred references to it that are not accounted for in the refcnt value and will also not be traced by tp_traverse. By adding one to the "gc refs" value, we prevent the GC from incorrectly concluding this object is part of a garbage cycle (all other refs to it are accounted for).

@colesbury
Copy link
Contributor Author

@nascheme - I've updated the comment and logic in mark_heap_visitor. I got rid of the modification to the "gc refs". I hope that it's a bit more clear now.

@colesbury
Copy link
Contributor Author

@nascheme, would you please take another look at this?

@nascheme
Copy link
Member

I'm not sure about the expectation that f->stackpointer is either set correctly or is NULL. Based on reading discussion about it, I understand that's supposed to be the case. The changes made to gc_free_threading.c look good to me. So, I think this change is ready to merge.

Copy link
Contributor

@mpage mpage left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks correct to me, provided (as Neil said) the stack pointer is either always correct or NULL when the gc runs.

@colesbury colesbury merged commit 5ff2fbc into python:main Jan 29, 2025
44 checks passed
@colesbury colesbury deleted the gh-129236-gc-stackpointer branch January 29, 2025 15:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants