scx_layered: Implement sticky modulation optimization #1690
base: main
Conversation
Signed-off-by: Kumar Kartikeya Dwivedi <[email protected]>
```c
	if (!active_sticky_mod)
		return cpu;

	cpu_ctx = lookup_cpu_ctx(prev_cpu);
```
Nit: Factoring out into a small inline function can avoid all the gotos.
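A minimal sketch of what that could look like; the helper name `sticky_mod_pick_cpu` and the trailing checks are illustrative, not the PR's actual code:

```c
/* Illustrative only: wrapping the sticky-mod decision in a small helper
 * lets each failure path return directly instead of jumping to a label. */
static __always_inline s32 sticky_mod_pick_cpu(s32 prev_cpu, s32 cpu)
{
	struct cpu_ctx *cpu_ctx;

	if (!active_sticky_mod)
		return cpu;
	if (!(cpu_ctx = lookup_cpu_ctx(prev_cpu)))
		return cpu;
	/* ... remaining sticky-mod checks, each returning directly ... */
	return prev_cpu;
}
```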
```diff
@@ -1195,6 +1245,55 @@ static void layer_kick_idle_cpu(struct layer *layer)
 	scx_bpf_put_idle_cpumask(idle_smtmask);
 }
 
+SEC("tp_btf/sched_switch")
```
What is the overhead of the extra probe, both from having another probe attached and from the code itself? Can we roll this into our starting/stopping methods?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The probe itself should be pretty fast, but I think longer term hooking into starting/stopping methods is the better way, so I will do that.
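For context, a `tp_btf/sched_switch` probe has roughly this shape (a hedged sketch; the program name is hypothetical and the body is a placeholder, not the PR's accounting logic):

```c
#include <vmlinux.h>
#include <bpf/bpf_helpers.h>
#include <bpf/bpf_tracing.h>

char _license[] SEC("license") = "GPL";

/* Fires on every context switch system-wide, which is why it also sees
 * tasks that belong to other schedulers. */
SEC("tp_btf/sched_switch")
int BPF_PROG(sticky_mod_sched_switch, bool preempt,
	     struct task_struct *prev, struct task_struct *next)
{
	/* Placeholder: the real probe would update per-CPU sticky-mod
	 * stats for prev/next here. */
	return 0;
}
```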
```c
llc:
	if (!(cpumask = cast_mask(llc->cpumask)))
		goto out;
	bpf_for(i, 0, nr_possible_cpus) {
```
(Comment: I'm surprised we don't have an FFS-based for-each-CPU-in-cpumask primitive.)
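For illustration, FFS-style iteration over a single 64-bit mask word looks like the plain-C sketch below; a real cpumask spans multiple words, and a BPF version would additionally need a verifier-friendly bounded loop:

```c
/* Plain-C sketch of FFS-style bit iteration over one mask word. */
u64 word = mask_word;	/* assumed: one 64-bit word of the cpumask */

while (word) {
	u32 cpu = __builtin_ctzll(word);	/* index of lowest set bit */
	word &= word - 1;			/* clear that bit */
	/* visit `cpu` ... */
}
```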
Yeah, if there's a better way to do this, I'm all ears. It's suboptimal, especially on machines with a lot of CPUs. Even on Bergamo, we're iterating over 88 CPUs for every 8 we want to test.
One thing that comes to mind is storing start and end CPUs in llc_ctx, but that requires the assignment to be sequential.
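A hypothetical sketch of that idea, with illustrative field names, valid only if each LLC owns a contiguous range of CPU ids:

```c
struct llc_ctx {
	u32 cpu_start;	/* first CPU id in this LLC */
	u32 cpu_end;	/* one past the last CPU id */
	/* ... existing fields ... */
};

bpf_for(i, llcx->cpu_start, llcx->cpu_end) {
	/* every i belongs to this LLC; no cpumask test needed */
}
```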
layered stores iteration indices in per-cpu/llc arrays. It's a bit more setup work but overall not that bad. But yeah, bit-wise iterators would be great.
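A hedged sketch of that per-LLC index-array pattern; the array names, sizes, and setup are illustrative, not layered's actual layout:

```c
/* Illustrative: populate these once at init from the LLC topology. */
u32 llc_cpu_order[MAX_LLCS][MAX_CPUS];	/* CPU ids grouped by LLC */
u32 llc_nr_cpus[MAX_LLCS];		/* how many CPUs each LLC has */

bpf_for(i, 0, llc_nr_cpus[llc_id]) {
	s32 cpu = llc_cpu_order[llc_id][i];
	/* only CPUs actually in this LLC are visited */
}
```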
Looks good overall. My main nit would be rolling the stat update logic into our own methods instead of a sched_switch-based probe; it's both cleaner (it doesn't grab tasks that belong to other schedulers) and more consistent.
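A rough sketch of what hooking the scheduler's own callbacks could look like; the signatures follow sched_ext struct_ops conventions, and the bodies are placeholders:

```c
void BPF_STRUCT_OPS(layered_running, struct task_struct *p)
{
	/* Task p starts executing on a CPU owned by this scheduler:
	 * record the start timestamp for sticky-mod accounting. */
}

void BPF_STRUCT_OPS(layered_stopping, struct task_struct *p, bool runnable)
{
	/* Task p stops executing: fold the elapsed time into the stats. */
}
```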
IIUC, the default is what's done currently, i.e. without this flag, right? If so, this is good. Could you update the documentation to suggest a reasonable default, or set the default to disabled and add an enable flag which, when passed, defaults to a reasonable value?