Adding Pinned host buffer benchmarks #576

nirandaperera · 2025-10-10T21:31:35Z

This PR adds benchmarks for pinned host buffers.

Depends on #549

copy-pr-bot · 2025-10-10T21:31:38Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

nirandaperera · 2025-10-13T19:15:41Z

Results

System:
NVIDIA-SMI 580.82.07 Driver Version: 580.82.07 CUDA Version: 13.0

nvcc: NVIDIA (R) Cuda compiler driver
Cuda compilation tools, release 13.0, V13.0.88
Build cuda_13.0.r13.0/compiler.36424714_0

madsbk

Overall this looks good, but we need to run a smoke test in CI: https://github.com/rapidsai/rapidsmpf/blob/f67d8ecc3a5ff83d3d0bf92dc10f2be2a499ea24/ci/run_cpp_benchmark_smoketests.sh

wence-

The priming benchmark is measuring very misleading timings. Please also cull all the LLM-produced comments that just explain in words exactly what the next line of code says in code.

wence- · 2025-10-22T11:02:35Z

cpp/benchmarks/bench_pinned_memory_resources.cpp

+        auto latency_to_first = std::chrono::duration_cast<std::chrono::nanoseconds>(
+                                    first_allocation_time - start_time
+        )
+                                    .count();
+        auto first_round_duration_ns =
+            std::chrono::duration_cast<std::chrono::nanoseconds>(
+                first_round_end - start_time
+            )
+                .count();
+        auto second_round_duration_ns =
+            std::chrono::duration_cast<std::chrono::nanoseconds>(
+                second_round_end - first_round_end
+            )
+                .count();


These timings are at least partly nonsense. The latency to first makes sense. The first_round_duration kind of doesn't because we've synced after the very first allocation, but ok. I guess it makes kind of sense.

The second_round_duration includes the time to deallocate all of the allocations from the first round. This makes no sense.

nirandaperera · 2025-10-22

Oh yeah, good point. I was simply following the RMM async priming benchmark:
https://github.com/rapidsai/rmm/blob/branch-25.12/cpp/benchmarks/async_priming/async_priming_bench.cpp
I guess the same issue is present there as well.
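One way to address the timing concern raised in this thread is to take a separate timestamp once the first-round buffers have been freed, so that the second-round duration covers only allocations. The following is a minimal sketch under stated assumptions, not the benchmark in this PR: `time_priming`, `PrimingTimings`, and the `allocate`/`deallocate` callables are hypothetical names, and any stream synchronization the real pinned memory resource needs is left out.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdlib>
#include <vector>

using Clock = std::chrono::steady_clock;

// Hypothetical result type: one duration per benchmark phase.
struct PrimingTimings {
    std::chrono::nanoseconds latency_to_first;  // start -> first allocation done
    std::chrono::nanoseconds first_round;       // start -> all first-round allocations done
    std::chrono::nanoseconds dealloc;           // freeing the first-round allocations
    std::chrono::nanoseconds second_round;      // second-round allocations only
};

// `allocate(bytes)` returns a void* and `deallocate(ptr)` frees it. Both are
// stand-ins (assumptions) for whatever memory resource is being benchmarked.
template <typename Alloc, typename Dealloc>
PrimingTimings time_priming(
    Alloc&& allocate, Dealloc&& deallocate, std::size_t count, std::size_t bytes
) {
    assert(count > 0);
    auto to_ns = [](Clock::duration d) {
        return std::chrono::duration_cast<std::chrono::nanoseconds>(d);
    };
    std::vector<void*> ptrs;
    ptrs.reserve(count);

    auto const start_time = Clock::now();
    ptrs.push_back(allocate(bytes));
    auto const first_allocation_time = Clock::now();
    for (std::size_t i = 1; i < count; ++i) {
        ptrs.push_back(allocate(bytes));
    }
    auto const first_round_end = Clock::now();

    // Free the first round and timestamp afterwards, so this cost is
    // reported on its own instead of being folded into the second round.
    for (void* p : ptrs) {
        deallocate(p);
    }
    ptrs.clear();
    auto const dealloc_end = Clock::now();

    for (std::size_t i = 0; i < count; ++i) {
        ptrs.push_back(allocate(bytes));
    }
    auto const second_round_end = Clock::now();

    // Cleanup, outside every timed interval.
    for (void* p : ptrs) {
        deallocate(p);
    }

    return PrimingTimings{
        to_ns(first_allocation_time - start_time),
        to_ns(first_round_end - start_time),
        to_ns(dealloc_end - first_round_end),
        to_ns(second_round_end - dealloc_end)
    };
}
```

For a quick host-only check, the callables can wrap `std::malloc` and `std::free`; in a real benchmark they would wrap the memory resource under test, with a stream synchronization before each timestamp if the allocations are stream-ordered.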

nirandaperera added the "improvement" (Improves an existing functionality) and "non-breaking" (Introduces a non-breaking change) labels Oct 10, 2025

nirandaperera added 3 commits October 17, 2025 14:58

adding benchmark

30263cd

Signed-off-by: niranda perera <[email protected]>

adding more benchmarks

717e523

Signed-off-by: niranda perera <[email protected]>

more benchmarks

ddc5452

Signed-off-by: niranda perera <[email protected]>

nirandaperera force-pushed the pinned_host_buffer_bench branch from f6a2faf to ddc5452 Compare October 17, 2025 22:13

nirandaperera marked this pull request as ready for review October 17, 2025 22:13

nirandaperera requested review from a team as code owners October 17, 2025 22:13

madsbk requested changes Oct 20, 2025

nirandaperera added 2 commits October 21, 2025 15:50

Merge branch 'main' of github.com:rapidsai/rapidsmpf into pinned_host…

d10967f

…_buffer_bench

adding smoke test

18ec4b3

Signed-off-by: niranda perera <[email protected]>

nirandaperera requested a review from a team as a code owner October 21, 2025 23:00

nirandaperera requested review from madsbk and rockhowse October 21, 2025 23:00

wence- requested changes Oct 22, 2025

nirandaperera and others added 4 commits October 22, 2025 10:03

Apply suggestions from code review

ced5d0c

Co-authored-by: Lawrence Mitchell <[email protected]>

Apply suggestions from code review

796d808

Co-authored-by: Lawrence Mitchell <[email protected]>

addressiong comments

7abef89

Signed-off-by: niranda perera <[email protected]>

Merge branch 'main' of github.com:rapidsai/rapidsmpf into pinned_host…

5a3810f

…_buffer_bench

nirandaperera requested a review from wence- October 22, 2025 18:22
