Skip to content
This repository was archived by the owner on Aug 31, 2025. It is now read-only.
This repository was archived by the owner on Aug 31, 2025. It is now read-only.

[FEATURE] DotMP.GPU.Parallel.ParallelForReduction #132

@computablee

Description

@computablee

Is your feature request related to a problem? Please describe.

There needs to be an analog to DotMP.Parallel.ParallelForReduction in the GPU API.

Describe the solution you'd like.

We should implement a GPU-based tree reduction, since that will run in O(log n) time. Nvidia has a good slide deck on that. This can be implemented behind-the-scenes, though we need to determine how to handle scalars on the GPU, since currently the only manner of data transfer is via the GPU.Buffer object, which only supports arrays.

Additional context.

We might need a PR first which handles scalars to/from the GPU.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requestgpuDeals with the GPU API

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions