-
NVIDIA
- Sunnyvale, CA
Highlights
- Pro
Popular repositories Loading
-
cppnow_presentations_2016
cppnow_presentations_2016 PublicPresentation materials for C++Now 2016
-
cub_historical_2019_2020
cub_historical_2019_2020 PublicForked from NVIDIA/cub
Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.
-
wg21_p2300_execution
wg21_p2300_execution Public`std::execution`, the proposed C++ framework for asynchronous and parallel programming.
-
2016_berkeley_cpp_summit_presentations
2016_berkeley_cpp_summit_presentations PublicPresentation materials for the 2016 Berkeley C++ Summit
-
boost.lockfree
boost.lockfree PublicHartmut Kaiser and Bryce Lelbach's version of Boost.Lockfree
144 contributions in the last year
Day of Week | March Mar | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | |||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Contribution activity
February 2025
Created 2 commits in 1 repository
Created a pull request in NVIDIA/cccl that received 4 comments
[CUB/docs] Add missing closing braces to BlockReduce
kernel examples in CUB docs.
All of the CUB BlockReduce
kernel examples are missing a closing brace. This is a documentation only change.
Reviewed 1 pull request in 1 repository
NVIDIA/cccl
1 pull request
-
[cuda.cooperative] Add missing overloads to block.reduce and block.sum
This contribution was made on Feb 5
Created an issue in NVIDIA/cccl that received 1 comment
Add CUB block reduce primitive that returns the reduced result in all threads
We need an AllReduce
/AllSum
that performs a reduction distributes the reduction across the entire block. I need this exposed in both CUB and cuda.c…
Opened 7 other issues in 2 repositories
NVIDIA/cccl
5
open
1
closed
-
Accept NumPy types in cuda.cooperative
This contribution was made on Feb 22
-
[BUG]: cuda.cooperative should validate the sizes of arrays passed to CUB APIs that takes a fixed-size C array
This contribution was made on Feb 22
-
[BUG]: cuda.cooperative passes string dtype parameters through to C++
This contribution was made on Feb 22
-
s/threads_in_block/threads_per_block/ in cuda.cooperative
This contribution was made on Feb 22
-
CUB has no tests for multidimensional BlockLoad/BlockStore
This contribution was made on Feb 22
-
[BUG]: In cuda.cooperative linking the same algorithm twice leads to multiple definitions
This contribution was made on Feb 22
NVIDIA/numba-cuda
1
open
-
[FEA] Decompose Dim3s (
x, y, z = cuda.blockDim
)This contribution was made on Feb 22