I guess it means no "native support" for half precision. FWIW, Ada emulates FP16 using FP32, hence FP16 and FP32 have the same TFLOPS. In contrast, recent AMD architectures (RDNA, CDNA) have FP16 performance twice that of FP32.
Yeah, that's probably the reason. But it is still misleading, because of course one can do FP16 on Ada, and the Tensor Cores do have native FP16 support, so it's not like the card can't process or store FP16/BF16.
Is Tensor Core support possible in OpenCL?
Core: +200 MHz, VRAM +1000 MHz, Power limit: 600W
Platform: NVIDIA CUDA
Device: NVIDIA GeForce RTX 4090
Driver version : 531.61 (Win64)
Compute units : 128
Clock frequency : 2520 MHz
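For reference, here is a rough sketch of how the theoretical peak follows from the device info above. The compute unit count and clock come from the listing; the 128 FP32 lanes per compute unit (SM) is my assumption for Ada, and the 1:1 FP16:FP32 ratio is simply the claim made earlier in this thread, not something I measured. The function name `peak_tflops` is made up for illustration.

```python
def peak_tflops(compute_units, lanes_per_unit, clock_mhz, flops_per_lane_per_clock=2):
    """Theoretical peak in TFLOPS; one FMA per lane per clock counts as 2 FLOPs."""
    flops_per_second = (
        compute_units * lanes_per_unit * flops_per_lane_per_clock * clock_mhz * 1e6
    )
    return flops_per_second / 1e12


# Values from the device listing: 128 compute units at 2520 MHz (overclocked).
# Assumption: 128 FP32 lanes per compute unit on Ada.
fp32_peak = peak_tflops(compute_units=128, lanes_per_unit=128, clock_mhz=2520)

# Assumption taken from the thread: non-Tensor-Core FP16 runs at the FP32 rate (1:1).
fp16_to_fp32_ratio = 1.0
fp16_peak = fp32_peak * fp16_to_fp32_ratio

print(f"FP32 peak: {fp32_peak:.1f} TFLOPS")
print(f"FP16 peak: {fp16_peak:.1f} TFLOPS (assuming a {fp16_to_fp32_ratio:g}:1 ratio)")
```

On a device where FP16 really runs at twice the FP32 rate, setting `fp16_to_fp32_ratio = 2.0` gives the corresponding figure. Note this is a ceiling at the reported clock, not what a benchmark will actually reach.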