WIP: DP4AMatMul fix matmul for subgroup size 64 GPUs #23637
Merged
Azure Pipelines / Windows CPU CI Pipeline (ort_training_apis_x64_release build_ort_training_apis_x64_release)
succeeded
Feb 12, 2025 in 45m 12s