CellularSyntax

This PR improves the build logic in cuda_init.py to fix platform-specific issues during CUDA extension compilation on Windows.

Fixes:

  • Splits -Xptxas flags on Windows to avoid nvcc errors
  • Uses combined flags on Linux, where this is safe
  • Handles extra_ldflags per platform (/LIBPATH on Windows vs. -L on Linux)
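
As a rough illustration of the three points above, here is a minimal sketch of platform-dependent flag construction. It is not the code from cuda_init.py: the helper name build_flags, its parameters, and the example paths are hypothetical. It also assumes that "combined" means a single comma-separated -Xptxas argument, which nvcc accepts, and "split" means one -Xptxas per option.

```python
import sys


def build_flags(ptxas_opts, lib_dir, platform=sys.platform):
    """Return (extra_cuda_cflags, extra_ldflags) for the given platform.

    Hypothetical helper for illustration; not taken from cuda_init.py.
    """
    if platform.startswith("win"):
        # Windows: pass each ptxas option with its own -Xptxas prefix.
        cuda_cflags = []
        for opt in ptxas_opts:
            cuda_cflags += ["-Xptxas", opt]
        # The MSVC linker expects /LIBPATH:<dir> for library search paths.
        ldflags = [f"/LIBPATH:{lib_dir}"]
    else:
        # Linux (assumed): one -Xptxas followed by a comma-separated list.
        cuda_cflags = ["-Xptxas", ",".join(ptxas_opts)] if ptxas_opts else []
        # GNU-style linkers expect -L<dir>.
        ldflags = [f"-L{lib_dir}"]
    return cuda_cflags, ldflags


if __name__ == "__main__":
    # Example paths are placeholders, not real install locations.
    print(build_flags(["-O3", "-v"], "C:/cuda/lib/x64", platform="win32"))
    print(build_flags(["-O3", "-v"], "/usr/local/cuda/lib64", platform="linux"))
```

The resulting lists could then be passed as the extra_cuda_cflags and extra_ldflags arguments of torch.utils.cpp_extension.load, assuming that is how cuda_init.py builds the extension.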

Tested on: Windows 11 (Anaconda, CUDA 12.8, PyTorch 2.4.0)
Not tested on Linux, but preserves the original behavior for Linux systems.
