Skip to content

[Community] ARM64/Blackwell Docker Deployment for DGX Spark #78

@adrian-greenneuron

Description

@adrian-greenneuron

Hi OpenFold team,

I wanted to share my experience getting OpenFold3 running on the NVIDIA DGX Spark (Grace Blackwell GB10, ARM64). I couldn't find any existing examples of this setup, so I'm documenting what worked for me.

What I Built

A Docker deployment with the following fixes:

  1. DeepSpeed compute_121 error — NVCC doesn't recognize compute_121, so I patched op_builder/builder.py to map it to compute_120
  2. Triton kernels — Required triton-nightly to support sm_121
  3. JIT compilation overhead — Used a docker commit workflow to bake pre-compiled kernels into the image (reduces cold start from ~3 min to ~9 sec)

Benchmarks

Example Inference Time
Ubiquitin (76 residues) 9s
Protein-ligand (MCL1) 3m 16s

Repo

👉 https://github.com/adrian-greenneuron/openfold3-DGX-Spark


Questions for the community:

  • Has anyone else gotten OpenFold3 running on ARM64 or Blackwell GPUs?
  • Are there better approaches to the DeepSpeed/Triton compatibility issues I encountered?
  • Would it be useful to upstream any of these fixes?

I hope this helps others working with similar hardware. Feedback and suggestions are very welcome!

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions