[pull] master from ggml-org:master by pull[bot] · Pull Request #1159 · LongLeCE/llama.cpp

pull · 2026-05-08T08:42:02Z

See Commits and Changes for more details.

Created by pull[bot] (v2.0.0-alpha.4)

Can you help keep this open source service alive? 💖 Please sponsor : )

* convert : fix RuntimeError when stripping FP8 KV-cache scales In ModelBase._generate_nvfp4_tensors the final cleanup loop iterates self.model_tensors.keys() and calls del on the same dict, which raises RuntimeError: dictionary changed size during iteration when a ModelOpt NVFP4 model also has FP8 KV-cache scales (e.g. mmangkad/Qwen3.6-35B-A3B-NVFP4 and any modelopt config with kv_cache_quant_algo: FP8). Wrap the keys view in list() so the deletions happen on a snapshot. * re-add another accidentally removed list --------- Co-authored-by: Sigbjørn Skjæret <[email protected]>

* Q4_0 MoE CLC pass sanity check * release program * opencl: fix whitespace * opencl: remove unused cl_program * opencl: break #if block to make it more clear * opencl: adjust format --------- Co-authored-by: Li He <[email protected]>

arthw and others added 6 commits May 8, 2026 06:54

fix script error (#22795sycl : )

6a2a251

opencl: add q4_0 MoE GEMM for Adreno (#22731)

f3e8d14

* Q4_0 MoE CLC pass sanity check * release program * opencl: fix whitespace * opencl: remove unused cl_program * opencl: break #if block to make it more clear * opencl: adjust format --------- Co-authored-by: Li He <[email protected]>

ggml: update SCHED_DEBUG output to use ggml_op_desc() (#22825)

3e941b8

vulkan: fix spv shadowing (#22760)

6d57a49

CUDA: lower-case PCI bus id, standardize for ggml (#22820)

a8fd165

pull Bot locked and limited conversation to collaborators May 8, 2026

pull Bot added the ⤵️ pull label May 8, 2026

pull Bot merged commit a8fd165 into LongLeCE:master May 8, 2026
11 of 17 checks passed

github-actions Bot added Nvidia GPU examples python ggml SYCL OpenCL Vulkan labels May 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[pull] master from ggml-org:master#1159

[pull] master from ggml-org:master#1159
pull[bot] merged 6 commits into
LongLeCE:masterfrom
ggml-org:master

pull Bot commented May 8, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

pull Bot commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

pull Bot commented May 8, 2026 •

edited

Loading