Pull requests: ggml-org/llama.cpp

Pull requests list

graph : avoid huge warm-up graphs for MoE models
#14753 opened Jul 18, 2025 by ggerganov
Fix MinicpmV model converter and clip to avoid using hardcode.
Labels: examples, python
#14750 opened Jul 18, 2025 by gryffindor-rr
[ROCm] Fix HIP version check for HIPBLAS V2 API compatibility
Labels: ggml, Nvidia GPU
#14744 opened Jul 17, 2025 by danielholanda
metal: SSM_SCAN performance
Labels: Apple Metal, ggml
#14743 opened Jul 17, 2025 by gabe-l-hart
Fix Gemma3n not executed as CUDA_GRAPH on NVGPUs
Labels: ggml, Nvidia GPU
#14741 opened Jul 17, 2025 by ORippler
Improve Mistral models integration with llama.cpp
Labels: python
#14737 opened Jul 17, 2025 by juliendenize Draft
Documentation: Update build.md's Vulkan section
Labels: documentation
#14736 opened Jul 17, 2025 by rspOverflow
CUDA: skip masked out KQ slices in mma FA kernel
Labels: ggml, Nvidia GPU, python
#14735 opened Jul 17, 2025 by JohannesGaessler
vulkan: Add logging for bf16 features to ggml_vk_print_gpu_info (#13274)
Labels: ggml, Vulkan
#14707 opened Jul 16, 2025 by Peter0x44
Fix KleidiAI compilation errors with -DGGML_NATIVE=OFF (issue #14464)
Labels: ggml
#14700 opened Jul 15, 2025 by baonudesifeizhai
kleidiai: add support for get_rows
Labels: ggml
#14676 opened Jul 14, 2025 by chaxu01
Add Pad Reflect 1D CUDA support
Labels: ggml, Nvidia GPU
#14659 opened Jul 13, 2025 by YavorGIvanov
Add CUDA non-contiguous Unary Ops support
Labels: build, documentation, ggml, Nvidia GPU, testing
#14639 opened Jul 11, 2025 by YavorGIvanov
common: add config presets for falcon
#14638 opened Jul 11, 2025 by 0xs1d
OpenCL: add mul_mat_f16_f32_image kernel
Labels: ggml, OpenCL
#14635 opened Jul 11, 2025 by rmatif
HIP: Enable Matrix cores for MMQ Kernels, Enable stream-K for CDNA 3
Labels: devops, ggml, Nvidia GPU
#14624 opened Jul 10, 2025 by deepsek
tool: add convertation of text/parquet to custom format
Labels: build, examples
#14622 opened Jul 10, 2025 by lexasub