-
Notifications
You must be signed in to change notification settings - Fork 11.4k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
sycl: use DNN in the first part of ggml_sycl_mul_mat_batched_sycl
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12972
opened Apr 16, 2025 by
lslusarczyk
•
Draft
ggml: Re-enable CUDA graphs in presence of CONT and DUP nodes
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12970
opened Apr 16, 2025 by
agray3
Loading…
Resolved half rope,multi-EOS issues in convert_hf_togguf.py for GLM4Z Model
python
python script changes
#12957
opened Apr 15, 2025 by
piDack
Loading…
rpc : add RPC_CMD_HELLO
examples
ggml
changes relating to the ggml tensor library for machine learning
#12955
opened Apr 15, 2025 by
rgerganov
Loading…
graph : make FA compatible with MLA + add initial Metal kernels
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#12953
opened Apr 15, 2025 by
ggerganov
Loading…
rpc : do not wait for response when sending RPC_CMD_SET_TENSOR
ggml
changes relating to the ggml tensor library for machine learning
set b = ub when b > ub with embedding
examples
server
#12940
opened Apr 14, 2025 by
ahmedshakill
Loading…
server : use std::move whenever possible
examples
server
#12936
opened Apr 14, 2025 by
ngxson
Loading…
vulkan: enable coopmat2 FA gqa and split_k optimizations more often
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#12931
opened Apr 13, 2025 by
jeffbolznv
Loading…
gguf-py : GGUF Editor GUI - Python + Qt
python
python script changes
#12930
opened Apr 13, 2025 by
christopherthompson81
Loading…
mtmd : add methods to access
mtmd_image_tokens
examples
#12906
opened Apr 11, 2025 by
ngxson
Loading…
Get CPU model in ggml_backend_cpu_device_context on FreeBSD
ggml
changes relating to the ggml tensor library for machine learning
#12902
opened Apr 11, 2025 by
yurivict
Loading…
cuda: fix compilation error (#12893)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12894
opened Apr 11, 2025 by
lizhenneng
Loading…
llama-bench: enhance benchmark with improved token throughput measurements
examples
#12874
opened Apr 10, 2025 by
thevishalagarwal
Loading…
ggml : add SSE 4.2 and x64 base variant for CPUs without AVX
ggml
changes relating to the ggml tensor library for machine learning
#12871
opened Apr 10, 2025 by
slaren
Loading…
opencl: fix incorrect local_size index in profiling log
ggml
changes relating to the ggml tensor library for machine learning
#12868
opened Apr 10, 2025 by
kimminsu38oo
Loading…
CANN: Add support for async operator submission
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#12864
opened Apr 10, 2025 by
hipudding
Loading…
sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12858
opened Apr 10, 2025 by
Alcpz
Loading…
2 of 3 tasks
gguf-py: byteswapping improvements
python
python script changes
#12851
opened Apr 9, 2025 by
AlekseiNikiforovIBM
Loading…
metal : add memory pool for temp allocs
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#12850
opened Apr 9, 2025 by
ggerganov
Loading…
9 tasks done
Previous Next
ProTip!
no:milestone will show everything without a milestone.