-
Notifications
You must be signed in to change notification settings - Fork 12.5k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
llama-bench : use local GPUs along with RPC servers
examples
#14917
opened Jul 28, 2025 by
rgerganov
Loading…
opencl: add ops docs
documentation
Improvements or additions to documentation
#14910
opened Jul 28, 2025 by
lhez
Loading…
opencl: fixed a typo
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#14908
opened Jul 27, 2025 by
l29ah
Loading…
cuda : add softcap fusion
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#14907
opened Jul 27, 2025 by
CISC
Loading…
ggml : repack block_iq4_nlx8 (AVX)
ggml
changes relating to the ggml tensor library for machine learning
#14904
opened Jul 27, 2025 by
ggerganov
Loading…
1 task
Vulkan: Fix minor debug mode issues
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14899
opened Jul 27, 2025 by
0cc4m
Loading…
ggml-cpu : deduplicate scalar implementations
ggml
changes relating to the ggml tensor library for machine learning
#14897
opened Jul 27, 2025 by
xctan
Loading…
SYCL: Add set_rows support for quantized types
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14883
opened Jul 26, 2025 by
qnixsynapse
Loading…
GGML: Fix leak of backend buffer memory address in RPC
ggml
changes relating to the ggml tensor library for machine learning
#14882
opened Jul 26, 2025 by
struct
Loading…
model: add hunyuan dense
python
python script changes
#14878
opened Jul 25, 2025 by
stevenkuang-tencent
Loading…
Extend test case filtering
testing
Everything test related
#14865
opened Jul 24, 2025 by
tlemo
Loading…
Adding chat template support for Granite model
testing
Everything test related
#14864
opened Jul 24, 2025 by
smdesai
Loading…
mtmd : add support for Voxtral
documentation
Improvements or additions to documentation
examples
python
python script changes
#14862
opened Jul 24, 2025 by
ngxson
Loading…
test-backend-ops: enables perf/eval testing of composite ops
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#14833
opened Jul 23, 2025 by
etasnadi
Loading…
graph : reduce splits for recurrent and hybrid models
performance
Speed related topics
#14825
opened Jul 23, 2025 by
compilade
Loading…
feat(batched): Add functionality to upload benchmark test results
examples
#14811
opened Jul 22, 2025 by
MengAiDev
Loading…
convert : handle pre-quantized models
enhancement
New feature or request
python
python script changes
#14810
opened Jul 22, 2025 by
compilade
Loading…
2 tasks
opencl: tiled mul_mat with local memory for f16 and f32
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
Previous Next
ProTip!
no:milestone will show everything without a milestone.