Pull requests: ggml-org/llama.cpp

Pull requests list

CUDA: add roll
#14919 opened Jul 28, 2025 by am17an
repack : optimize mul_mat_id path
#14918 opened Jul 28, 2025 by ggerganov
1 task
opencl: add ops docs [documentation]
#14910 opened Jul 28, 2025 by lhez
opencl: fixed a typo [ggml, OpenCL]
#14908 opened Jul 27, 2025 by l29ah
cuda : add softcap fusion [ggml, Nvidia GPU, testing]
#14907 opened Jul 27, 2025 by CISC
ggml : repack block_iq4_nlx8 (AVX) [ggml]
#14904 opened Jul 27, 2025 by ggerganov
1 task
Vulkan: Add Integer Dot Product mul_mat_vec shader for legacy quants [ggml, Vulkan]
#14903 opened Jul 27, 2025 by 0cc4m (Draft)
Vulkan: Fix minor debug mode issues [ggml, Vulkan]
#14899 opened Jul 27, 2025 by 0cc4m
ggml-cpu : deduplicate scalar implementations [ggml]
#14897 opened Jul 27, 2025 by xctan
SYCL: Add set_rows support for quantized types [ggml, SYCL]
#14883 opened Jul 26, 2025 by qnixsynapse
GGML: Fix leak of backend buffer memory address in RPC [ggml]
#14882 opened Jul 26, 2025 by struct
model: add hunyuan dense [python]
#14878 opened Jul 25, 2025 by stevenkuang-tencent
Support intern-s1 [python]
#14875 opened Jul 25, 2025 by RunningLeon
Extend test case filtering [testing]
#14865 opened Jul 24, 2025 by tlemo
Adding chat template support for Granite model [testing]
#14864 opened Jul 24, 2025 by smdesai
mtmd : add support for Voxtral [documentation, examples, python]
#14862 opened Jul 24, 2025 by ngxson
imatrix : use GGUF by default [examples]
#14842 opened Jul 24, 2025 by compilade
SvelteKit-based WebUI [examples, script, server]
#14839 opened Jul 23, 2025 by allozaur (Draft)
test-backend-ops: enables perf/eval testing of composite ops [ggml, testing]
#14833 opened Jul 23, 2025 by etasnadi
graph : reduce splits for recurrent and hybrid models [performance]
#14825 opened Jul 23, 2025 by compilade
convert : handle pre-quantized models [enhancement, python]
#14810 opened Jul 22, 2025 by compilade
2 tasks
opencl: tiled mul_mat with local memory for f16 and f32 [ggml, OpenCL]
#14809 opened Jul 22, 2025 by lhez (Draft)