Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

ggml: Add initial WebGPU backend devops improvements to build systems and github actions documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning python python script changes
#14521 opened Jul 3, 2025 by reeselevine Loading…
vulkan: Handle updated FA dim2/3 definition ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#14518 opened Jul 3, 2025 by jeffbolznv Loading…
kv-cache : prepare K/V buffers for separation
#14517 opened Jul 3, 2025 by ggerganov Loading…
graph : prepare for 4D mask
#14515 opened Jul 3, 2025 by ggerganov Loading…
batch : add n_used count
#14512 opened Jul 3, 2025 by ggerganov Loading…
batch : add optional for sequential equal split
#14511 opened Jul 3, 2025 by ggerganov Loading…
MUSA: upgrade musa sdk to <<TBD>> ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#14498 opened Jul 2, 2025 by yeahdongcn Draft
Allow truncation when embedding examples server
#14493 opened Jul 2, 2025 by huydt84 Loading…
vulkan: unpack more values at a time for iquants mat mul ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#14485 opened Jul 1, 2025 by netrunnereve Loading…
llama : reuse compute graphs
#14482 opened Jul 1, 2025 by ggerganov Loading…
3 of 7 tasks
opencl : add GELU_ERF ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#14476 opened Jul 1, 2025 by CISC Loading…
Pr/7191 build Compilation issues devops improvements to build systems and github actions python python script changes
#14447 opened Jun 29, 2025 by esrakorkmz Loading…
ggml : implement GEGLU_ERF and GEGLU_QUICK ops Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs OpenCL Issues specific to the OpenCL backend SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language Vulkan Issues specific to the Vulkan backend
#14445 opened Jun 29, 2025 by CISC Loading…
Added CI with RISC-V RVV1.0 Hardware devops improvements to build systems and github actions
#14439 opened Jun 29, 2025 by alitariq4589 Loading…
model : add hunyuan moe python python script changes
#14425 opened Jun 27, 2025 by ngxson Loading…
4 tasks done
ggml : add ggml_scale_bias Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#14417 opened Jun 27, 2025 by ngxson Draft
[CANN] weight format to nz for Ascend310P3 Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#14407 opened Jun 27, 2025 by tqgy6 Loading…
OpenCL: add conv2d kernel ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#14403 opened Jun 26, 2025 by rmatif Loading…
ggml : add pointer to attach user data ggml changes relating to the ggml tensor library for machine learning
#14397 opened Jun 26, 2025 by koush Loading…
compare-commits.sh: support both llama-bench and test-backend-ops python python script changes script Script related
#14392 opened Jun 26, 2025 by yeahdongcn Loading…
ProTip! Filter pull requests by the default branch with base:master.