Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

musa: fix build warnings (unused variable) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#14561 opened Jul 7, 2025 by yeahdongcn Loading…
Add PLaMo-2 model examples python python script changes
#14560 opened Jul 7, 2025 by mitmul Draft
vulkan: optimizations for deepseek prompt processing ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#14555 opened Jul 6, 2025 by jeffbolznv Loading…
vulkan: optimize flash attention split_k_reduce ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#14554 opened Jul 6, 2025 by jeffbolznv Loading…
CUDA: add set rows for f32 and f16 ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#14551 opened Jul 6, 2025 by am17an Loading…
opencl: add set_rows for f16 and f32 ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#14547 opened Jul 6, 2025 by lhez Loading…
OpenCL: add tiled mul_mat_f16_f32 ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#14535 opened Jul 4, 2025 by rmatif Loading…
ggml: fix typo in ggml.c ggml changes relating to the ggml tensor library for machine learning
#14531 opened Jul 4, 2025 by zhouwg Loading…
CUDA: add bf16 and i32 to getrows ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#14529 opened Jul 4, 2025 by am17an Loading…
ggml: Add initial WebGPU backend devops improvements to build systems and github actions documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning python python script changes
#14521 opened Jul 3, 2025 by reeselevine Loading…
kv-cache : prepare K/V buffers for separation
#14517 opened Jul 3, 2025 by ggerganov Loading…
MUSA: upgrade musa sdk to <<TBD>> ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#14498 opened Jul 2, 2025 by yeahdongcn Draft
Allow truncation when embedding examples server
#14493 opened Jul 2, 2025 by huydt84 Loading…
llama : reuse compute graphs examples
#14482 opened Jul 1, 2025 by ggerganov Loading…
6 of 15 tasks
Pr/7191 build Compilation issues devops improvements to build systems and github actions python python script changes
#14447 opened Jun 29, 2025 by esrakorkmz Loading…
Added CI with RISC-V RVV1.0 Hardware devops improvements to build systems and github actions
#14439 opened Jun 29, 2025 by alitariq4589 Loading…
ProTip! Filter pull requests by the default branch with base:master.