-
Notifications
You must be signed in to change notification settings - Fork 12.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
train: add simple loading already tokenized data from parquet dataset
build
Compilation issues
examples
ggml: Add initial WebGPU backend
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
#14521
opened Jul 3, 2025 by
reeselevine
Loading…
vulkan: Handle updated FA dim2/3 definition
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14518
opened Jul 3, 2025 by
jeffbolznv
Loading…
mtmd : Fix 32-bit narrowing issue in export-lora and mtmd clip
examples
#14503
opened Jul 2, 2025 by
kiwi142857
Loading…
MUSA: upgrade musa sdk to <<TBD>>
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14498
opened Jul 2, 2025 by
yeahdongcn
•
Draft
vulkan: unpack more values at a time for iquants mat mul
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14485
opened Jul 1, 2025 by
netrunnereve
Loading…
Compute buffer and KV-cache aware layer distribution for multi-GPU inference
#14484
opened Jul 1, 2025 by
borebot
Loading…
opencl : add GELU_ERF
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#14476
opened Jul 1, 2025 by
CISC
Loading…
server : (webui) let server send locally-defined default webui settings
examples
server
#14468
opened Jun 30, 2025 by
woof-dog
Loading…
Chore: batch prompts, extract tensors specific layer
examples
#14463
opened Jun 30, 2025 by
VakantieModus
Loading…
Pr/7191
build
Compilation issues
devops
improvements to build systems and github actions
python
python script changes
#14447
opened Jun 29, 2025 by
esrakorkmz
Loading…
ggml : implement GEGLU_ERF and GEGLU_QUICK ops
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
OpenCL
Issues specific to the OpenCL backend
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#14445
opened Jun 29, 2025 by
CISC
Loading…
Added CI with RISC-V RVV1.0 Hardware
devops
improvements to build systems and github actions
#14439
opened Jun 29, 2025 by
alitariq4589
Loading…
model : add hunyuan moe
python
python script changes
#14425
opened Jun 27, 2025 by
ngxson
Loading…
4 tasks done
ggml : add ggml_scale_bias
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
[CANN] weight format to nz for Ascend310P3
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#14407
opened Jun 27, 2025 by
tqgy6
Loading…
OpenCL: add conv2d kernel
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#14403
opened Jun 26, 2025 by
rmatif
Loading…
ggml : add pointer to attach user data
ggml
changes relating to the ggml tensor library for machine learning
#14397
opened Jun 26, 2025 by
koush
Loading…
compare-commits.sh: support both llama-bench and test-backend-ops
python
python script changes
script
Script related
#14392
opened Jun 26, 2025 by
yeahdongcn
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:master.