Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

vulkan: Split large mul_mat_id to fit in shared memory ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#14451 opened Jun 29, 2025 by jeffbolznv Loading…
convert : correct gemma 3n conversion python python script changes
#14450 opened Jun 29, 2025 by ngxson Loading…
vulkan: support softmax/FA batch and broadcast ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#14449 opened Jun 29, 2025 by jeffbolznv Loading…
Pr/7191 build Compilation issues devops improvements to build systems and github actions python python script changes
#14447 opened Jun 29, 2025 by esrakorkmz Loading…
ggml : implement GEGLU_ERF and GEGLU_QUICK ops Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language Vulkan Issues specific to the Vulkan backend
#14445 opened Jun 29, 2025 by CISC Loading…
Added CI with RISC-V RVV1.0 Hardware devops improvements to build systems and github actions
#14439 opened Jun 29, 2025 by alitariq4589 Loading…
memory : correctly handle failure in apply()
#14438 opened Jun 29, 2025 by ggerganov Loading…
ggml : support broadcast for ggml_soft_max_ext and ggml_flash_attn_ext Apple Metal https://en.wikipedia.org/wiki/Metal_(API) Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#14435 opened Jun 28, 2025 by ggerganov Loading…
2 of 5 tasks
model : add hunyuan moe python python script changes
#14425 opened Jun 27, 2025 by ngxson Draft
2 of 4 tasks
ggml : add ggml_scale_bias Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#14417 opened Jun 27, 2025 by ngxson Draft
[CANN]update aclnnGroupedMatmulV2 to aclnnGroupedMatmulV3 Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#14411 opened Jun 27, 2025 by noemotiovon Loading…
[CANN] weight format to nz for Ascend310P3 Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#14407 opened Jun 27, 2025 by tqgy6 Loading…
OpenCL: add conv2d kernel ggml changes relating to the ggml tensor library for machine learning
#14403 opened Jun 26, 2025 by rmatif Loading…
ggml : add pointer to attach user data ggml changes relating to the ggml tensor library for machine learning
#14397 opened Jun 26, 2025 by koush Loading…
compare-commits.sh: support both llama-bench and test-backend-ops python python script changes script Script related
#14392 opened Jun 26, 2025 by yeahdongcn Loading…
Add Conv2d for CPU ggml changes relating to the ggml tensor library for machine learning
#14388 opened Jun 26, 2025 by am17an Loading…
ggml-cpu: Build variant targeting Neoverse-V2 ggml changes relating to the ggml tensor library for machine learning
#14380 opened Jun 25, 2025 by ckastner Loading…
Q2k interleaving implementation - x86/x64 SIMD ggml changes relating to the ggml tensor library for machine learning
#14373 opened Jun 25, 2025 by Srihari-mcw Loading…
docs: fix broken url in main readme
#14371 opened Jun 25, 2025 by justinclift-prvidr Loading…
test-backend-ops: add support for specifying output format testing Everything test related
#14368 opened Jun 25, 2025 by yeahdongcn Loading…
llama : add high-throughput mode Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning
#14363 opened Jun 24, 2025 by ggerganov Draft
11 of 19 tasks
llama : expose C API to get layer device type
#14358 opened Jun 24, 2025 by okaris Loading…
ProTip! Exclude everything labeled bug with -label:bug.