-
Notifications
You must be signed in to change notification settings - Fork 12.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
vulkan: handle noncontig in the final case of ggml_vk_get_cpy_pipeline
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14378
opened Jun 25, 2025 by
jeffbolznv
Loading…
webui: preserve partial content when streaming errors occur
examples
server
#14374
opened Jun 25, 2025 by
Aaryan-549
Loading…
5 of 8 tasks
Q2k interleaving implementation - x86/x64 SIMD
ggml
changes relating to the ggml tensor library for machine learning
#14373
opened Jun 25, 2025 by
Srihari-mcw
Loading…
test-backend-ops: add support for specifying output format
testing
Everything test related
#14368
opened Jun 25, 2025 by
yeahdongcn
Loading…
vulkan: Add fusion support for RMS_NORM+MUL
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#14366
opened Jun 24, 2025 by
jeffbolznv
•
Draft
ggml : add pointer to attach user data
ggml
changes relating to the ggml tensor library for machine learning
#14365
opened Jun 24, 2025 by
koush
Loading…
llama : add high-throughput mode
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
CUDA: add bf16 and f32 support to cublas_mul_mat_batched
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#14361
opened Jun 24, 2025 by
am17an
Loading…
build: refine toplevel .gitignore
script
Script related
#14355
opened Jun 24, 2025 by
zhouwg
Loading…
1 task done
Add script to test op perf and compare
python
python script changes
script
Script related
#14354
opened Jun 24, 2025 by
yeahdongcn
Loading…
vulkan: Increase workgroup size for GLU, for performance
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14345
opened Jun 23, 2025 by
jeffbolznv
Loading…
Make the shell scripts cross-platform
devops
improvements to build systems and github actions
examples
script
Script related
server
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#14341
opened Jun 23, 2025 by
vedranmiletic
Loading…
vulkan: lock accesses of pinned_memory vector
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14333
opened Jun 22, 2025 by
jeffbolznv
Loading…
ggml-cpu: Pass on tag_name to the feature scoring
ggml
changes relating to the ggml tensor library for machine learning
#14332
opened Jun 22, 2025 by
ckastner
Loading…
Fix appearance of the chats list context menu for the browser Safari
examples
server
#14322
opened Jun 22, 2025 by
rntk
Loading…
ggml-cpu: enable IBM NNPA Vector Intrinsics
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
#14317
opened Jun 21, 2025 by
taronaeo
Loading…
GitHub workflow: set RPATH to "@loader_path" / "$ORIGIN" to ensure executables and dynamic libraries search for dependencies in their origin directory.
devops
improvements to build systems and github actions
examples
#14309
opened Jun 20, 2025 by
rotemdan
Loading…
Fix Windows Null Pointer Bug and Enhance Memory Operations in ggml-sycl
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14290
opened Jun 20, 2025 by
MengAiDev
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.