Download Latest Version koboldcpp-1.100.1 source code.zip (45.6 MB)
Email in envelope

Get an email when there's a new version of KoboldCpp

Home / v1.0.5
Name Modified Size InfoDownloads / Week
Parent folder
llamacpp_for_kobold.exe 2023-03-25 10.6 MB
llamacpp-for-kobold-1.0.5 source code.tar.gz 2023-03-25 2.4 MB
llamacpp-for-kobold-1.0.5 source code.zip 2023-03-25 2.4 MB
README.md 2023-03-25 633 Bytes
Totals: 4 Items   15.4 MB 0

llamacpp-for-kobold-1.0.5

  • Merged the upstream fixes for 65b
  • Clamped max thread count to 4, it actually provides better results as it is memory bottlenecked.
  • Added support for select kv data type, defaulting to f32 instead of f16
  • Added more default build flags
  • Added softprompts endpoint

To use, download and run the llamacpp_for_kobold.exe Alternatively, drag and drop a compatible quantized model for llamacpp on top of the .exe, or run it and manually select the model in the popup dialog.

and then once loaded, you can connect like this (or use the full koboldai client): http://localhost:5001

Source: README.md, updated 2023-03-25