Download Latest Version koboldcpp-1.100.1 source code.zip (45.6 MB)
Email in envelope

Get an email when there's a new version of KoboldCpp

Home / v1.29
Name Modified Size InfoDownloads / Week
Parent folder
koboldcpp.exe 2023-06-07 20.4 MB
koboldcpp-1.29 source code.tar.gz 2023-06-07 8.9 MB
koboldcpp-1.29 source code.zip 2023-06-07 9.0 MB
README.md 2023-06-07 1.0 kB
Totals: 4 Items   38.3 MB 0

koboldcpp-1.29

KoboldCpp Changes: - Added BLAS batch size to the KoboldCpp Easy Launcher GUI. - Merged the upstream K-quantization implementations for OpenBLAS. Note that the new K-quants are still not supported in CLBlast yet. Please remain on the regular quantization formats to use CLBlast for now. - Fixed LLAMA 3B OOM errors and a few other OOMs. - Multiple bugfixes and improvements in Lite, including streaming for aesthetic chat mode.

To use, download and run the koboldcpp.exe, which is a one-file pyinstaller. Alternatively, drag and drop a compatible ggml model on top of the .exe, or run it and manually select the model in the popup dialog.

and then once loaded, you can connect like this (or use the full koboldai client): http://localhost:5001

For more information, be sure to run the program with the --help flag. This release also includes a zip file containing the libraries and the koboldcpp.py script, for those who prefer not use to the one-file pyinstaller.

Source: README.md, updated 2023-06-07