docs: document the 0.13.1 release

Meng Zhang 2024-07-10 10:24:36 +09:00
parent cfdf70fe36
commit a14efb5ce8
2 changed files with 14 additions and 0 deletions

.changes/v0.13.1.md Normal file

@@ -0,0 +1,7 @@
## v0.13.1 (2024-07-10)
### Fixed and Improvements
* Bump llama.cpp version to b3334, supporting Deepseek V2 series models.
* Turn on fast attention for Qwen2-1.5B model to fix the quantization error.
* Properly set number of GPU layers (to zero) when device is CPU.


@@ -5,6 +5,13 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html),
and is generated by [Changie](https://github.com/miniscruff/changie).
## v0.13.1 (2024-07-10)
### Fixed and Improvements
* Bump llama.cpp version to b3334, supporting Deepseek V2 series models.
* Turn on fast attention for Qwen2-1.5B model to fix the quantization error.
* Properly set number of GPU layers (to zero) when device is CPU.
## v0.13.0 (2024-06-28)
### Features