diff --git a/.changes/v0.13.1.md b/.changes/v0.13.1.md
new file mode 100644
index 000000000..c13b7ad71
--- /dev/null
+++ b/.changes/v0.13.1.md
@@ -0,0 +1,7 @@
+## v0.13.1 (2024-07-10)
+
+### Fixed and Improvements
+
+* Bump llama.cpp version to b3334, supporting Deepseek V2 series models.
+* Turn on fast attention for Qwen2-1.5B model to fix the quantization error.
+* Properly set number of GPU layers (to zero) when device is CPU.
\ No newline at end of file
diff --git a/CHANGELOG.md b/CHANGELOG.md
index 0df45b336..d25b05a86 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -5,6 +5,13 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html),
 and is generated by [Changie](https://github.com/miniscruff/changie).
 
+## v0.13.1 (2024-07-10)
+
+### Fixed and Improvements
+
+* Bump llama.cpp version to b3334, supporting Deepseek V2 series models.
+* Turn on fast attention for Qwen2-1.5B model to fix the quantization error.
+* Properly set number of GPU layers (to zero) when device is CPU.
 ## v0.13.0 (2024-06-28)
 
 ### Features