v0.3.0 [Unreleased]

Features

Fixes and improvements

v0.2.2 (10/09/2023)

Fixes and improvements

Fix a critical issue that could cause requests to deadlock in the ctranslate2 backend under heavy load.

v0.2.1 (10/03/2023)

Features

Chat Model & Web Interface

We have introduced a new argument, --chat-model, which allows you to specify the model for the chat playground located at http://localhost:8080/playground

To utilize this feature, use the following command in the terminal:

tabby serve --device metal --model TabbyML/StarCoder-1B --chat-model TabbyML/Mistral-7B

ModelScope Model Registry

Users in mainland China have had difficulty accessing Hugging Face. The Tabby team is working to address this by mirroring models to modelscope.cn, a hosting provider in mainland China.

# Download from the Modelscope registry
TABBY_REGISTRY=modelscope tabby download --model TabbyML/WizardCoder-1B

Fixes and improvements

Implemented more accurate UTF-8 incremental decoding in the GitHub pull request.
Fixed the stop words implementation by utilizing RegexSet to isolate the stop word group.
Improved model downloading logic: Tabby now attempts to fetch the latest model version when the remote has changed and the local cache key is stale.
Set a default num_replicas_per_device for the ctranslate2 backend to increase parallelism.
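
For readers unfamiliar with the incremental UTF-8 decoding mentioned above, the sketch below illustrates the general idea: when text arrives as a stream of byte chunks, a multi-byte character may be split across two chunks, so the decoder has to hold back incomplete bytes until the rest arrives. This is not Tabby's implementation; it is a minimal Python illustration using the standard library, and the function name decode_stream is hypothetical.

```python
import codecs


def decode_stream(chunks):
    """Yield text from an iterable of byte chunks (hypothetical helper).

    Incomplete UTF-8 sequences at the end of a chunk are buffered by the
    decoder and emitted only once the remaining bytes arrive.
    """
    decoder = codecs.getincrementaldecoder("utf-8")(errors="replace")
    for chunk in chunks:
        text = decoder.decode(chunk)
        if text:
            yield text
    # Flush: any bytes still buffered here form an invalid sequence
    # and are emitted as the replacement character.
    tail = decoder.decode(b"", final=True)
    if tail:
        yield tail


# "é" is encoded as 0xC3 0xA9; here it is split across two chunks.
pieces = list(decode_stream([b"caf\xc3", b"\xa9 ok"]))
print(pieces)
```

Decoding each chunk independently would instead produce a replacement character at the split, which is the kind of artifact incremental decoding avoids.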
