Update USAGE.md

This commit is contained in:
Alienpups 2024-02-08 19:52:57 +01:00
parent 1556dd3c15
commit fc6329704b

View File

@ -1,6 +1,7 @@
# Usage # Usage
## Table of Content ## Table of Content
- [Use in MyShell](#use-in-myshell): directly use the Instant Voice Clone and TTS services. - [Use in MyShell](#use-in-myshell): directly use the Instant Voice Clone and TTS services.
- [Minimal Demo](#minimal-demo): quickly try OpenVoice and do not require high quality. - [Minimal Demo](#minimal-demo): quickly try OpenVoice and do not require high quality.
- [Linux Install](#linux-install): for researchers and developers only. - [Linux Install](#linux-install): for researchers and developers only.
@ -10,19 +11,25 @@
For most users, the most convenient way is to directly use the free TTS and Instant Voice Clone services in MyShell. For most users, the most convenient way is to directly use the free TTS and Instant Voice Clone services in MyShell.
### TTS ### TTS
Go to [https://app.myshell.ai/explore](https://app.myshell.ai/explore) and follow the instructions below: Go to [https://app.myshell.ai/explore](https://app.myshell.ai/explore) and follow the instructions below:
<div align="center"> <div align="center">
<img src="../resources/tts-guide.png" width="1200"/> <img src="../resources/tts-guide.png" width="1200"/>
</div> </div>
### Voice Clone ### Voice Clone
Go to [https://app.myshell.ai/explore](https://app.myshell.ai/explore) and follow the instructions below: Go to [https://app.myshell.ai/explore](https://app.myshell.ai/explore) and follow the instructions below:
<div align="center"> <div align="center">
<img src="../resources/voice-clone-guide.png" width="61200"/> <img src="../resources/voice-clone-guide.png" width="61200"/>
</div> </div>
## Minimal Demo ## Minimal Demo
For users who want to quickly try OpenVoice and do not require high quality or stability, click any of the following links: For users who want to quickly try OpenVoice and do not require high quality or stability, click any of the following links:
<div align="center"> <div align="center">
<a href="https://www.lepton.ai/playground/openvoice"><img src="../resources/lepton-hd.png" height="28"></a> <a href="https://www.lepton.ai/playground/openvoice"><img src="../resources/lepton-hd.png" height="28"></a>
&nbsp;&nbsp;&nbsp;&nbsp; &nbsp;&nbsp;&nbsp;&nbsp;
@ -32,14 +39,17 @@ For users who want to quickly try OpenVoice and do not require high quality or s
</div> </div>
## Linux Install ## Linux Install
This section is only for developers and researchers who are familiar with Linux, Python and PyTorch. Clone this repo, and run This section is only for developers and researchers who are familiar with Linux, Python and PyTorch. Clone this repo, and run
``` ```
conda create -n openvoice python=3.9 conda create -n openvoice python=3.9
conda activate openvoice conda activate openvoice
conda install pytorch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 pytorch-cuda=11.7 -c pytorch -c nvidia conda install pytorch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 pytorch-cuda=11.7 -c pytorch -c nvidia
pip install -r requirements.txt pip install -r requirements.txt
``` ```
Download the checkpoint from [here](https://myshell-public-repo-hosting.s3.amazonaws.com/checkpoints_1226.zip) and extract it to the `checkpoints` folder
Download the checkpoint from [here](https://myshell-public-repo-hosting.s3.amazonaws.com/checkpoints_1226.zip) and extract it to the `checkpoints` folder
**1. Flexible Voice Style Control.** **1. Flexible Voice Style Control.**
Please see [`demo_part1.ipynb`](../demo_part1.ipynb) for an example usage of how OpenVoice enables flexible style control over the cloned voice. Please see [`demo_part1.ipynb`](../demo_part1.ipynb) for an example usage of how OpenVoice enables flexible style control over the cloned voice.
@ -54,3 +64,7 @@ The base speaker model can be replaced with any model (in any language and style
**4. Tips to Generate Natural Speech.** **4. Tips to Generate Natural Speech.**
There are many single or multi-speaker TTS methods that can generate natural speech, and are readily available. By simply replacing the base speaker model with the model you prefer, you can push the speech naturalness to a level you desire. There are many single or multi-speaker TTS methods that can generate natural speech, and are readily available. By simply replacing the base speaker model with the model you prefer, you can push the speech naturalness to a level you desire.
## Windows Install (VS Code)
Please use [this guide](https://github.com/Alienpups/OpenVoice/blob/main/docs/USAGE_WINDOWS.md) if you want to install and use OpenVoice on Windows