From 5f56d5ff5d9972b4b41f13bcc4d159f10efd2739 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?=E5=88=98=E9=91=AB?= Date: Fri, 19 Sep 2025 13:44:33 +0800 Subject: [PATCH] FX: update README --- README.md | 9 ++++++++- 1 file changed, 8 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index 2f18ae2..f81bc57 100644 --- a/README.md +++ b/README.md @@ -50,7 +50,7 @@ By default, when you first run the script, the model will be downloaded automati - Download VoxCPM-0.5B ``` from huggingface_hub import snapshot_download - snapshot_download("openbmb/VoxCPM-0.5B",local_files_only=local_files_only) + snapshot_download("openbmb/VoxCPM-0.5B") ``` - Download ZipEnhancer and SenseVoice-Small. We use ZipEnhancer to enhance speech prompts and SenseVoice-Small for speech prompt ASR in the web demo. ``` @@ -98,6 +98,13 @@ voxcpm --text "VoxCPM is an innovative end-to-end TTS model from ModelBest, desi --output out.wav \ --denoise +# (Optinal) Voice cloning (reference audio + transcript file) +voxcpm --text "VoxCPM is an innovative end-to-end TTS model from ModelBest, designed to generate highly expressive speech." \ + --prompt-audio path/to/voice.wav \ + --prompt-file "/path/to/text-file" \ + --output out.wav \ + --denoise + # 3) Batch processing (one text per line) voxcpm --input examples/input.txt --output-dir outs # (optional) Batch + cloning