finish vocoder
cantabile-kwok committed Oct 8, 2023
1 parent 60eea03 commit c967411
Showing 2 changed files with 5 additions and 3 deletions.
2 changes: 1 addition & 1 deletion README.md
@@ -1,4 +1,4 @@
-# \[Working in Progress\] VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
+# VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
> This is the official implementation of [VoiceFlow](https://arxiv.org/abs/2309.05027).
![traj](resources/traj.png)
6 changes: 4 additions & 2 deletions hifigan/README.md
@@ -8,11 +8,13 @@ The detailed information is:
| LJSpeech | 16k | 256 | 1024 | True |
| LibriTTS | 16k | 200 | 800 | True |

-The trained checkpoint on both datasets are provided online. You can unzip them to sub-folders in `exp/`.
+The trained checkpoint on both datasets are provided online. You can unzip them to sub-folders in `exp/`:
+* LJSpeech: [link](https://huggingface.co/cantabile-kwok/hifigan-ljspeech-1024-256/resolve/main/train_hifigan.ljspeech.zip)
+* LibriTTS: [link](https://huggingface.co/cantabile-kwok/hifigan-libritts-800-200/resolve/main/train_hifigan.libritts.zip)

Vocoding can be done by
```shell
cd ../; source path.sh; cd -; # if path.sh not activated
-bash generation.sh --dataset "ljspeech/libritts" --eval_dir /path/that/contains/feats.scp
+bash generation.sh --dataset "ljspeech or libritts" --eval_dir /path/that/contains/feats.scp
```
The program will read feats.scp in $eval_dir and synthesize audio to save in that dir.
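
As a rough illustration of the checkpoint step described in this change, the sketch below fetches one of the two linked archives and unzips it under `exp/`. The helper names `checkpoint_url` and `fetch_checkpoint`, the `exp/<dataset>` target folder, and the local file name `checkpoint.zip` are assumptions made for this example and are not part of the repository; only the two URLs come from the README. It also assumes `curl` and `unzip` are installed.

```shell
#!/usr/bin/env bash
# Sketch only: helper names and the exp/<dataset> layout are assumptions.

# Print the checkpoint URL listed in the README for a given dataset name.
checkpoint_url() {
  case "$1" in
    ljspeech) echo "https://huggingface.co/cantabile-kwok/hifigan-ljspeech-1024-256/resolve/main/train_hifigan.ljspeech.zip" ;;
    libritts) echo "https://huggingface.co/cantabile-kwok/hifigan-libritts-800-200/resolve/main/train_hifigan.libritts.zip" ;;
    *) echo "unknown dataset: $1" >&2; return 1 ;;
  esac
}

# Download the archive and unzip it into a sub-folder of exp/.
# The sub-folder name is a guess; adjust it to what generation.sh expects.
fetch_checkpoint() {
  local dataset="$1" url
  url="$(checkpoint_url "$dataset")" || return 1
  mkdir -p "exp/$dataset"
  curl -L -o "exp/$dataset/checkpoint.zip" "$url"
  unzip -o "exp/$dataset/checkpoint.zip" -d "exp/$dataset"
}
```

Running `fetch_checkpoint ljspeech` from the `hifigan/` directory would then precede the `generation.sh` call, with `--dataset` set to the same single dataset name.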
