Skip to content

Commit

Permalink
notes on real time voice conversion
Browse files Browse the repository at this point in the history
  • Loading branch information
Plachtaa committed Nov 28, 2024
1 parent e8299ef commit 761986a
Show file tree
Hide file tree
Showing 2 changed files with 3 additions and 1 deletion.
2 changes: 2 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -100,6 +100,8 @@ Some performance testing has been done on a NVIDIA RTX 3060 Laptop GPU, results
You can adjust the parameters in the GUI according to your own device performance, the voice conversion stream should work well as long as Inference Time is less than Block Time.
Note that inference speed may drop if you are running other GPU intensive tasks (e.g. gaming, watching videos)

You may wish to use [VB-CABLE](https://vb-audio.com/Cable/) to route audio from GUI output stream to a virtual microphone.

*(GUI and audio chunking logic are modified from [RVC](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI), thanks for their brilliant implementation!)*

## Training🏋️
Expand Down
2 changes: 1 addition & 1 deletion real-time-gui.py
Original file line number Diff line number Diff line change
Expand Up @@ -434,7 +434,7 @@ def load(self):
self.output_devices_indices.index(sd.default.device[1])
],
"sr_type": "sr_model",
"block_time": 0.5,
"block_time": 0.3,
"crossfade_length": 0.04,
"extra_time_ce": 2.5,
"extra_time": 0.5,
Expand Down

0 comments on commit 761986a

Please sign in to comment.