notes on real time voice conversion

Plachtaa · Nov 28, 2024 · 761986a · 761986a
1 parent e8299ef
commit 761986a
Show file tree

Hide file tree

Showing 2 changed files with 3 additions and 1 deletion.
diff --git a/README.md b/README.md
@@ -100,6 +100,8 @@ Some performance testing has been done on a NVIDIA RTX 3060 Laptop GPU, results
 You can adjust the parameters in the GUI according to your own device performance, the voice conversion stream should work well as long as Inference Time is less than Block Time.  
 Note that inference speed may drop if you are running other GPU intensive tasks (e.g. gaming, watching videos)  
 
+You may wish to use [VB-CABLE](https://vb-audio.com/Cable/) to route audio from GUI output stream to a virtual microphone.  
+
 *(GUI and audio chunking logic are modified from [RVC](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI), thanks for their brilliant implementation!)*
 
 ## Training🏋️

diff --git a/real-time-gui.py b/real-time-gui.py
@@ -434,7 +434,7 @@ def load(self):
                             self.output_devices_indices.index(sd.default.device[1])
                         ],
                         "sr_type": "sr_model",
-                        "block_time": 0.5,
+                        "block_time": 0.3,
                         "crossfade_length": 0.04,
                         "extra_time_ce": 2.5,
                         "extra_time": 0.5,