From 2aec4967dfee9b9765e54204ebad20c01945624d Mon Sep 17 00:00:00 2001 From: Yuan_Tuo Date: Tue, 9 Jul 2024 16:38:39 +0800 Subject: [PATCH] Update README.md --- preprocessors/Emilia/README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/preprocessors/Emilia/README.md b/preprocessors/Emilia/README.md index e61929b2..1b5dd523 100644 --- a/preprocessors/Emilia/README.md +++ b/preprocessors/Emilia/README.md @@ -140,8 +140,8 @@ The processed audio (default 24k sample rate) files will be saved into `input_fo We acknowledge the wonderful work by these excellent developers! - Source Separation: [UVR-MDX-NET-Inst_HQ_3](https://github.com/TRvlvr/model_repo/releases/tag/all_public_uvr_models) - VAD: [snakers4/silero-vad](https://github.com/snakers4/silero-vad) -- Speaker Diarization: [snakers4/silero-vad](https://github.com/snakers4/silero-vad) -- ASR: [m-bain/whisperX](https://github.com/m-bain/whisperX) +- Speaker Diarization: [pyannote/speaker-diarization-3.1](https://huggingface.co/pyannote/speaker-diarization-3.1) +- ASR: [m-bain/whisperX](https://github.com/m-bain/whisperX), using [faster-whisper](https://github.com/guillaumekln/faster-whisper) and [CTranslate2](https://github.com/OpenNMT/CTranslate2) backend. - DNSMOS Prediction: [DNSMOS P.835](https://github.com/microsoft/DNS-Challenge)