Metrify RawNet3/Resemblyzer as Keywords & Update READMEs #85

Merakist · 2024-01-01T14:14:52Z

Description

This pull request further integrates Resemblyzer into Amphion's evaluation process. It includes:

Modifying the cosine similarity calculation method: Previously, resemblyzer_similarity.py relied on scipy.spatial.distance.cosine (which returns the cosine distance, that is, 1 minus the similarity) to compute cosine similarity. This update replaces it with PyTorch's torch.nn.functional.cosine_similarity, which other scripts already use, to simplify the codebase and keep it uniform.
Metrifying RawNet3/Resemblyzer as keywords: Replaced the single speaker_similarity value of the --metrics argument with two distinct options, rawnet3_similarity and resemblyzer_similarity. This streamlines the evaluation process by removing the need for user input during script execution.

Testing

The SciPy and Torch methods for calculating cosine similarity are defined in essentially the same way (see the documentation for SciPy and Torch). Results were computed with both methods, and the two values agree to the 7th decimal place (see Scipy - Torch Result Comparison). The two methods are therefore interchangeable without loss of accuracy.
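The relationship between the two definitions can be sketched in plain Python. This is only an illustration, not the comparison script used for this PR: the two helper functions are hypothetical and mirror the documented formulas rather than calling SciPy or PyTorch, the vectors are made-up values, and the placement of the eps guard is an assumption (it has varied between PyTorch versions).

```python
import math


def scipy_style_cosine_distance(u, v):
    """Cosine distance as SciPy documents it: 1 - (u . v) / (||u|| * ||v||)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def torch_style_cosine_similarity(x1, x2, eps=1e-8):
    """Cosine similarity with an eps floor on each norm (assumed placement)."""
    dot = sum(a * b for a, b in zip(x1, x2))
    norm_1 = max(math.sqrt(sum(a * a for a in x1)), eps)
    norm_2 = max(math.sqrt(sum(b * b for b in x2)), eps)
    return dot / (norm_1 * norm_2)


# Made-up embeddings standing in for a reference and a generated utterance.
ref = [0.12, -0.48, 0.33, 0.79]
deg = [0.10, -0.51, 0.30, 0.81]

# SciPy's function returns a distance, so it must be converted to a similarity.
sim_from_distance = 1.0 - scipy_style_cosine_distance(ref, deg)
sim_direct = torch_style_cosine_similarity(ref, deg)
difference = abs(sim_from_distance - sim_direct)

print(f"{sim_from_distance:.7f} {sim_direct:.7f} {difference:.1e}")
```

For non-zero vectors the eps floor never triggers, so any remaining difference comes from floating-point rounding only.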

Changes

Amphion/evaluation/metrics/similarity/resemblyzer_similarity.py:
- Replaced scipy.spatial.distance.cosine with torch.nn.functional.cosine_similarity.
Amphion/bins/calc_metrics.py:
- Split speaker_similarity into two distinct options: rawnet3_similarity and resemblyzer_similarity.
- Fixed a bug where the FAD branch was missing.
Amphion/egs/metrics/README.md:
- Updated Amphion's Evaluation Recipe to include Resemblyzer, replaced the speaker_similarity keyword with rawnet3_similarity and resemblyzer_similarity, and fixed a misspelling.
Amphion/README.md:
- Listed Resemblyzer as an available method for calculating speaker similarity.

Usage

When calculating speaker similarity with Amphion/egs/metrics/run.sh, use the --metrics argument to select the desired model (RawNet3 or Resemblyzer) with the corresponding keyword (rawnet3_similarity or resemblyzer_similarity).
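A minimal sketch of keyword-based selection, assuming an argparse-style interface. This is not the actual parser in calc_metrics.py: build_parser, select_similarity_models, and SUPPORTED_METRICS are hypothetical names, and only the keywords fad, rawnet3_similarity, and resemblyzer_similarity come from this pull request.

```python
import argparse

# Hypothetical keyword list; the real script may accept more metrics.
SUPPORTED_METRICS = ["fad", "rawnet3_similarity", "resemblyzer_similarity"]


def build_parser():
    """Build a parser whose --metrics flag accepts one or more keywords."""
    parser = argparse.ArgumentParser(description="Sketch of metric selection")
    parser.add_argument(
        "--metrics",
        nargs="+",
        choices=SUPPORTED_METRICS,
        required=True,
        help="Metrics to compute, selected by keyword",
    )
    return parser


def select_similarity_models(metrics):
    """Map each similarity keyword to the model it selects."""
    keyword_to_model = {
        "rawnet3_similarity": "RawNet3",
        "resemblyzer_similarity": "Resemblyzer",
    }
    return [keyword_to_model[m] for m in metrics if m in keyword_to_model]


# Both speaker similarity models are requested by keyword, with no prompt.
args = build_parser().parse_args(
    ["--metrics", "rawnet3_similarity", "resemblyzer_similarity"]
)
selected = select_similarity_models(args.metrics)
print(selected)
```

Because the model is chosen by keyword up front, the script can run unattended, which is the motivation given in the description.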

RMSnow

Nice PR description

RMSnow · 2024-01-01T14:19:50Z

bins/calc_metrics.py

-                    deg_dir, ref_dir, dump_dir
-                )
-                result[metric] = str(similarity_score)
+        if metric in ["fad", "rawnet3_similarity"]:


Why is "fad" here? It seems that the original code does not contain a conditional check for fad. Is there a bug in the old code?

Yes. When the input-selection part of the old code was modified, the FAD part was mistakenly deleted. It should be here, as shown in the original commit, at line 64: 9682d0c#diff-4fa833e1c8dd8d05d182f8262a2cc5f727dc72a364db06f8acc5536eff3e6506

Because huggingface.co is firewalled on the Aliyun server, all parts concerning FAD had to be commented out before testing calc_metrics.py, or else the script could not initialize correctly. That is why the deletion went undetected.

@VocodexElysium Please review this.

> @VocodexElysium Please review this.

Basically, if your network environment cannot reach Hugging Face from the terminal, importing FAD will raise errors because it tries to connect to Hugging Face. You can avoid this by setting up a suitable VPN environment, or by downloading the required files yourself and adjusting the FAD code to load the model from your computer rather than from Hugging Face (I think MingXuan did this successfully). So I don't think removing the FAD-related code is necessary: the problem only involves the network environment and is solvable.

Merakist · 2024-01-01T15:21:58Z

New commit: fixed a typo found in Amphion/egs/metrics/README.md: Aduio -> Audio

RMSnow

LGTM

VocodexElysium · 2024-01-02T10:12:21Z

@Merakist Calculating FAD needs a connection to Hugging Face. I think the README.md needs to state that, and also explain how to download the pretrained model and specify its folder for calculating FAD when a connection to Hugging Face cannot be guaranteed.

RMSnow

@Merakist @VocodexElysium Please work together to refine the FAD-related doc so that it is friendly and easy for users to understand.

Merakist · 2024-01-02T15:30:49Z

Updated README. Proposed a solution for FAD to load local models when huggingface.co is unreachable from the command line.

VocodexElysium · 2024-01-03T06:24:43Z

> Updated README. Proposed a solution for FAD to load local models when huggingface.co is unreachable from the command line.

Here is some advice:

Use another name instead of “Additional Troublesome Information”. It’s weird to use such a term, especially when that connection error is not caused by us but by some “internet issues”.
Attach Hugging Face links for the pretrained models; people need to know where to obtain them.
Do not show your absolute path in the script, since it will leak information about your server.

Merakist · 2024-01-03T07:44:53Z

@VocodexElysium Roger. The new commit condensed the title from Additional Troubleshooting Info to Troubleshooting (I believe you have misread), which should be more explicit. The links to the models are included and paths are modified.

VocodexElysium · 2024-01-03T14:01:52Z

> @VocodexElysium Roger. The new commit condensed the title from Additional Troubleshooting Info to Troubleshooting (I believe you have misread), which should be more explicit. The links to the models are included and paths are modified.

I think you forgot to create folders for bert, roberta, and bart in the Amphion/pretrained directory. Please add them.

Merakist · 2024-01-04T07:43:27Z

> > @VocodexElysium Roger. The new commit condensed the title from Additional Troubleshooting Info to Troubleshooting (I believe you have misread), which should be more explicit. The links to the models are included and paths are modified.
>
> I think you forgot to create folders for bert, roberta, and bart in the Amphion/pretrained directory. Please add them.

@VocodexElysium I have updated the README.md under Amphion/pretrained to reflect the model dependencies for the evaluation pipeline with file structure trees, and created folders for bert-base-uncased, facebook/bart-base and roberta-base under Amphion/pretrained.
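The local-first loading idea discussed in this thread can be sketched as follows. This is an illustration under stated assumptions, not the code added to Amphion: resolve_model_source is a hypothetical helper, the directory layout simply mirrors the folder names mentioned above, and how the evaluation code actually loads its models is not shown in this thread.

```python
import tempfile
from pathlib import Path


def resolve_model_source(model_id, pretrained_root):
    """Prefer a non-empty local folder for model_id; otherwise return the hub id."""
    local_dir = Path(pretrained_root) / model_id
    if local_dir.is_dir() and any(local_dir.iterdir()):
        return str(local_dir)
    return model_id


# Demonstration with a temporary stand-in for the pretrained directory.
with tempfile.TemporaryDirectory() as root:
    local = Path(root) / "roberta-base"
    local.mkdir()
    (local / "config.json").write_text("{}")  # placeholder file, not a real model

    found = resolve_model_source("roberta-base", root)         # local folder exists
    missing = resolve_model_source("bert-base-uncased", root)  # falls back to hub id

print(found)
print(missing)
```

The returned string could then be handed to a loader that accepts either a hub identifier or a local directory, as Hugging Face from_pretrained methods do, so an offline machine never attempts a download when the files are already present.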

VocodexElysium

Able to merge

Metrify RawNet3/Resemblyzer keyword and update READMEs

bc68a39

Merakist requested a review from lmxue January 1, 2024 14:15

RMSnow requested changes Jan 1, 2024

Merakist requested a review from RMSnow January 1, 2024 14:59

Metrify RawNet3/Resemblyzer keyword and update READMEs

7ab87a1

RMSnow approved these changes Jan 2, 2024

RMSnow requested changes Jan 2, 2024

Update README with Local Model Loading Instructions for FAD

14bbbb3

Merakist requested review from RMSnow and VocodexElysium January 2, 2024 15:31

Update README with Local Model Loading Instructions for FAD

609acf1

Merakist added 2 commits January 4, 2024 15:28

Update README with Local Model File Structures

b7e290c

Update README with Local Model File Structures

eb71d22

VocodexElysium reviewed Jan 6, 2024

VocodexElysium approved these changes Jan 6, 2024

RMSnow approved these changes Jan 6, 2024

RMSnow merged commit 9b287c8 into open-mmlab:main Jan 6, 2024
1 check passed

Merakist deleted the metrify-resemblyzer branch January 25, 2024 08:43
