Skip to content

Diarization Inconsistencies with Nova-2 and Limited Language Support in Other Models #1033

Discussion options

You must be logged in to vote

Yes, we've heard this a lot and understand the limits of Diarization. Our know our product team has been discussing improving it in 2025, but I don't have an ETA yet of when that improvement might be released.

Here are some suggestions to try:

  1. Prepend audio from the primary speaker: For short audio files (under 3 minutes), prepend a 30-second clip of the primary speaker's voice before the full audio. This gives the diarization model a reference point to more reliably identify that speaker throughout the rest of the audio. <https://deepgram.gitbook.io/help-center/faq/improving-diarization-by-prepending-audio-from-the-primary-speaker|Help Center>
  2. Use multichannel audio: When possible, use …

Replies: 4 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by deepgram-community
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
1 participant