Facing accuracy issues while transcribing Hindi (Indian regional language) #147
Currently, accuracy for Hindi is not good. Any suggestions for moving in the right direction to achieve good accuracy?
Replies: 3 comments
Hi @nagarajuprimefocus, I recommend you try changing the tier you are using. Hindi is available on the
The other option would be to try Deepgram's Whisper Cloud offering:
or try:
Hi @SandraRodgers, the input audio can be in either Hindi or English. I don't want to manually pass the language for each audio file.
Hi @rahulbansal16, with both Deepgram's enhanced and base model tiers and with our managed Whisper offering, you can use `detect_language=true`, and the model will detect whether each file is in Hindi or English and transcribe it in that language.

https://api.deepgram.com/v1/listen?tier=enhanced&detect_language=true
https://api.deepgram.com/v1/listen?model=whisper&detect_language=true

Unfortunately, however, the current default for Hindi is Devanagari script, so if Hindi is detected, it will be transcribed in Devanagari (`hi` language code). The only way to get Latin alphabet transcription is to know the file is in Hindi and specify `language=hi-Latn`.

We understand that this is not ideal, and are exploring a future feature that will give customers more control over the languages and scripts that can be detected.
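
For anyone scripting this, here is a minimal sketch of how the two cases in this reply could be turned into request URLs. It uses only the endpoint and query parameters quoted in the thread (`tier=enhanced`, `model=whisper`, `detect_language=true`, `language=hi-Latn`). The function name `build_listen_url` and its arguments `known_hindi` and `use_whisper` are hypothetical names for illustration, not part of any Deepgram SDK. The thread does not say which tier or model accepts `hi-Latn`, so combining it with `tier=enhanced` or `model=whisper` is an assumption to verify against the docs. Sending the request (authentication and audio body) is left out.

```python
from urllib.parse import urlencode

# Endpoint as quoted in the reply above.
BASE_URL = "https://api.deepgram.com/v1/listen"


def build_listen_url(known_hindi: bool = False, use_whisper: bool = False) -> str:
    """Build a query URL from the parameters mentioned in this thread.

    All names here are hypothetical helpers for this sketch.
    """
    # Choose between the managed Whisper model and the enhanced tier.
    params = {"model": "whisper"} if use_whisper else {"tier": "enhanced"}
    if known_hindi:
        # Latin-script output requires naming the language explicitly.
        # Assumption: hi-Latn is accepted with the chosen tier/model.
        params["language"] = "hi-Latn"
    else:
        # Let the service detect Hindi vs. English per file.
        params["detect_language"] = "true"
    return f"{BASE_URL}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_listen_url())
    print(build_listen_url(use_whisper=True))
    print(build_listen_url(known_hindi=True))
```

The sketch makes the trade-off explicit: detection and Latin-script Hindi are separate request shapes, so a file has to be known to be Hindi before `hi-Latn` can be requested.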