Skip to content

Facing accuracy issues while transcribing Hindi (indian regional language) #147

Discussion options

You must be logged in to vote

Hi @rahulbansal16, with both Deepgram's enhanced and base model tiers and with our managed Whisper offering, you can use detect_language=true, and the model will detect whether each file is in Hindi or English, and transcribe it in that language.

https://api.deepgram.com/v1/listen?tier=enhanced&detect_language=true
https://api.deepgram.com/v1/listen?model=whisper&detect_language=true

Unfortunately however, the current default for Hindi is in Devanagari script, so if Hindi is detected, it will transcribe in Devanagari (hi language code). The only way to get Latin alphabet transcription is to know the file is in Hindi, and specify language=hi-Latn.

We understand that this is not ideal, and …

Replies: 3 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by jpvajda
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
4 participants