Missing punctuation in Chinese Transcription #321
Which Deepgram product are you using?
Deepgram API

Details
I notice that punctuation works well when transcribing English. However, when the input file is Chinese, with detect language set to true, the response missed probably 85% of the necessary periods. Is this something you all are aware of / is there a fix coming?

If you are making a request to the Deepgram API, what is the full Deepgram URL you are making a request to?
POST /v1/listen?punctuate=true&model=whisper-large&detect_language=true

If you are making a request to the Deepgram API and have a request ID, please paste it below:
42bf5903-6db5-407a-95c9-07d98d60d8b7

If possible, please attach your code or paste it into the text box.
async function submitAsyncRequest(audioSource) {

If possible, please attach an example audio file to reproduce the issue.
No response
Replies: 1 comment 3 replies
Hi @RobZuazua, Deepgram does support punctuation more robustly in English than in Chinese. However, I also recommend trying smart_format=true instead of punctuate=true. Smart Formatting improves the overall readability of the transcript, including general punctuation and grammar. You may still notice some gaps in formatting for Chinese, but it will give you the best results currently available. We continue to actively work on improving formatting across the many languages that we support.
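
For anyone who wants to try this, here is a minimal sketch of the request from the original post with smart_format=true swapped in for punctuate=true. It is not the original poster's code. The host `https://api.deepgram.com`, the `Token` authorization header, the JSON body with a `url` field, and the helper names `buildListenUrl` and `transcribeUrl` are all assumptions for illustration, so check them against your own setup before relying on them.

```javascript
// Sketch only: rebuilds the /v1/listen request from the original post,
// replacing punctuate=true with smart_format=true as suggested above.
const DEEPGRAM_BASE_URL = "https://api.deepgram.com"; // assumed host

// Hypothetical helper: returns the request URL with the query parameters
// from the post. Entries in `overrides` replace or extend the defaults.
function buildListenUrl(overrides = {}) {
  const params = new URLSearchParams({
    model: "whisper-large",
    detect_language: "true",
    smart_format: "true",
    ...overrides,
  });
  return `${DEEPGRAM_BASE_URL}/v1/listen?${params.toString()}`;
}

// Hypothetical helper: submits a hosted audio file for transcription.
// `apiKey` and `audioUrl` are supplied by the caller. The header and body
// shapes are assumptions, not taken from this thread.
async function transcribeUrl(apiKey, audioUrl) {
  const response = await fetch(buildListenUrl(), {
    method: "POST",
    headers: {
      Authorization: `Token ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ url: audioUrl }),
  });
  if (!response.ok) {
    throw new Error(`Request failed with status ${response.status}`);
  }
  return response.json();
}
```

Keeping the URL construction in its own function makes it easy to log the exact query string and compare it with what the API received when following up with a request ID.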