wav file "invalid data received" #378

dvorhes · 2023-10-18T21:45:13Z

dvorhes
Oct 18, 2023

Are there any specific settings that deepgram expects for a wav file? frequency? mono/stereo?

Getting an invalid data received "bad request" error using boilerplate code seen here, shared june 6 by Jason Maldonis.

https://github.com/orgs/deepgram/discussions/188

Nov 8, 2023

Hi @dvorhes, thanks for sharing this update, and I'm glad to hear you did get this working with re-encoding. It does look like an issue originating from Adobe itself. See how the original file has a bitrate of 836 kb/s, while the re-encoded one has a bitrate of 768 kb/s, indicating that the original bitrate is incorrect.

If you're able to specify your preferred output encoding, we recommend MP3 with a bitrate of 192 kb/s, and constant bitrate (CBR; as opposed to VBR variable bitrate).

View full answer

jkroll-deepgram · 2023-10-19T21:19:10Z

jkroll-deepgram
Oct 19, 2023
Collaborator

Hi @dvorhes, you shouldn't need to match any particular specifications for audio frequency or number of channels. Deepgram accepts a wide variety of audio. There can be several reasons why this "invalid data received" error occurs.

One quick troubleshooting spot, are you using the code exactly as-is? That code sample is for an mp4 file type. Make sure to specify your audio mimetype as "audio/wav" rather than "audio/mp4".

If you can share a request ID from one of those "bad requests", we can look into it further as well. Ditto to sharing the exact code you're using, even if it's almost identical to the code sample in the other post you linked. This is likely to be some small tweak needed so Deepgram is receiving the type of audio it's expecting.

2 replies

dvorhes Oct 19, 2023
Author

code as follows, below.

Good call, I did have the mimetype as mp4 in this attempt-- had it as audio/wav in previous experiments, but even making that change here wasn't the fix unfortunately.

FYI in the main() function, PATH_TO_FILE is swapped to the absolute string path on local machine. A 48khz, 16bit wav, mono wav file.

strangely enough, I just got the below code to work ONCE. It printed a transcript. Since running it subsequent times, it has failed.

One such failure:

Error: {'err_code': 'Bad Request', 'err_msg': 'Bad Request: Invalid data received.', 'request_id': '31a85f4c-9685-47ce-9a4b-9de6b2235ebd'}

Thanks for your help.

from typing import Any
import requests
from creds.deepgram_keys import DEEPGRAM_KEYS


def send_audio_to_nova(blob: Any, audio_type) -> str:
    url = "https://api.deepgram.com/v1/listen?model=nova"
    headers = {}
    headers["Authorization"] = f"Token {DEEPGRAM_KEYS}"
    headers["Content-Type"] = audio_type
    print("audio_type: ", audio_type)

    try:
        response = requests.post(url, headers=headers, data=blob, timeout=10)

        if response.status_code == 200:
            data = response.json()
            print("Transcript:", data["results"]["channels"][0]["alternatives"][0]["transcript"])
            return data["results"]["channels"][0]["alternatives"][0]["transcript"]
        else:
            print("Error:", response.json())
            return ""
    except Exception as error:
        print("Error:", error)
        return ""


def main():

  # 48khz, 16bit wav, mono wav file.
  PATH_TO_FILE = '<path_to_file>'

  # Download the audio from: https://static.deepgram.com/examples/en_NatGen_Medical_DocDictation.m4a
  with open(PATH_TO_FILE, "rb") as f:
      blob = f.read()
  audio_type = "audio/wav"
  send_audio_to_nova(blob=blob, audio_type=audio_type)


if __name__ == "__main__":
    main()

jkroll-deepgram Oct 19, 2023
Collaborator

Unfortunately I'm not able to replicate your error off the bat. One test I did was to download one of our sample files (https://static.deepgram.com/examples/interview_speech-analytics.wav) and isolate just one channel for testing (ffmpeg -i interview_speech-analytics.wav -af "pan=mono|FC=FR" right_mono.wav). That's also a 48khz 16bit mono wav file and it transcribed fine with your code as-is.

With a longer file, I ran into Error: ('Connection aborted.', TimeoutError('The write operation timed out')) due to that timeout=10 setting, but when I bumped it to timeout=60, then longer files worked fine as well.

Looking at your request ID, I do see the "invalid data received" in our logs, but I'm still not able to reproduce the issue.

Have you tried running this code with only one file, or are you seeing it on multiple files?

Are you still able to reproduce it now? Or was it only happening yesterday, and perhaps we had some transient issue on our side that's since been resolved, and is no longer occurring?

dvorhes · 2023-11-06T20:20:30Z

dvorhes
Nov 6, 2023
Author

Hi Julia, Thanks for your reply. Finally back on this problem and I think I got it to work. The solution did indeed involve re-encoding via ffmpeg. My hunch is that the ffmpeg settings, mono/stereo or otherwise don't matter that much, rather it's the wav file's encoder metadata that is producing a problem. Does the deepgram API look for a specific encoder? For reference, the problem-file was encoded directly from Adobe Premiere. The working file was a re-encoded version of that file via ffmpeg. ffprobe data from both files here: NOT WORKING: Input #0, wav, from 'vo_transcription.wav': Metadata: encoded_by : Adobe Premiere Pro 2024.0 (Macin encoder : Adobe Photoshop 23.2 (20220128.orig.527 28d5e1a) (Macintosh) date : 2023-11-02 creation_time : 18:07:27 time_reference : 0 Duration: 00:02:38.20, bitrate: 836 kb/s Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 48000 Hz, 1 channels, s16, 768 kb/s DOES WORK: Input #0, wav, from 'vo_transcription_mono_test2.wav': Metadata: date : 2023-11-02 encoder : Lavf60.3.100 encoded_by : Adobe Premiere Pro 2024.0 (Macin Duration: 00:02:38.20, bitrate: 768 kb/s Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 48000 Hz, 1 channels, s16, 768 kb/s Any ideas?

…

On Thu, Oct 19, 2023 at 6:45 PM Julia Kroll ***@***.***> wrote: Unfortunately I'm not able to replicate your error off the bat. One test I did was to download one of our sample files ( https://static.deepgram.com/examples/interview_speech-analytics.wav) and isolate just one channel for testing (ffmpeg -i interview_speech-analytics.wav -af "pan=mono|FC=FR" right_mono.wav). That's also a 48khz 16bit mono wav file and it transcribed fine with your code as-is. With a longer file, I ran into Error: ('Connection aborted.', TimeoutError('The write operation timed out')) due to that timeout=10 setting, but when I bumped it to timeout=60, then longer files worked fine as well. Looking at your request ID, I do see the "invalid data received" in our logs, but I'm still not able to reproduce the issue. Have you tried running this code with only one file, or are you seeing it on multiple files? Are you still able to reproduce it now? Or was it only happening yesterday, and perhaps we had some transient issue on our side that's since been resolved, and is no longer occurring? — Reply to this email directly, view it on GitHub <#378 (reply in thread)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AORTNQFEJLVNNID2G2HRW5TYAGUPXAVCNFSM6AAAAAA6GGAZ7SVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM3TGMZTGA3TK> . You are receiving this because you were mentioned.Message ID: ***@***.***>

-- www.bigfootage.net

1 reply

jkroll-deepgram Nov 8, 2023
Collaborator

Hi @dvorhes, thanks for sharing this update, and I'm glad to hear you did get this working with re-encoding. It does look like an issue originating from Adobe itself. See how the original file has a bitrate of 836 kb/s, while the re-encoded one has a bitrate of 768 kb/s, indicating that the original bitrate is incorrect.

If you're able to specify your preferred output encoding, we recommend MP3 with a bitrate of 192 kb/s, and constant bitrate (CBR; as opposed to VBR variable bitrate).

Answer selected by jpvajda

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deepgram

wav file "invalid data received" #378

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 2 comments 3 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Deepgram

wav file "invalid data received" #378

dvorhes Oct 18, 2023

Replies: 2 comments · 3 replies

jkroll-deepgram Oct 19, 2023 Collaborator

dvorhes Oct 19, 2023 Author

jkroll-deepgram Oct 19, 2023 Collaborator

dvorhes Nov 6, 2023 Author

jkroll-deepgram Nov 8, 2023 Collaborator

dvorhes
Oct 18, 2023

Replies: 2 comments 3 replies

jkroll-deepgram
Oct 19, 2023
Collaborator

dvorhes Oct 19, 2023
Author

jkroll-deepgram Oct 19, 2023
Collaborator

dvorhes
Nov 6, 2023
Author

jkroll-deepgram Nov 8, 2023
Collaborator