Troubleshooting: Bumblebee audio transcription provides same text for few parsed webm files

The issue with Xav.next_frame
Probably it takes only first frame
Passing entire file like it was done in cool-whisper-server works

iex(10)> Nx.Serving.batched_run(Recognizer.AudioServing, {:file, "/home/maryna/Music/audio-17405164795554743983100781892.webm"})
%{
  chunks: [
    %{
      text: " Finally in temporary digital storage.",
      start_timestamp_seconds: nil,
      end_timestamp_seconds: nil
    }
  ]
}
3 Likes