I’ve mentioned it previously in this thread that I’m poking and playing with the Seeed Studio ReTerminal DM which is a $300-400 piece of industrial kiosk kit. I like the look and feel of it and it makes for a very promising prototype platform for anyone wanting to do visual or interactive stuff with Nerves.
Tom got most of the peripherals working via Linux drivers. So I can query a bunch of driver “files” to get light levels for example. But the mic was not working.
arecord -l did not give the expected device information.
So I’ve had a spelunk all across the Seeed studio overlays, added new firmware versions for RPi. Poked alsa settings. Then found a missing piece of linux config. Added that.
Suffice to say I have the microphone working on the ReTerminal DM.
And also, I’m fairly certain everything except that piece of linux config can be removed. Verifying that now…
I spent a bunch of time hacking away on a test script for the VAD ONNX thing that should give us the ability to know when the microphone(s) detect speech and only then do cool things with it. Should prevent picking up noise and doing unnecessary continuous compute. The current versions of the model are no good for those instructions. I still gotta get this working.
The next step would be running Whisper on the chunks that are speech. And that I already know I can do.
Also the ReTerminal DM has an M.2 slot and I believe Seeed Studio supports it and the ReComputer with the AI kit. They have benchmarks. So if/when we get AI Kit support into Nerves this workload could hopefully be accelerated.