Silent speech interface

(see also my survey article on this subject)

Silent speech interfaces attempt to discern speech without any (or with very little) audible utterance. They’re a form of Voice-based interface.

I’m excited about these systems as a possible poor man’s Brain-computer interface. In the ideal case, they’d allow pervasive, unobtrusive command and textual input, and even a slow form of synthetic telepathy. More practically, they’re perhaps a way to resolve Reading texts on computers is unpleasant by bringing elements of the Dynamic medium to physical books.

A huge variety of sensing modalities are possible here, all with different trade-offs, broadly focusing on:

  • cortical and nervous system signals (EEG, iEEG)
  • motor neuron signals (surface EMG)
  • motion of lips, face, jaw, vocal tract (ultrasound, video, radar, Doppler, strain gauges, magnetic implants, etc)
  • acoustics (special microphones for whispers / murmurs)

Many of these approaches are invasive or highly obtrusive with current and foreseen sensors, and I’m interested in consumer scenarios, so I’ll focus on non-invasive and relatively non-obtrusive approaches.

The best routes I’ve seen seem to be (as of 2022 / 2024):

References

Apart from the specific systems above, this book offers a helpful survey:
Freitas, J., Teixeira, A., Dias, M. S., & Silva, S. (2017). An Introduction to Silent Speech Interfaces. Springer International Publishing

Last updated 2024-09-11.