Google llc (20240249741). Guided Speech Enhancement Network simplified abstract

From WikiPatents
Jump to navigation Jump to search

Guided Speech Enhancement Network

Organization Name

google llc

Inventor(s)

George Chiachi Sung of San Diego CA (US)

Yang Yang of San Diego CA (US)

Shao-Fu Shih of Mountain View CA (US)

Hakan Erdogan of Lexington MA (US)

Jamie Menjay Lin of San Diego CA (US)

Guided Speech Enhancement Network - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240249741 titled 'Guided Speech Enhancement Network

The method described in the abstract involves processing audio data using a trained guided speech-enhancement network to enhance the audio signal captured by an audio input device and spatially-filtered by a beamformer.

  • The method receives reference audio data and spatially-filtered audio data from a beamformer.
  • The beamformer attenuates interfering signals in the spatially-filtered audio data based on additional audio data captured by other audio input devices.
  • A trained guided speech-enhancement network processes the reference audio data and spatially-filtered audio data to further attenuate interfering signals in the enhanced audio data.
  • The enhanced audio data output by the method has improved speech quality and reduced interference from background noise.
  • This technology combines beamforming and speech-enhancement techniques to enhance audio signals in noisy environments.

Potential Applications: - Teleconferencing systems - Hearing aid devices - Audio recording equipment - Speech recognition software - Noise-canceling headphones

Problems Solved: - Improving speech quality in noisy environments - Reducing interference from background noise - Enhancing audio signals for better clarity and intelligibility

Benefits: - Enhanced speech intelligibility - Improved audio quality in noisy environments - Better user experience in audio communication devices

Commercial Applications: Title: Advanced Speech-Enhancement Technology for Audio Devices This technology can be used in telecommunication systems, hearing aids, and audio recording equipment to improve speech quality and reduce background noise, enhancing user experience and product performance in noisy environments.

Questions about the technology: 1. How does the trained guided speech-enhancement network improve audio quality? The trained network processes reference and spatially-filtered audio data to further attenuate interfering signals, resulting in enhanced speech clarity. 2. What are the potential applications of this combined beamforming and speech-enhancement technology? This technology can be applied in various audio devices such as teleconferencing systems, hearing aids, and noise-canceling headphones to improve speech quality and reduce background noise.


Original Abstract Submitted

a method includes receiving, as input, reference audio data representing a reference audio signal captured by an audio input device. the method also includes receiving, as input, from a beamformer, spatially-filtered audio data representing an output of the beamformer, the beamformer configured to spatially filter, based on additional audio data captured by one or more additional audio input devices, the reference audio data to attenuate one or more interfering signals in the spatially-filtered audio data. the method processes, using a trained guided speech-enhancement network, the reference audio data and the spatially-filtered audio data to generate, as output, enhanced audio data, the guided speech-enhancement network processing the reference audio data and the spatially-filtered audio data to further attenuate, in the enhanced audio data, the one or more interfering signals attenuated by the beamformer.