18159679. Guided Speech Enhancement Network simplified abstract (GOOGLE LLC)

From WikiPatents
Jump to navigation Jump to search

Guided Speech Enhancement Network

Organization Name

GOOGLE LLC

Inventor(s)

George Chiachi Sung of San Diego CA (US)

Yang Yang of San Diego CA (US)

Shao-Fu Shih of Mountain View CA (US)

Hakan Erdogan of Lexington MA (US)

Jamie Menjay Lin of San Diego CA (US)

Guided Speech Enhancement Network - A simplified explanation of the abstract

This abstract first appeared for US patent application 18159679 titled 'Guided Speech Enhancement Network

The method described in the abstract involves processing audio data captured by an input device and spatially-filtered audio data from a beamformer to enhance the audio signal.

  • The method utilizes a trained guided speech-enhancement network to further attenuate interfering signals in the enhanced audio data.
  • The beamformer spatially filters the reference audio data to reduce interfering signals based on additional audio data from other input devices.
  • The trained guided speech-enhancement network processes both the reference audio data and spatially-filtered audio data to improve the overall audio quality.
  • The innovation aims to enhance speech clarity and reduce background noise in audio signals.
  • This technology can be applied in various audio processing systems, such as conference call systems, voice recognition software, and audio recording devices.

Potential Applications: - Conference call systems - Voice recognition software - Audio recording devices

Problems Solved: - Enhancing speech clarity - Reducing background noise in audio signals

Benefits: - Improved audio quality - Enhanced speech intelligibility - Better performance in noisy environments

Commercial Applications: Title: Advanced Speech Enhancement Technology for Audio Processing Systems Description: This technology can be utilized in conference call systems, voice recognition software, and audio recording devices to improve speech clarity and reduce background noise, enhancing overall audio quality and user experience. The market implications include increased demand for high-quality audio processing solutions in various industries.

Questions about the technology: 1. How does the trained guided speech-enhancement network improve audio quality? 2. What are the potential commercial uses of this advanced speech enhancement technology?


Original Abstract Submitted

A method includes receiving, as input, reference audio data representing a reference audio signal captured by an audio input device. The method also includes receiving, as input, from a beamformer, spatially-filtered audio data representing an output of the beamformer, the beamformer configured to spatially filter, based on additional audio data captured by one or more additional audio input devices, the reference audio data to attenuate one or more interfering signals in the spatially-filtered audio data. The method processes, using a trained guided speech-enhancement network, the reference audio data and the spatially-filtered audio data to generate, as output, enhanced audio data, the guided speech-enhancement network processing the reference audio data and the spatially-filtered audio data to further attenuate, in the enhanced audio data, the one or more interfering signals attenuated by the beamformer.