Synaptics Incorporated (20240257827). NEURAL TEMPORAL BEAMFORMER FOR NOISE REDUCTION IN SINGLE-CHANNEL AUDIO SIGNALS simplified abstract

From WikiPatents
Jump to navigation Jump to search

NEURAL TEMPORAL BEAMFORMER FOR NOISE REDUCTION IN SINGLE-CHANNEL AUDIO SIGNALS

Organization Name

Synaptics Incorporated

Inventor(s)

Saeed Mosayyebpour Kaskari of Irvine CA (US)

NEURAL TEMPORAL BEAMFORMER FOR NOISE REDUCTION IN SINGLE-CHANNEL AUDIO SIGNALS - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240257827 titled 'NEURAL TEMPORAL BEAMFORMER FOR NOISE REDUCTION IN SINGLE-CHANNEL AUDIO SIGNALS

Simplified Explanation

This patent application describes a system for speech enhancement using multi-frame beamforming with neural network supervision. It involves a linear filter, a deep neural network (DNN), a voice activity detector (VAD), and an IFC calculator to suppress noise in audio signals.

  • The system uses a DNN to determine the probability of speech in a current audio frame.
  • A VAD then decides if speech is present based on this probability.
  • An IFC calculator estimates an IFC vector using the DNN output and VAD indication.
  • The linear filter uses the IFC vector to reduce noise in the audio frame.

Key Features and Innovation

  • Multi-frame beamforming with neural network supervision for speech enhancement.
  • Integration of a DNN, VAD, and IFC calculator for noise suppression.
  • Real-time processing of audio signals to improve speech quality.

Potential Applications

  • Telecommunications for clearer audio in phone calls.
  • Audio recording devices for better sound quality.
  • Speech recognition systems for improved accuracy.

Problems Solved

  • Noise interference in audio signals.
  • Inaccurate speech detection.
  • Poor audio quality in communication systems.

Benefits

  • Enhanced speech clarity.
  • Improved noise reduction.
  • Better overall audio quality.

Commercial Applications

Title: Advanced Speech Enhancement System for Telecommunications This technology can be used in:

  • Smartphone applications for clearer calls.
  • Conference call systems for better audio quality.
  • Voice-controlled devices for improved speech recognition.

Prior Art

Research on multi-frame beamforming and neural network supervision in speech enhancement systems can be found in academic journals and patents related to audio signal processing.

Frequently Updated Research

Ongoing studies focus on optimizing neural network models for more accurate speech detection and noise suppression in real-time audio processing.

Questions about Speech Enhancement Systems

How does multi-frame beamforming improve speech quality?

Multi-frame beamforming combines information from multiple audio frames to enhance speech signals, reducing noise and improving clarity.

What are the key components of a speech enhancement system?

A speech enhancement system typically includes a DNN, VAD, IFC calculator, and linear filter to process audio signals for noise suppression and speech enhancement.


Original Abstract Submitted

this disclosure provides methods, devices, and systems for audio signal processing. the present implementations more specifically relate to multi-frame beamforming using neural network supervision. in some aspects, a speech enhancement system may include a linear filter, a deep neural network (dnn), a voice activity detector (vad), and an ifc calculator. the dnn infers a probability of speech (p) in a current frame of a single-channel audio signal based on a neural network model. the vad determines whether speech is present or absent in the current audio frame based on the probability of speech p. the ifc calculator may estimate an ifc vector based on the output of the dnn (such as the probability of speech p) and the output of the vad (such as an indication of whether speech is present in the current frame). the linear filter uses the ifc vector to suppress noise in the current audio frame.