Synaptics Incorporated (20240304204). LOW-LATENCY SPEECH ENHANCEMENT simplified abstract

From WikiPatents
Jump to navigation Jump to search

LOW-LATENCY SPEECH ENHANCEMENT

Organization Name

Synaptics Incorporated

Inventor(s)

Saeed Mosayyebpour Kaskari of Irvine CA (US)

LOW-LATENCY SPEECH ENHANCEMENT - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240304204 titled 'LOW-LATENCY SPEECH ENHANCEMENT

Simplified Explanation: This patent application discusses methods, devices, and systems for audio signal processing, specifically focusing on low-latency speech enhancement. The system processes time-domain samples of a signal to enhance speech quality.

  • The system receives frames of a signal, transforms them into frequency-domain samples using Fast Fourier Transform (FFT), and determines the probability of speech in the signal based on these samples.
  • It further decimates the frequency-domain samples to improve efficiency and accuracy in speech enhancement.

Key Features and Innovation:

  • Utilizes Fast Fourier Transform (FFT) for transforming time-domain samples into frequency-domain samples.
  • Determines the probability of speech in the signal based on the frequency-domain samples.
  • Decimates the frequency-domain samples to enhance efficiency in speech enhancement.

Potential Applications: This technology can be applied in:

  • Speech recognition systems
  • Telecommunication devices
  • Audio recording and editing software

Problems Solved:

  • Low-latency speech enhancement
  • Efficient processing of audio signals
  • Improved speech quality in noisy environments

Benefits:

  • Enhanced speech quality
  • Reduced latency in speech processing
  • Improved accuracy in speech recognition

Commercial Applications: Potential commercial uses include:

  • Speech recognition software for customer service centers
  • Audio processing tools for content creators
  • Communication devices for noisy environments

Prior Art: Prior research in speech enhancement and audio signal processing can be found in academic journals and patents related to FFT and speech recognition technologies.

Frequently Updated Research: Stay updated on advancements in FFT algorithms for audio signal processing and speech enhancement techniques for real-time applications.

Questions about Speech Enhancement: 1. How does decimation of frequency-domain samples improve speech enhancement? 2. What are the key differences between low-latency speech enhancement and traditional speech processing methods?


Original Abstract Submitted

this disclosure provides methods, devices, and systems for audio signal processing. the present implementations more specifically relate to low-latency speech enhancement. in some aspects, a speech enhancement system may receive a number (b) of frames of a signal, where each of the b frames include a number (n) of time-domain samples. the speech enhancement system may transform the b*n time-domain samples into b*n first frequency-domain samples based on an n-point fast fourier transform (fft), and may further transform the b*n first frequency-domain samples into b*n second frequency-domain samples based on a b-point fft. the speech enhancement system may determine a probability of speech in the signal based at least in part on the b*n second frequency-domain samples. in some implementations, the speech enhancement system may decimate the b*n second frequency-domain samples by a factor (d), and the probability of speech is determined based on the b*n/d decimated second frequency-domain samples.