LOW-LATENCY SPEECH ENHANCEMENT

Organization Name

Inventor(s)

Saeed Mosayyebpour Kaskari of Irvine CA (US)

LOW-LATENCY SPEECH ENHANCEMENT - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240304204 titled 'LOW-LATENCY SPEECH ENHANCEMENT

Simplified Explanation: This patent application discusses methods, devices, and systems for audio signal processing, specifically focusing on low-latency speech enhancement. The system processes time-domain samples of a signal to enhance speech quality.

The system receives frames of a signal, transforms them into frequency-domain samples using Fast Fourier Transform (FFT), and determines the probability of speech in the signal based on these samples.
It further decimates the frequency-domain samples to improve efficiency and accuracy in speech enhancement.

Key Features and Innovation:

Utilizes Fast Fourier Transform (FFT) for transforming time-domain samples into frequency-domain samples.
Determines the probability of speech in the signal based on the frequency-domain samples.
Decimates the frequency-domain samples to enhance efficiency in speech enhancement.

Potential Applications: This technology can be applied in:

Speech recognition systems
Telecommunication devices
Audio recording and editing software

Problems Solved:

Low-latency speech enhancement
Efficient processing of audio signals
Improved speech quality in noisy environments

Benefits:

Enhanced speech quality
Reduced latency in speech processing
Improved accuracy in speech recognition

Commercial Applications: Potential commercial uses include:

Speech recognition software for customer service centers
Audio processing tools for content creators
Communication devices for noisy environments

Prior Art: Prior research in speech enhancement and audio signal processing can be found in academic journals and patents related to FFT and speech recognition technologies.

Frequently Updated Research: Stay updated on advancements in FFT algorithms for audio signal processing and speech enhancement techniques for real-time applications.

Questions about Speech Enhancement: 1. How does decimation of frequency-domain samples improve speech enhancement? 2. What are the key differences between low-latency speech enhancement and traditional speech processing methods?

Original Abstract Submitted

this disclosure provides methods, devices, and systems for audio signal processing. the present implementations more specifically relate to low-latency speech enhancement. in some aspects, a speech enhancement system may receive a number (b) of frames of a signal, where each of the b frames include a number (n) of time-domain samples. the speech enhancement system may transform the b*n time-domain samples into b*n first frequency-domain samples based on an n-point fast fourier transform (fft), and may further transform the b*n first frequency-domain samples into b*n second frequency-domain samples based on a b-point fft. the speech enhancement system may determine a probability of speech in the signal based at least in part on the b*n second frequency-domain samples. in some implementations, the speech enhancement system may decimate the b*n second frequency-domain samples by a factor (d), and the probability of speech is determined based on the b*n/d decimated second frequency-domain samples.

Synaptics Incorporated (20240304204). LOW-LATENCY SPEECH ENHANCEMENT simplified abstract

Contents

LOW-LATENCY SPEECH ENHANCEMENT

Organization Name

Inventor(s)

LOW-LATENCY SPEECH ENHANCEMENT - A simplified explanation of the abstract

Original Abstract Submitted

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools