Synaptics Incorporated (20240304204). LOW-LATENCY SPEECH ENHANCEMENT simplified abstract
LOW-LATENCY SPEECH ENHANCEMENT
Organization Name
Synaptics Incorporated
Inventor(s)
Saeed Mosayyebpour Kaskari of Irvine CA (US)
LOW-LATENCY SPEECH ENHANCEMENT - A simplified explanation of the abstract
This abstract first appeared for US patent application 20240304204 titled 'LOW-LATENCY SPEECH ENHANCEMENT'.
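Simplified Explanation: This patent application describes methods, devices, and systems for audio signal processing, specifically low-latency speech enhancement. The system estimates the probability that speech is present in a signal from a two-stage frequency-domain representation of the signal's time-domain samples.
- The system receives a number b of frames of a signal, each containing n time-domain samples, and transforms the b*n samples into b*n first frequency-domain samples using an n-point Fast Fourier Transform (FFT).
- It then transforms the first frequency-domain samples into b*n second frequency-domain samples using a b-point FFT, and determines the probability of speech in the signal based at least in part on those samples.
- In some implementations, it decimates the second frequency-domain samples by a factor d and determines the probability of speech from the resulting b*n/d samples.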
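As a minimal sketch of the transform step, the following Python code applies an n-point FFT to each of b frames and then a b-point FFT across frames for each frequency bin, following the original abstract. The layout of the second stage (one b-point FFT per bin), the radix-2 `fft` helper, and the name `two_stage_transform` are assumptions made for illustration and are not taken from the application. The helper only handles power-of-two lengths.

```python
import cmath


def fft(x):
    """Recursive radix-2 FFT. len(x) must be a power of two."""
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def two_stage_transform(frames):
    """frames: b lists of n time-domain samples.

    Returns (first, second), each a b x n grid of complex samples.
    """
    b = len(frames)
    n = len(frames[0])
    # Stage 1: n-point FFT of each frame -> b*n first frequency-domain samples.
    first = [fft(frame) for frame in frames]
    # Stage 2 (assumed layout): b-point FFT across frames, one per frequency bin.
    second = [[0j] * n for _ in range(b)]
    for k in range(n):
        column = fft([first[i][k] for i in range(b)])
        for m in range(b):
            second[m][k] = column[m]
    return first, second


# Hypothetical input: b = 4 frames of n = 8 constant samples.
frames = [[1.0] * 8 for _ in range(4)]
first, second = two_stage_transform(frames)
print(len(second) * len(second[0]))  # b*n = 32 samples
```

With a constant input, all of the energy ends up in the single sample `second[0][0]`, which is a quick sanity check on both stages.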
Key Features and Innovation:
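- Applies an n-point Fast Fourier Transform (FFT) to b frames of n time-domain samples, then a b-point FFT to the result, yielding b*n second frequency-domain samples.
- Determines the probability of speech in the signal based at least in part on the second frequency-domain samples.
- In some implementations, decimates the second frequency-domain samples by a factor d, so that the probability of speech is determined from b*n/d samples.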
Potential Applications: This technology can be applied in:
- Speech recognition systems
- Telecommunication devices
- Audio recording and editing software
Problems Solved:
- Low-latency speech enhancement
- Efficient processing of audio signals
- Improved speech quality in noisy environments
Benefits:
- Enhanced speech quality
- Reduced latency in speech processing
- Improved accuracy in speech recognition
Commercial Applications: Potential commercial uses include:
- Speech recognition software for customer service centers
- Audio processing tools for content creators
- Communication devices for noisy environments
Prior Art: Prior research in speech enhancement and audio signal processing can be found in academic journals and patents related to FFT and speech recognition technologies.
Frequently Updated Research: Stay updated on advancements in FFT algorithms for audio signal processing and speech enhancement techniques for real-time applications.
Questions about Speech Enhancement:
1. How does decimation of frequency-domain samples improve speech enhancement?
2. What are the key differences between low-latency speech enhancement and traditional speech processing methods?
Original Abstract Submitted
This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to low-latency speech enhancement. In some aspects, a speech enhancement system may receive a number (b) of frames of a signal, where each of the b frames include a number (n) of time-domain samples. The speech enhancement system may transform the b*n time-domain samples into b*n first frequency-domain samples based on an n-point fast Fourier transform (FFT), and may further transform the b*n first frequency-domain samples into b*n second frequency-domain samples based on a b-point FFT. The speech enhancement system may determine a probability of speech in the signal based at least in part on the b*n second frequency-domain samples. In some implementations, the speech enhancement system may decimate the b*n second frequency-domain samples by a factor (d), and the probability of speech is determined based on the b*n/d decimated second frequency-domain samples.
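The sample-count arithmetic of the decimation step can be illustrated with a short Python sketch. It assumes decimation means keeping every d-th sample of a flattened list, with no filtering beforehand. The abstract does not specify the decimation method or the axis it operates on, so the function `decimate` and the placeholder data are hypothetical. The abstract also does not say how the probability of speech is computed, so that step is not shown.

```python
def decimate(samples, d):
    """Keep every d-th sample. len(samples) must be divisible by d."""
    if d < 1 or len(samples) % d != 0:
        raise ValueError("d must be a positive divisor of the sample count")
    return samples[::d]


b, n, d = 4, 8, 2
# Placeholder values standing in for the b*n second frequency-domain samples.
second = [complex(i, 0) for i in range(b * n)]
decimated = decimate(second, d)
print(len(decimated))  # b*n/d = 16
```

A smaller input to the probability estimator is one plausible motivation for this step, although the abstract itself does not state the reason for decimating.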