18582989. TRAINING METHOD AND ENHANCEMENT METHOD FOR SPEECH ENHANCEMENT MODEL, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT simplified abstract (TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED)
Contents
- 1 TRAINING METHOD AND ENHANCEMENT METHOD FOR SPEECH ENHANCEMENT MODEL, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT
- 1.1 Organization Name
- 1.2 Inventor(s)
- 1.3 TRAINING METHOD AND ENHANCEMENT METHOD FOR SPEECH ENHANCEMENT MODEL, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT - A simplified explanation of the abstract
- 1.4 Simplified Explanation
- 1.5 Key Features and Innovation
- 1.6 Potential Applications
- 1.7 Problems Solved
- 1.8 Benefits
- 1.9 Commercial Applications
- 1.10 Prior Art
- 1.11 Frequently Updated Research
- 1.12 Questions about Speech Enhancement Technology
- 1.13 Original Abstract Submitted
TRAINING METHOD AND ENHANCEMENT METHOD FOR SPEECH ENHANCEMENT MODEL, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT
Organization Name
TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Inventor(s)
TRAINING METHOD AND ENHANCEMENT METHOD FOR SPEECH ENHANCEMENT MODEL, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT - A simplified explanation of the abstract
This abstract first appeared for US patent application 18582989, titled 'TRAINING METHOD AND ENHANCEMENT METHOD FOR SPEECH ENHANCEMENT MODEL, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT'.
Simplified Explanation
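The patent application describes a method for training a speech enhancement model and a method for enhancing speech with it, along with the related apparatus, electronic device, storage medium, and program product. During training, the model predicts mask values in an auditory domain from a noisy speech feature. The amplitude of the clean speech signal is then corrected at each frequency point according to the phase difference between the noisy and clean signals, and a loss value is used to update the model parameters.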
Key Features and Innovation
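- Training method for a speech enhancement model that predicts mask values in an auditory domain from a noisy speech feature
- Correction of the clean speech amplitude at each frequency point based on the phase difference between the noisy and clean signals
- Update of model parameters based on a loss value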
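The amplitude correction listed above can be illustrated with a small NumPy sketch. The abstract does not give the correction formula, so the rule used here is an assumption borrowed from phase-sensitive masking: the clean amplitude is scaled by the cosine of the phase difference, and negative values are clipped to zero. The function name `corrected_clean_amplitude` and the example spectra are hypothetical.

```python
import numpy as np

def corrected_clean_amplitude(noisy_spec, clean_spec):
    """Correct the clean amplitude per frequency point using the phase difference.

    Assumption (not from the abstract): corrected = |clean| * max(cos(dphi), 0).
    """
    first_phase = np.angle(noisy_spec)    # phase of the noisy signal
    second_amp = np.abs(clean_spec)       # amplitude of the clean signal
    second_phase = np.angle(clean_spec)   # phase of the clean signal
    phase_diff = second_phase - first_phase
    return second_amp * np.maximum(np.cos(phase_diff), 0.0)

# Four frequency points with phase differences of 0, 60, 90 and 180 degrees.
clean = np.array([2 + 0j, 2 + 0j, 2 + 0j, 2 + 0j])
noisy = np.array([1 + 0j, np.exp(1j * np.pi / 3), 1j, -1 + 0j])
corrected = corrected_clean_amplitude(noisy, clean)
```

Under this assumed rule, a frequency point whose clean and noisy phases agree keeps its full amplitude, while a point whose phases differ by 90 degrees or more contributes nothing to the training target.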
Potential Applications
This technology can be used in:
- Speech recognition systems
- Telecommunication devices
- Audio processing software
Problems Solved
- Improving speech quality in noisy environments
- Enhancing speech intelligibility
- Optimizing speech enhancement models
Benefits
- Clearer and more intelligible speech output
- Enhanced performance of speech recognition systems
- Improved user experience in noisy environments
Commercial Applications
- This technology can be applied in industries such as telecommunications, audio processing, and speech recognition software, enhancing the quality and performance of products in these sectors.
Prior Art
Readers can explore prior research on speech enhancement models, phase correction techniques, and noise reduction algorithms in the field of audio signal processing.
Frequently Updated Research
Stay updated on advancements in speech enhancement models, phase correction methods, and noise reduction algorithms to further improve the performance of speech processing systems.
Questions about Speech Enhancement Technology
How does this technology improve speech quality in noisy environments?
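During training, the amplitude of the clean speech signal is corrected at each frequency point based on the phase difference between the noisy and clean signals, and the model parameters are updated based on a loss value. The trained model is intended to produce clearer and more intelligible speech from noisy input.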
What are the potential applications of this speech enhancement model?
This technology can be applied in speech recognition systems, telecommunication devices, and audio processing software to enhance speech quality and intelligibility.
Original Abstract Submitted
A training method and an enhancement method for a speech enhancement model, an apparatus, an electronic device, a storage medium, and a program product are described. A training method for the model may include: invoking the model based on a noisy speech feature of a speech signal, to obtain first predicted mask values in an auditory domain. A first amplitude and first phase of the noisy speech signal, and a second amplitude and second phase of a clean speech signal may then be obtained. A phase difference at each frequency point may be determined based on the phases. The second amplitude corresponding to each frequency point may be corrected based on the phase difference at each frequency point, to obtain a corrected second amplitude corresponding to each frequency point. A loss value is determined, and parameters of the speech enhancement model may be updated based on the loss value.
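The training steps in the abstract can be sketched end to end on synthetic data. This is a toy illustration under several assumptions that the abstract does not confirm: the correction rule (cosine of the phase difference, clipped at zero), the auditory-domain mapping (equal-width averaging bands rather than a real Bark or Mel scale), the noisy speech feature (log of band amplitudes), the model (one linear layer with a sigmoid), and the loss (mean squared error between the masked noisy band amplitude and the corrected clean band amplitude). All names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
n_frames, n_bins, n_bands = 8, 16, 4

# Synthetic complex spectra standing in for STFT frames (not real speech).
clean_spec = rng.normal(size=(n_frames, n_bins)) + 1j * rng.normal(size=(n_frames, n_bins))
noise_spec = 0.5 * (rng.normal(size=(n_frames, n_bins)) + 1j * rng.normal(size=(n_frames, n_bins)))
noisy_spec = clean_spec + noise_spec

# First amplitude/phase (noisy signal) and second amplitude/phase (clean signal).
first_amp, first_phase = np.abs(noisy_spec), np.angle(noisy_spec)
second_amp, second_phase = np.abs(clean_spec), np.angle(clean_spec)

# Assumed correction rule: scale the clean amplitude by cos(phase difference).
corrected_amp = second_amp * np.maximum(np.cos(second_phase - first_phase), 0.0)

# Hypothetical auditory-style grouping: average equal-width blocks of frequency points.
band = np.kron(np.eye(n_bands), np.full((1, n_bins // n_bands), n_bands / n_bins))
noisy_band = first_amp @ band.T        # shape (n_frames, n_bands)
target_band = corrected_amp @ band.T   # shape (n_frames, n_bands)
feat = np.log1p(noisy_band)            # assumed noisy speech feature

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_step(W, b, feat, noisy_band, target_band, lr=0.1):
    """One gradient-descent step for the toy mask model."""
    mask = sigmoid(feat @ W + b)               # predicted mask values per band
    err = mask * noisy_band - target_band      # masked noisy amplitude vs. target
    loss = float(np.mean(err ** 2))
    # Gradient of the MSE with respect to the pre-sigmoid activations.
    grad_z = (2.0 / err.size) * err * noisy_band * mask * (1.0 - mask)
    return W - lr * feat.T @ grad_z, b - lr * grad_z.sum(axis=0), loss

W = np.zeros((n_bands, n_bands))
b = np.zeros(n_bands)
losses = []
for _ in range(200):
    W, b, loss = train_step(W, b, feat, noisy_band, target_band)
    losses.append(loss)
```

Each entry of `losses` is the loss measured before that step's update, so comparing the first and last entries shows whether the updates reduced the loss on this toy data. A practical system would replace the synthetic spectra with STFT frames of paired noisy and clean recordings and the linear layer with a neural network.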