TRAINING METHOD AND ENHANCEMENT METHOD FOR SPEECH ENHANCEMENT MODEL, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT

Organization Name

TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor(s)

TRAINING METHOD AND ENHANCEMENT METHOD FOR SPEECH ENHANCEMENT MODEL, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT - A simplified explanation of the abstract

This abstract first appeared for US patent application 18582989 titled 'TRAINING METHOD AND ENHANCEMENT METHOD FOR SPEECH ENHANCEMENT MODEL, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT

Simplified Explanation

The patent application describes a training method and enhancement method for a speech enhancement model, along with related apparatus and programs. The method involves correcting amplitude values of a clean speech signal based on phase differences to improve speech quality.

Key Features and Innovation

Training method for a speech enhancement model
Correction of amplitude values based on phase differences
Update of model parameters based on loss value

Potential Applications

This technology can be used in:

Speech recognition systems
Telecommunication devices
Audio processing software

Problems Solved

Improving speech quality in noisy environments
Enhancing speech intelligibility
Optimizing speech enhancement models

Benefits

Clearer and more intelligible speech output
Enhanced performance of speech recognition systems
Improved user experience in noisy environments

Commercial Applications

This technology can be applied in industries such as telecommunications, audio processing, and speech recognition software, enhancing the quality and performance of products in these sectors.

Prior Art

Readers can explore prior research on speech enhancement models, phase correction techniques, and noise reduction algorithms in the field of audio signal processing.

Frequently Updated Research

Stay updated on advancements in speech enhancement models, phase correction methods, and noise reduction algorithms to further improve the performance of speech processing systems.

Questions about Speech Enhancement Technology

How does this technology improve speech quality in noisy environments?

This technology corrects amplitude values of a clean speech signal based on phase differences, resulting in clearer and more intelligible speech output in noisy environments.

What are the potential applications of this speech enhancement model?

This technology can be applied in speech recognition systems, telecommunication devices, and audio processing software to enhance speech quality and intelligibility.

Original Abstract Submitted

A training method and an enhancement method for a speech enhancement model, an apparatus, an electronic device, a storage medium, and a program product are described. A training method for the model may include: invoking the model based on a noisy speech feature of a speech signal, to obtain first predicted mask values in an auditory domain. A first amplitude and first phase of the noisy speech signal, and a second amplitude and second phase of a clean speech signal may then be obtained. A phase difference at each frequency point may be determined based on the phases. The second amplitude corresponding to each frequency point may be corrected based on the phase difference at each frequency point, to obtain a corrected second amplitude corresponding to each frequency point. A loss value is determined, and parameters of the speech enhancement model may be updated based on the loss value.

18582989. TRAINING METHOD AND ENHANCEMENT METHOD FOR SPEECH ENHANCEMENT MODEL, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT simplified abstract (TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED)

Contents

TRAINING METHOD AND ENHANCEMENT METHOD FOR SPEECH ENHANCEMENT MODEL, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT

Organization Name

Inventor(s)