Linear Prediction Residual Energy Tilt-Based Audio Signal Classification Method and Apparatus

Organization Name

Inventor(s)

Linear Prediction Residual Energy Tilt-Based Audio Signal Classification Method and Apparatus - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240029757 titled 'Linear Prediction Residual Energy Tilt-Based Audio Signal Classification Method and Apparatus

Simplified Explanation

The audio signal classification method described in the patent application involves determining the voice activity of a current audio frame. Based on this determination, the method decides whether to obtain and store the frequency spectrum fluctuation of the current audio frame in a memory. The method also updates the stored frequency spectrum fluctuations based on whether the audio frame is percussive music or activity from a historical audio frame. Finally, the method classifies the current audio frame as either a speech frame or a music frame by analyzing the statistics of the frequency spectrum fluctuations stored in the memory.

The method determines the voice activity of a current audio frame.
It decides whether to store the frequency spectrum fluctuation of the current audio frame based on the voice activity.
The method updates the stored frequency spectrum fluctuations based on the type of audio frame.
It classifies the current audio frame as speech or music by analyzing the statistics of the stored frequency spectrum fluctuations.

Potential Applications:

Speech recognition systems can benefit from accurate classification of speech frames.
Music analysis and recognition systems can benefit from accurate classification of music frames.
Audio processing applications can use this method to separate speech and music frames for further processing.

Problems Solved:

Accurate classification of speech and music frames in audio signals.
Efficient storage and updating of frequency spectrum fluctuations for classification purposes.

Benefits:

Improved accuracy in speech and music classification.
Enhanced performance of speech recognition and music analysis systems.
Efficient utilization of memory for storing frequency spectrum fluctuations.

Original Abstract Submitted

an audio signal classification method includes determining, according to voice activity of a current audio frame, whether to obtain a frequency spectrum fluctuation of the current audio frame and store the frequency spectrum fluctuation in a frequency spectrum fluctuation memory, and updating, according to whether the audio frame is percussive music or activity of a historical audio frame, frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory, and classifying the current audio frame as a speech frame or a music frame according to statistics of a part or all of effective data of the frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory.

20240029757. Linear Prediction Residual Energy Tilt-Based Audio Signal Classification Method and Apparatus simplified abstract (HUAWEI TECHNOLOGIES CO., LTD.)

Contents

Linear Prediction Residual Energy Tilt-Based Audio Signal Classification Method and Apparatus

Organization Name

Inventor(s)