20240029757. Linear Prediction Residual Energy Tilt-Based Audio Signal Classification Method and Apparatus simplified abstract (HUAWEI TECHNOLOGIES CO., LTD.)

From WikiPatents
Jump to navigation Jump to search

Linear Prediction Residual Energy Tilt-Based Audio Signal Classification Method and Apparatus

Organization Name

HUAWEI TECHNOLOGIES CO., LTD.

Inventor(s)

Zhe Wang of Beijing (CN)

Linear Prediction Residual Energy Tilt-Based Audio Signal Classification Method and Apparatus - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240029757 titled 'Linear Prediction Residual Energy Tilt-Based Audio Signal Classification Method and Apparatus

Simplified Explanation

The audio signal classification method described in the patent application involves determining the voice activity of a current audio frame. Based on this determination, the method decides whether to obtain and store the frequency spectrum fluctuation of the current audio frame in a memory. The method also updates the stored frequency spectrum fluctuations based on whether the audio frame is percussive music or activity from a historical audio frame. Finally, the method classifies the current audio frame as either a speech frame or a music frame by analyzing the statistics of the frequency spectrum fluctuations stored in the memory.

  • The method determines the voice activity of a current audio frame.
  • It decides whether to store the frequency spectrum fluctuation of the current audio frame based on the voice activity.
  • The method updates the stored frequency spectrum fluctuations based on the type of audio frame.
  • It classifies the current audio frame as speech or music by analyzing the statistics of the stored frequency spectrum fluctuations.

Potential Applications:

  • Speech recognition systems can benefit from accurate classification of speech frames.
  • Music analysis and recognition systems can benefit from accurate classification of music frames.
  • Audio processing applications can use this method to separate speech and music frames for further processing.

Problems Solved:

  • Accurate classification of speech and music frames in audio signals.
  • Efficient storage and updating of frequency spectrum fluctuations for classification purposes.

Benefits:

  • Improved accuracy in speech and music classification.
  • Enhanced performance of speech recognition and music analysis systems.
  • Efficient utilization of memory for storing frequency spectrum fluctuations.


Original Abstract Submitted

an audio signal classification method includes determining, according to voice activity of a current audio frame, whether to obtain a frequency spectrum fluctuation of the current audio frame and store the frequency spectrum fluctuation in a frequency spectrum fluctuation memory, and updating, according to whether the audio frame is percussive music or activity of a historical audio frame, frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory, and classifying the current audio frame as a speech frame or a music frame according to statistics of a part or all of effective data of the frequency spectrum fluctuations stored in the frequency spectrum fluctuation memory.