US Patent Application 18195121. METHOD AND APPARATUS FOR PROCESSING AUDIO FOR SCENE CLASSIFICATION simplified abstract

From WikiPatents

METHOD AND APPARATUS FOR PROCESSING AUDIO FOR SCENE CLASSIFICATION

Organization Name

Samsung Electronics Co., Ltd.


Inventor(s)

Kyungrae Kim of Suwon-si (KR)

Woohyun Nam of Suwon-si (KR)

METHOD AND APPARATUS FOR PROCESSING AUDIO FOR SCENE CLASSIFICATION - A simplified explanation of the abstract

This abstract first appeared for US patent application 18195121, titled 'METHOD AND APPARATUS FOR PROCESSING AUDIO FOR SCENE CLASSIFICATION'.

Simplified Explanation

The patent application describes an audio processing method that uses two neural networks to classify the scene of an audio frame, taking into account how similar that frame's features are to the features of earlier frames.

  • The method obtains a first audio signal corresponding to a first frame and extracts a first feature vector from it using a first neural network.
  • A temporal correlation vector is then obtained, representing the similarity between the first feature vector and one or more feature vectors extracted from the audio signals of earlier frames.
  • Finally, a second neural network classifies the scene of the first audio signal from the first feature vector, the earlier feature vectors, and the temporal correlation vector.
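
The three steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the applicant's implementation: the abstract does not say how the networks are built or how similarity is measured, so the sketch uses fixed random linear layers as stand-ins for both neural networks and cosine similarity as the similarity measure. All names and sizes (`extract_features`, `temporal_correlation`, `classify_scene`, `FRAME_LEN`, and so on) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# All sizes below are assumptions chosen for illustration only.
FRAME_LEN = 1024   # audio samples per frame
FEAT_DIM = 16      # length of each feature vector
NUM_PREV = 4       # number of earlier frames compared against
NUM_SCENES = 3     # number of scene classes

# Stand-ins for the two neural networks: fixed random linear layers.
W1 = rng.standard_normal((FEAT_DIM, FRAME_LEN)) / np.sqrt(FRAME_LEN)
CLASSIFIER_IN = FEAT_DIM * (1 + NUM_PREV) + NUM_PREV
W2 = rng.standard_normal((NUM_SCENES, CLASSIFIER_IN)) / np.sqrt(CLASSIFIER_IN)


def extract_features(frame):
    """Hypothetical 'first neural network': one linear layer plus tanh."""
    return np.tanh(W1 @ frame)


def temporal_correlation(current, previous):
    """Cosine similarity between the current feature vector and each earlier one.

    Cosine similarity is an assumption; the abstract only says 'similarity'.
    """
    eps = 1e-12  # guards against division by zero
    cur = current / (np.linalg.norm(current) + eps)
    prev = previous / (np.linalg.norm(previous, axis=1, keepdims=True) + eps)
    return prev @ cur  # one similarity value per earlier frame


def classify_scene(current, previous, corr):
    """Hypothetical 'second neural network': linear layer plus softmax."""
    x = np.concatenate([current, previous.ravel(), corr])
    logits = W2 @ x
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()


# Example: four earlier frames followed by the frame to classify.
frames = rng.standard_normal((NUM_PREV + 1, FRAME_LEN))
features = np.stack([extract_features(f) for f in frames])
prev_feats, cur_feat = features[:-1], features[-1]

corr = temporal_correlation(cur_feat, prev_feats)
probs = classify_scene(cur_feat, prev_feats, corr)
print("temporal correlation vector:", corr)
print("scene probabilities:", probs)
```

The classifier receives the concatenation of all three inputs named in the abstract; a real system would presumably use trained networks in place of `W1` and `W2`.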


Original Abstract Submitted

An audio processing method includes obtaining a first audio signal corresponding to a first frame; extracting a first feature vector by inputting the first audio signal to a first neural network; obtaining a temporal correlation vector representing a similarity between the first feature vector and at least one second feature vector extracted from at least one second audio signal corresponding to at least one second frame that is temporally before the first frame; and classifying a scene of the first audio signal by inputting the first feature vector, the at least one second feature vector, and the temporal correlation vector to a second neural network.
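
The claimed steps can also be written compactly. The notation here is not from the application and is introduced only for illustration: x_t is the audio signal of the first (current) frame, N_1 and N_2 are the first and second neural networks, f_t is the first feature vector, f_{t-1}, ..., f_{t-K} are the feature vectors of K earlier frames, s is an unspecified similarity measure, c_t is the temporal correlation vector, and \hat{y}_t is the predicted scene. The abstract requires only "at least one" earlier frame, so K >= 1, and using the K immediately preceding frames is an assumption. Cosine similarity is shown as one possible choice of s.

```latex
\begin{align*}
f_t &= N_1(x_t), \\
c_t &= \bigl[\, s(f_t, f_{t-1}),\; \dots,\; s(f_t, f_{t-K}) \,\bigr], \\
\hat{y}_t &= N_2\bigl(f_t,\; f_{t-1}, \dots, f_{t-K},\; c_t\bigr), \\[4pt]
\text{e.g.}\quad s(a, b) &= \frac{a \cdot b}{\lVert a \rVert \, \lVert b \rVert}.
\end{align*}
```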