Microsoft technology licensing, llc (20240312477). Multichannel Audio Speech Classification simplified abstract

From WikiPatents
Jump to navigation Jump to search

Multichannel Audio Speech Classification

Organization Name

microsoft technology licensing, llc

Inventor(s)

Oron Nir of Herzeliya (IL)

Inbal Sagiv of Kfar-Saba (IL)

Maayan Yedidia of Ramat Gan (IL)

Fardau Van Neerden of Driel (NL)

Itai Norman of Tel Aviv (IL)

Multichannel Audio Speech Classification - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240312477 titled 'Multichannel Audio Speech Classification

    • Simplified Explanation:**

The patent application describes systems and methods for classifying multichannel audio speech. It involves receiving an audio signal with multiple channels, calculating average power values for each channel, and comparing correlation values to determine if the signal is speech-based communication.

    • Key Features and Innovation:**
  • Receiving an audio signal with multiple channels
  • Transcoding each channel to a predefined audio format
  • Calculating average power values for each channel
  • Determining correlation values between channels
  • Classifying the audio signal as speech-based communication based on correlation values
    • Potential Applications:**

This technology can be used in:

  • Speech recognition systems
  • Audio surveillance systems
  • Call center quality assurance
  • Voice-controlled devices
  • Audio content analysis
    • Problems Solved:**
  • Efficient classification of multichannel audio speech
  • Improved accuracy in identifying speech-based communication
  • Enhanced performance of audio processing devices
    • Benefits:**
  • Enhanced speech classification accuracy
  • Increased efficiency in audio signal processing
  • Improved performance of speech recognition systems
    • Commercial Applications:**
  • "Multichannel Audio Speech Classification Technology for Enhanced Speech Recognition and Audio Surveillance"
    • Prior Art:**

Prior art related to this technology may include research on audio signal processing, speech recognition systems, and audio content analysis.

    • Frequently Updated Research:**

Stay updated on advancements in audio signal processing, speech recognition technology, and machine learning algorithms for audio classification.

    • Questions about Multichannel Audio Speech Classification:**

1. How does this technology improve the accuracy of speech classification in audio signals? 2. What are the potential commercial applications of multichannel audio speech classification technology?


Original Abstract Submitted

examples of the present disclosure describe systems and methods for multichannel audio speech classification. in examples, an audio signal comprising multiple audio channels is received at a processing device. each of the audio channels in the audio signal is transcoded to a predefined audio format. for each of the transcoded audio channels, an average power value is calculated for one or more data windows in the audio signal. a correlation value is calculated between the average power value for each audio channel and the combined average power value of the other audio channels in the audio signal. each of the correlation values (or an aggregated correlation value for the audio channels) is then compared against a threshold value to determine whether the audio signal is to be classified as a speech-based communication. based on the classification, an action associated with the audio signal may be performed.