Multichannel Audio Speech Classification

Organization Name

Microsoft Technology Licensing, LLC

Inventor(s)

Oron Nir of Herzeliya (IL)

Inbal Sagiv of Kfar-Saba (IL)

Maayan Yedidia of Ramat Gan (IL)

Fardau Van Neerden of Driel (NL)

Itai Norman of Tel Aviv (IL)

Multichannel Audio Speech Classification - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240312477 titled 'Multichannel Audio Speech Classification'.

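Simplified Explanation:

The patent application describes systems and methods for classifying multichannel audio as speech. A processing device receives an audio signal with multiple channels, transcodes each channel to a predefined audio format, and calculates average power values for each channel over one or more data windows. It then correlates each channel's average power with the combined average power of the other channels and compares the correlation values (or an aggregate of them) against a threshold to decide whether the signal is a speech-based communication.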

Key Features and Innovation:
Receiving an audio signal with multiple channels
Transcoding each channel to a predefined audio format
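Calculating average power values for each channel over one or more data windows
Determining a correlation value between each channel's average power and the combined average power of the other channels
Classifying the audio signal as speech-based communication by comparing the correlation values (or an aggregated value) against a threshold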
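
The power and correlation steps can be written out as follows. This is one possible formalization, not the one claimed in the application: the abstract does not say how windows are formed or which correlation measure is used, so non-overlapping windows of N samples and a Pearson correlation are assumed here, and all symbols (x_c, P_c, Q_c, rho_c, C, N) are introduced only for illustration.

```latex
% Average power of channel c in window k (N samples per window)
P_c[k] = \frac{1}{N} \sum_{n=0}^{N-1} x_c[kN + n]^2

% Combined average power of the other C - 1 channels in window k
Q_c[k] = \frac{1}{C-1} \sum_{j \neq c} P_j[k]

% Pearson correlation between the two power sequences over all windows k
\rho_c = \frac{\sum_k \left(P_c[k] - \bar{P}_c\right)\left(Q_c[k] - \bar{Q}_c\right)}
              {\sqrt{\sum_k \left(P_c[k] - \bar{P}_c\right)^2}\,
               \sqrt{\sum_k \left(Q_c[k] - \bar{Q}_c\right)^2}}
```

Each rho_c (or an aggregate such as their mean) would then be compared against the threshold.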

Potential Applications:

This technology can be used in:

Speech recognition systems
Audio surveillance systems
Call center quality assurance
Voice-controlled devices
Audio content analysis

Problems Solved:
Efficient classification of multichannel audio speech
Improved accuracy in identifying speech-based communication
Enhanced performance of audio processing devices

Benefits:
Enhanced speech classification accuracy
Increased efficiency in audio signal processing
Improved performance of speech recognition systems

Commercial Applications:
"Multichannel Audio Speech Classification Technology for Enhanced Speech Recognition and Audio Surveillance"

Prior Art:

Prior art related to this technology may include research on audio signal processing, speech recognition systems, and audio content analysis.

Frequently Updated Research:

Stay updated on advancements in audio signal processing, speech recognition technology, and machine learning algorithms for audio classification.

Questions about Multichannel Audio Speech Classification:

1. How does this technology improve the accuracy of speech classification in audio signals?
2. What are the potential commercial applications of multichannel audio speech classification technology?

Original Abstract Submitted

Examples of the present disclosure describe systems and methods for multichannel audio speech classification. In examples, an audio signal comprising multiple audio channels is received at a processing device. Each of the audio channels in the audio signal is transcoded to a predefined audio format. For each of the transcoded audio channels, an average power value is calculated for one or more data windows in the audio signal. A correlation value is calculated between the average power value for each audio channel and the combined average power value of the other audio channels in the audio signal. Each of the correlation values (or an aggregated correlation value for the audio channels) is then compared against a threshold value to determine whether the audio signal is to be classified as a speech-based communication. Based on the classification, an action associated with the audio signal may be performed.
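
The steps in the abstract can be sketched in Python with NumPy. This is a minimal illustration under stated assumptions, not the implementation from the application: the function names, the default window size, the use of Pearson correlation, the mean as the aggregate, and the rule that a correlation below the threshold indicates speech (speakers on separate channels tend to take turns) are all hypothetical choices. Transcoding is skipped; the input is assumed to be already decoded into an array of shape (channels, samples).

```python
import numpy as np


def window_average_power(channel, window_size):
    """Mean squared amplitude for each non-overlapping window of one channel."""
    n_windows = len(channel) // window_size
    trimmed = np.asarray(channel[: n_windows * window_size], dtype=np.float64)
    return (trimmed.reshape(n_windows, window_size) ** 2).mean(axis=1)


def channel_correlations(signal, window_size):
    """Correlate each channel's power with the combined power of the others.

    signal: array-like of shape (channels, samples), assumed already decoded.
    """
    powers = np.stack([window_average_power(ch, window_size) for ch in signal])
    total = powers.sum(axis=0)
    correlations = []
    for i in range(powers.shape[0]):
        others = total - powers[i]  # combined power of all other channels
        if powers[i].std() == 0 or others.std() == 0:
            # Pearson correlation is undefined for a constant sequence;
            # treating it as 0.0 is an assumption of this sketch.
            correlations.append(0.0)
        else:
            correlations.append(float(np.corrcoef(powers[i], others)[0, 1]))
    return correlations


def is_speech_communication(signal, window_size=800, threshold=0.0):
    """Assumed rule: aggregated (mean) correlation below threshold means speech."""
    correlations = channel_correlations(signal, window_size)
    return float(np.mean(correlations)) < threshold
```

With this rule, two channels whose activity alternates (one party speaking at a time) produce strongly negative correlations, while channels carrying the same content at different levels, as in stereo music, produce correlations near 1. Because Pearson correlation is unaffected by positive scaling, summing or averaging the other channels' power gives the same result.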

Microsoft Technology Licensing, LLC (20240312477). Multichannel Audio Speech Classification simplified abstract
