US Patent Application 17804603. Keyword Detection for Audio Content simplified abstract

From WikiPatents
Jump to navigation Jump to search

Keyword Detection for Audio Content

Organization Name

Microsoft Technology Licensing, LLC


Inventor(s)

Zvi Figov of Modiin (IL)


Keyword Detection for Audio Content - A simplified explanation of the abstract

  • This abstract for appeared for US patent application number 17804603 Titled 'Keyword Detection for Audio Content'

Simplified Explanation

The present disclosure describes improved systems and methods for detecting keywords in audio content. The audio content is divided into smaller segments, and corresponding text segments are generated for each audio segment. Textual analysis is performed to generate phrase candidate values, and sentence embedding analysis is used to generate sentence embedding values. An average sentence embedding value is calculated, and each phrase candidate value is compared to this average value. If a phrase candidate value exceeds a certain threshold, it is labeled as a keyword.


Original Abstract Submitted

Examples of the present disclosure describe improved systems and methods for detecting keywords in audio content. In one example implementation, audio content is segmented into one or more audio segments. One or more text segments is generated, each text segment corresponding to each of the audio segments. For each text segment, one or more phrase candidate values is generated using a textual analysis, and one or more sentence embedding values is generated using a sentence embedding analysis. Next, an average sentence embedding value is calculated using the one or more sentence embedding values. Each of the one or more phrase candidate values is compared to the average sentence embedding value. Each phrase candidate value having a comparison value above a threshold value is labeled as representing a keyword.