Google llc (20240221750). KEY PHRASE SPOTTING simplified abstract

From WikiPatents
Jump to navigation Jump to search

KEY PHRASE SPOTTING

Organization Name

google llc

Inventor(s)

Wei Li of Mountain View CA (US)

Rohit Prakash Prabhavalkar of Santa Clara CA (US)

Kanury Kanishka Rao of Santa Clara CA (US)

Yanzhang He of Mountain View CA (US)

Ian C. Mcgraw of Menlo Park CA (US)

Anton Bakhtin of New York NY (US)

KEY PHRASE SPOTTING - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240221750 titled 'KEY PHRASE SPOTTING

The patent application describes methods, systems, and apparatus for detecting utterances of a key phrase in an audio signal using an attention mechanism and neural network layers.

  • Receiving an audio signal encoding one or more utterances.
  • Generating an attention output using an attention mechanism based on encodings from neural network layers.
  • Outputting whether the audio signal likely encodes the key phrase.
  • Providing the output indicating the likelihood of the key phrase in the audio signal.

Potential Applications: - Speech recognition systems - Voice-activated devices - Security systems for detecting specific phrases

Problems Solved: - Efficiently detecting key phrases in audio signals - Improving accuracy of speech recognition systems

Benefits: - Enhanced performance in voice recognition technology - Increased security in audio monitoring systems

Commercial Applications: Title: "Advanced Key Phrase Detection Technology for Voice Recognition Systems" This technology can be used in smart speakers, virtual assistants, and security systems for accurate and efficient detection of key phrases in audio signals.

Questions about Key Phrase Detection Technology: 1. How does the attention mechanism improve the accuracy of detecting key phrases in audio signals? - The attention mechanism focuses on relevant parts of the audio signal, enhancing the system's ability to detect key phrases accurately.

2. What are the potential limitations of using neural network layers in key phrase detection systems? - Neural network layers may require significant computational resources, potentially limiting real-time applications.


Original Abstract Submitted

methods, systems, and apparatus, including computer programs encoded on computer storage media, for detecting utterances of a key phrase in an audio signal. one of the methods includes receiving, by a key phrase spotting system, an audio signal encoding one or more utterances; while continuing to receive the audio signal, generating, by the key phrase spotting system, an attention output using an attention mechanism that is configured to compute the attention output based on a series of encodings generated by an encoder comprising one or more neural network layers; generating, by the key phrase spotting system and using attention output, output that indicates whether the audio signal likely encodes the key phrase; and providing, by the key phrase spotting system, the output that indicates whether the audio signal likely encodes the key phrase.