GOOGLE LLC (20240347060). CONTEXTUAL SUPPRESSION OF ASSISTANT COMMAND(S) simplified abstract

From WikiPatents
Jump to navigation Jump to search

CONTEXTUAL SUPPRESSION OF ASSISTANT COMMAND(S)

Organization Name

GOOGLE LLC

Inventor(s)

Victor Carbune of Zurich (CH)

Matthew Sharifi of Kilchberg (CH)

Ondrej Skopek of Zurich (CH)

Justin Lu of Zurich (CH)

Daniel Valcarce of Zurich (CH)

Kevin Kilgour of Oetwil an der Limmat (CH)

Mohamad Hassan Rom of Zurich (CH)

Nicolo D'ercole of Oberrieden (CH)

Michael Golikov of Merlischachen (CH)

CONTEXTUAL SUPPRESSION OF ASSISTANT COMMAND(S) - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240347060 titled 'CONTEXTUAL SUPPRESSION OF ASSISTANT COMMAND(S)

The patent application describes a process that uses warm word models to analyze a stream of audio data to identify specific words or phrases associated with an assistant command. This process also involves using an automatic speech recognition model to generate output based on the audio data, and then determining if the user intended for the assistant command to be executed. Additionally, the implementation may include speaker identification models to verify the user's identity and authorization to issue the command.

  • Utilizes warm word models to identify specific words or phrases in audio data related to assistant commands
  • Employs automatic speech recognition models to process audio data and determine user intent
  • Incorporates speaker identification models to verify user identity and authorization for command execution
  • Streamlines the process of recognizing and executing assistant commands through audio data analysis
  • Enhances user experience by accurately interpreting spoken commands and ensuring security measures are in place

Potential Applications: - Voice-controlled assistants - Smart home devices - Hands-free operation of electronic devices - Voice-activated software applications

Problems Solved: - Improving accuracy in recognizing assistant commands - Enhancing security by verifying user identity and authorization - Streamlining the process of executing commands based on audio input

Benefits: - Increased efficiency in executing assistant commands - Enhanced user experience through accurate voice recognition - Improved security measures for voice-activated systems

Commercial Applications: Title: "Enhanced Voice Recognition Technology for Assistant Commands" This technology can be applied in various industries such as: - Consumer electronics - Home automation - Automotive industry - Healthcare for hands-free operation of devices

Questions about Voice Recognition Technology: 1. How does this technology improve the accuracy of recognizing assistant commands? 2. What are the potential security implications of using speaker identification models in this process?

Frequently Updated Research: Stay updated on advancements in automatic speech recognition technology and speaker identification models to enhance the performance of this voice recognition system.


Original Abstract Submitted

some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (asr) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate asr output, and determine, based on processing the asr output, whether a user intended the assistant command to be performed. additional or alternative implementations can process the stream of audio data using a speaker identification (sid) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.