18244762. SELECTIVELY PROVIDING ENHANCED CLARIFICATION PROMPTS IN AUTOMATED ASSISTANT INTERACTIONS simplified abstract (GOOGLE LLC)

From WikiPatents
Jump to navigation Jump to search

SELECTIVELY PROVIDING ENHANCED CLARIFICATION PROMPTS IN AUTOMATED ASSISTANT INTERACTIONS

Organization Name

GOOGLE LLC

Inventor(s)

Matthew Sharifi of Kilchberg (CH)

Victor Carbune of Zurich (CH)

SELECTIVELY PROVIDING ENHANCED CLARIFICATION PROMPTS IN AUTOMATED ASSISTANT INTERACTIONS - A simplified explanation of the abstract

This abstract first appeared for US patent application 18244762 titled 'SELECTIVELY PROVIDING ENHANCED CLARIFICATION PROMPTS IN AUTOMATED ASSISTANT INTERACTIONS

Simplified Explanation

Implementations described in this patent application aim to improve the accuracy of voice recognition systems by addressing ambiguous spoken utterances. When an utterance can be interpreted as requesting multiple actions, an enhanced clarification prompt is provided to the user to disambiguate between the possible actions. This prompt goes beyond natural language and includes additional user interface elements to gather input.

  • Implementations receive audio data capturing a spoken utterance.
  • Based on the audio data, a recognition corresponding to the utterance is generated.
  • If the recognition is ambiguous, meaning it can be interpreted as requesting different actions exclusively, an enhanced clarification prompt is provided.
  • The enhanced clarification prompt includes elements beyond natural language to solicit further user input for disambiguation.
  • This approach replaces the traditional natural language-only clarification prompt.

Potential applications of this technology:

  • Voice assistants: Improving the accuracy and user experience of voice-controlled virtual assistants like Siri, Alexa, or Google Assistant.
  • Automotive systems: Enhancing voice recognition systems in cars for safer and more efficient hands-free operation.
  • Smart home devices: Improving voice-controlled devices like smart speakers or thermostats to better understand user commands.

Problems solved by this technology:

  • Ambiguity in spoken utterances: Addressing situations where a voice command can have multiple exclusive interpretations, leading to incorrect actions or confusion.
  • User frustration: Reducing frustration caused by voice assistants misunderstanding or misinterpreting commands.
  • Efficiency: Streamlining the user interaction process by quickly resolving ambiguous requests.

Benefits of this technology:

  • Improved accuracy: By providing an enhanced clarification prompt, the system can gather additional input to disambiguate user requests accurately.
  • Enhanced user experience: The inclusion of non-verbal elements in the prompt makes it easier for users to clarify their intended actions.
  • Time-saving: By quickly resolving ambiguous requests, the system can provide the desired response or action without unnecessary delays.


Original Abstract Submitted

Implementations described herein receive audio data that captures a spoken utterance, generate, based on processing the audio data, a recognition that corresponds to the spoken utterance, and determine, based on processing the recognition, that the spoken utterance is ambiguous (i.e., is interpretable as requesting performance of a first particular action exclusively and is also interpretable a second particular action exclusively). In response to determining that the spoken utterance is ambiguous, implementations determine to provide an enhanced clarification prompt that renders output that is in addition to natural language. The enhanced clarification prompt solicits further user interface input for disambiguating between the first particular action and the second particular action. Determining to provide the enhanced clarification prompt includes a current or prior determination to provide the enhanced clarification prompt instead of a natural language (NL) only clarification prompt that is restricted to rendering natural language.