Jump to content

Google llc (20250006184). MULTIMODAL INTENT UNDERSTANDING FOR AUTOMATED ASSISTANT

From WikiPatents
Revision as of 03:46, 25 March 2025 by Unknown user (talk) (Creating a new page)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

MULTIMODAL INTENT UNDERSTANDING FOR AUTOMATED ASSISTANT

Organization Name

google llc

Inventor(s)

Victor Carbune of Zurich CH

Matthew Sharifi of Kilchberg CH

MULTIMODAL INTENT UNDERSTANDING FOR AUTOMATED ASSISTANT

This abstract first appeared for US patent application 20250006184 titled 'MULTIMODAL INTENT UNDERSTANDING FOR AUTOMATED ASSISTANT

Original Abstract Submitted

implementations described herein include detecting a stream of audio data that captures a spoken utterance of the user and that captures ambient noise occurring within a threshold time period of the spoken utterance being spoken by the user. implementations further include processing a portion of the audio data that includes the ambient noise to determine ambient noise classification(s), processing a portion of the audio data that includes the spoken utterance to generate a transcription, processing both the transcription and the ambient noise classification(s) with a machine learning model to generate a user intent and parameter(s) for the user intent, and performing one or more automated assistant actions based on the user intent and using the parameter(s).

(Ad) Transform your business with AI in minutes, not months

Custom AI strategy tailored to your specific industry needs
Step-by-step implementation with measurable ROI
5-minute setup that requires zero technical skills
Get your AI playbook

Trusted by 1,000+ companies worldwide

Cookies help us deliver our services. By using our services, you agree to our use of cookies.