
Google LLC (20250010482). Multimodal Object Identification

From WikiPatents
Revision as of 10:00, 25 March 2025 by Unknown user (talk) (Creating a new page)

Multimodal Object Identification

Organization Name

Google LLC

Inventor(s)

Michael Joseph Quinlan of Sunnyvale CA US

Gabriel A. Cohen of Alameda CA US

Multimodal Object Identification

This abstract first appeared for US patent application 20250010482, titled 'Multimodal Object Identification'.

Original Abstract Submitted

Methods, systems, and apparatus for receiving a command for controlling a robot, the command referencing an object; receiving sensor data for a portion of an environment of the robot; identifying, from the sensor data, a gesture of a human that indicates a spatial region located outside of the portion of the environment described by the sensor data; searching map data for the object; determining, based at least on searching the map data for the object referenced in the command, that the object referenced in the command is present in the spatial region; and, in response to determining that the object referenced in the command is present in the spatial region, controlling the robot to perform an action with respect to the object referenced in the command.
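
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the method claimed in the application, only one possible reading under strong simplifying assumptions: the world is 2D, the gesture is a pointing ray, the indicated spatial region is a cone around that ray, and the sensed portion of the environment is a disk centred on the gesture origin. All names here (MapObject, PointingGesture, resolve_object, plan_action, the "navigate_to" action) are hypothetical and do not come from the application.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class MapObject:
    """An entry in the (hypothetical) map data: a labelled 2D position."""
    label: str
    x: float
    y: float


@dataclass(frozen=True)
class PointingGesture:
    """A pointing gesture reduced to a ray in the map frame (assumption)."""
    origin_x: float
    origin_y: float
    heading_rad: float


def in_indicated_region(obj: MapObject, gesture: PointingGesture,
                        sensed_radius: float, half_angle_rad: float) -> bool:
    """True if obj lies in the cone around the gesture AND outside the sensed disk."""
    dx, dy = obj.x - gesture.origin_x, obj.y - gesture.origin_y
    if math.hypot(dx, dy) <= sensed_radius:
        return False  # already covered by the sensor data, not the indicated region
    bearing = math.atan2(dy, dx)
    # Wrap the angular difference into [-pi, pi] before comparing.
    delta = bearing - gesture.heading_rad
    diff = math.atan2(math.sin(delta), math.cos(delta))
    return abs(diff) <= half_angle_rad


def resolve_object(label: str, gesture: PointingGesture,
                   map_objects: Sequence[MapObject],
                   sensed_radius: float = 2.0,
                   half_angle_rad: float = math.radians(15)) -> Optional[MapObject]:
    """Search the map for the referenced label inside the indicated region."""
    candidates = [
        o for o in map_objects
        if o.label == label
        and in_indicated_region(o, gesture, sensed_radius, half_angle_rad)
    ]
    if not candidates:
        return None
    # Tie-break (an assumption): prefer the candidate nearest the gesture origin.
    return min(candidates, key=lambda o: math.hypot(o.x - gesture.origin_x,
                                                    o.y - gesture.origin_y))


def plan_action(label: str, gesture: PointingGesture,
                map_objects: Sequence[MapObject]) -> Optional[Tuple[str, float, float]]:
    """Return a placeholder robot action only if the object is found in the region."""
    target = resolve_object(label, gesture, map_objects)
    if target is None:
        return None
    return ("navigate_to", target.x, target.y)
```

In this sketch, a command such as "fetch the cup" combined with a gesture pointing along the x-axis would select a mapped cup that lies within 15 degrees of the pointing direction and beyond the 2-unit sensed radius, while ignoring cups that are already visible or that lie in other directions. The cone width, the sensed radius, and the nearest-candidate tie-break are illustrative choices, not values taken from the application.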
