18746002. CLIP SEARCH WITH MULTIMODAL QUERIES (Tesla, Inc.)

From WikiPatents
Revision as of 07:46, 19 December 2024 by Wikipatents (talk | contribs) (Creating a new page)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

CLIP SEARCH WITH MULTIMODAL QUERIES

Organization Name

Tesla, Inc.

Inventor(s)

Matthew Wilson of Austin TX (US)

Tim Zaman of Austin TX (US)

Long Tran of Austin TX (US)

CLIP SEARCH WITH MULTIMODAL QUERIES

This abstract first appeared for US patent application 18746002 titled 'CLIP SEARCH WITH MULTIMODAL QUERIES



Original Abstract Submitted

Systems and methods are described herein to manage and search data generated during operation of a vehicle such as camera data generated by cameras. In an example, a system can obtain data associated with an input representing a query, extract an embedding representing one or more semantic elements, and compare the embedding to one or more predetermined embeddings. The system can select at least one predetermined embedding based on a degree of similarity between the predetermined embedding and the embedding extracted from the query. In examples, the data associated with an image corresponding to the predetermined embedding that was selected can be provided or included in a training dataset.