20250217405. Multimedia Data Search Using Multi- (Roku, .)
MULTIMEDIA DATA SEARCH USING MULTI-MODAL FEATURE EMBEDDINGS
Abstract: Aspects of the disclosed technology provide solutions for searching objects within multimedia content based on multi-modal embeddings. An example method can include receiving media content including a plurality of video frames. The method can include steps for generating, using a pre-output layer of a machine learning algorithm, one or more multimodal feature embeddings describing at least one object for the plurality of video frames, receiving a query including a request to search the media content for a matching object, determining whether the media content includes the matching object based on the one or more multimodal feature embeddings describing the at least one object, and returning one or more results in response to determining that the media content includes the matching object. Systems and machine-readable media are also provided.
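Illustrative sketch (not part of the filing): the abstract does not say how a query is compared against the stored embeddings, so the following assumes cosine similarity with a fixed threshold. The function names, the threshold value, and the hand-written toy vectors are all hypothetical; in practice the per-frame vectors would come from a pre-output layer of a trained model, and the query would be embedded into the same space.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search_frames(frame_embeddings, query_embedding, threshold=0.8):
    """Return (frame_index, score) pairs for frames that match the query.

    A frame "matches" when its similarity to the query is at least
    `threshold` (an assumed criterion). Results are sorted best first.
    """
    hits = []
    for idx, emb in enumerate(frame_embeddings):
        score = cosine(emb, query_embedding)
        if score >= threshold:
            hits.append((idx, score))
    hits.sort(key=lambda h: h[1], reverse=True)
    return hits


# Toy stand-ins for per-frame embeddings (assumed, not model output).
frames = [
    [1.0, 0.0, 0.0],  # frame 0
    [0.9, 0.1, 0.0],  # frame 1, close to frame 0
    [0.0, 1.0, 0.0],  # frame 2, unrelated
]
query = [1.0, 0.0, 0.0]  # toy stand-in for an embedded query

results = search_frames(frames, query)
matching_frames = [idx for idx, _ in results]
```

An empty result list corresponds to the case where the media content is determined not to include the matching object.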
Inventor(s): Gregory Garner, Sunil Ramesh
CPC Classification: G06F16/432 (Query formulation)