Jump to content

Google llc (20240428587). Systems and Methods for Improved Video Understanding

From WikiPatents
Revision as of 02:53, 30 December 2024 by Unknown user (talk) (Creating a new page)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Systems and Methods for Improved Video Understanding

Organization Name

google llc

Inventor(s)

Anurag Arnab of Grenoble (FR)

Mostafa Dehghani of Amsterdam (NL)

Georg Heigold of Aachen (DE)

Chen Sun of San Francisco CA (US)

Mario Lucic of Adliswil (CH)

Cordelia Luise Schmid of Saint-Ismier (FR)

Systems and Methods for Improved Video Understanding

This abstract first appeared for US patent application 20240428587 titled 'Systems and Methods for Improved Video Understanding



Original Abstract Submitted

a computer-implemented method for classifying video data with improved accuracy includes obtaining, by a computing system comprising one or more computing devices, video data comprising a plurality of video frames; extracting, by the computing system, a plurality of video tokens from the video data, the plurality of video tokens comprising a representation of spatiotemporal information in the video data; providing, by the computing system, the plurality of video tokens as input to a video understanding model, the video understanding model comprising a video transformer encoder model; and receiving, by the computing system, a classification output from the video understanding model.

(Ad) Transform your business with AI in minutes, not months

Custom AI strategy for your specific industry
Step-by-step implementation with clear ROI
5-minute setup - no technical skills needed
Get your AI playbook
Cookies help us deliver our services. By using our services, you agree to our use of cookies.