
20250166367. Object Detection U (QUALCOMM Technologies, .)

From WikiPatents

OBJECT DETECTION USING VISUAL LANGUAGE MODELS VIA LATENT FEATURE ADAPTATION WITH SYNTHETIC DATA

Abstract: Systems and techniques are described herein for adapting a pretrained machine learning model. For instance, a process can include encoding a training image into a first feature vector, the training image including a first object located at a first location; generating a second feature vector based on a set of sinusoidal functions using a set of weights; combining the first feature vector with a second feature vector to generate a combined feature vector; processing the combined feature vector using a visual language model to obtain a second location for the first object; and adjusting the set of weights based on a comparison between the first location and the second location.
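
The steps in the abstract can be illustrated with a toy sketch. Everything concrete here is an assumption made for illustration and is not taken from the application: the column-mean encoder, the fixed sinusoidal basis, element-wise addition as the combining step, a frozen linear map standing in for the visual language model, and squared error as the comparison between the two locations. Only the set of weights on the sinusoids is updated; the stand-in model stays fixed.

```python
import math

D = 4  # feature dimension (assumed for this toy example)
K = 3  # number of sinusoidal functions (assumed)

# Hypothetical stand-in for the frozen pretrained model: a fixed linear
# map from a feature vector to an (x, y) location. A real visual
# language model would be far larger; only its frozen role is mirrored.
FROZEN_MODEL = [[0.5, -0.2, 0.1, 0.3],
                [-0.1, 0.4, 0.2, -0.3]]

# Fixed sinusoidal basis (assumed form): BASIS[k][i] = sin((k+1) * (i+1)).
BASIS = [[math.sin((k + 1) * (i + 1)) for i in range(D)] for k in range(K)]


def encode_image(image):
    """Toy encoder (assumption): the mean of each pixel column."""
    return [sum(row[i] for row in image) / len(image) for i in range(D)]


def sinusoidal_features(weights):
    """Second feature vector: a weighted sum of the sinusoidal basis."""
    return [sum(weights[k] * BASIS[k][i] for k in range(K)) for i in range(D)]


def combine(first, second):
    """Combine by element-wise addition (assumption)."""
    return [a + b for a, b in zip(first, second)]


def predict_location(features):
    """Apply the frozen stand-in model to get a predicted (x, y)."""
    return [sum(m * f for m, f in zip(row, features)) for row in FROZEN_MODEL]


def squared_error(pred, target):
    return sum((p - t) ** 2 for p, t in zip(pred, target))


def train_step(weights, first, target, lr=0.05):
    """One gradient-descent update of the sinusoid weights only."""
    pred = predict_location(combine(first, sinusoidal_features(weights)))
    residual = [p - t for p, t in zip(pred, target)]
    updated = []
    for k in range(K):
        # d(loss)/d(w_k) = sum_j 2 * residual_j * sum_i MODEL[j][i] * BASIS[k][i]
        grad = sum(
            2 * residual[j] * sum(FROZEN_MODEL[j][i] * BASIS[k][i] for i in range(D))
            for j in range(len(FROZEN_MODEL))
        )
        updated.append(weights[k] - lr * grad)
    return updated


# Training image with a single bright pixel as the "first object".
image = [[0.0, 0.0, 0.0, 0.0],
         [0.0, 0.0, 1.0, 0.0],
         [0.0, 0.0, 0.0, 0.0],
         [0.0, 0.0, 0.0, 0.0]]
first_location = [0.625, 0.375]  # assumed normalized (x, y) ground truth

first = encode_image(image)
weights = [0.0] * K
initial_loss = squared_error(
    predict_location(combine(first, sinusoidal_features(weights))), first_location)

for _ in range(200):
    weights = train_step(weights, first, first_location)

second_location = predict_location(combine(first, sinusoidal_features(weights)))
final_loss = squared_error(second_location, first_location)
```

In this sketch the error between the predicted (second) location and the ground-truth (first) location shrinks as the weights are adjusted, while the stand-in model itself is never modified.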

Inventor(s): Michael DORKENWALD, Yuki ASANO

CPC Classification: G06V10/86 (IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING)


