18336423. VISUAL GROUNDING OF SELF-SUPERVISED REPRESENTATIONS FOR MACHINE LEARNING MODELS UTILIZING DIFFERENCE ATTENTION (Adobe Inc.)

From WikiPatents
Revision as of 07:32, 19 December 2024 by Wikipatents (talk | contribs) (Creating a new page)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

VISUAL GROUNDING OF SELF-SUPERVISED REPRESENTATIONS FOR MACHINE LEARNING MODELS UTILIZING DIFFERENCE ATTENTION

Organization Name

Adobe Inc.

Inventor(s)

Aishwarya Agarwal of Bengaluru (IN)

Srikrishna Karanam of Bangalore (IN)

Balaji Vasan Srinivasan of Bangalore (IN)

VISUAL GROUNDING OF SELF-SUPERVISED REPRESENTATIONS FOR MACHINE LEARNING MODELS UTILIZING DIFFERENCE ATTENTION

This abstract first appeared for US patent application 18336423 titled 'VISUAL GROUNDING OF SELF-SUPERVISED REPRESENTATIONS FOR MACHINE LEARNING MODELS UTILIZING DIFFERENCE ATTENTION



Original Abstract Submitted

The present disclosure relates to systems, non-transitory computer-readable media, and methods for utilizing difference attention to evaluate and/or train machine learning models. In particular, in some embodiments, the disclosed systems generate, utilizing a machine learning model, a first feature vector from a digital image. In one or more implementations, the disclosed systems generate a masked digital image by masking a region from the digital image. Additionally, in some embodiments, the disclosed systems generate, utilizing the machine learning model, a second feature vector from the masked digital image. Moreover, in some implementations, the disclosed systems determine a difference feature vector between the first feature vector and the second feature vector. Furthermore, in some embodiments, the disclosed systems generate, from the difference feature vector, a difference attention map reflecting a visual grounding of the machine learning model relative to the region.