NEC Laboratories America, Inc. (20250148766). LEVERAGING SEMANTIC INFORMATION FOR A MULTI-DOMAIN VISUAL AGENT
LEVERAGING SEMANTIC INFORMATION FOR A MULTI-DOMAIN VISUAL AGENT
Organization Name
NEC Laboratories America, Inc.
Inventor(s)
Vijay Kumar Baikampady Gopalkrishna of Santa Clara, CA, US
Masoud Faraki of Redwood City, CA, US
Yumin Suh of Santa Clara, CA, US
Manmohan Chandraker of Santa Clara, CA, US
This abstract first appeared for US patent application 20250148766, titled 'LEVERAGING SEMANTIC INFORMATION FOR A MULTI-DOMAIN VISUAL AGENT'.
Original Abstract Submitted
Systems and methods for leveraging semantic information for a multi-domain visual agent. Semantic information can be leveraged to obtain a multi-domain visual agent. To train the multi-domain visual agent, questions can be sampled from question templates for domain-specific label spaces to obtain a unified label space. The domain-specific labels from the domain-specific label spaces can be mapped into natural language descriptions (NLD) to obtain mapped NLD. The mapped NLD can be converted into prompts by combining the questions sampled from the unified label space and the annotations. The semantic information can be learned by iteratively generating outputs from tokens extracted from the prompts using a large language model (LLM). The multi-domain visual agent (MDVA) can be trained using the semantic information.
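The prompt-construction steps in the abstract (mapping domain-specific labels to NLD, sampling a question template, and combining the question with an annotation) might look roughly like the minimal Python sketch below. This is one possible reading, not the applicants' implementation: the templates, the label-to-NLD tables, the domain names, the annotation format, and the function names `build_unified_label_space` and `build_prompt` are all hypothetical, since the abstract does not specify any of them. The LLM and training steps are not shown.

```python
import random

# Hypothetical question templates; the abstract does not list any.
QUESTION_TEMPLATES = [
    "Where is the {nld} in the image?",
    "Locate every {nld} in this scene.",
]

# Hypothetical domain-specific label spaces, each label mapped to a
# natural language description (NLD). All entries are made up.
LABEL_TO_NLD = {
    "driving": {
        "ped": "person walking on or near the road",
        "veh": "car, truck, or bus",
    },
    "indoor": {
        "chr": "chair",
        "tbl": "table",
    },
}


def build_unified_label_space(label_to_nld):
    """Merge the NLDs of all domains into one sorted, de-duplicated list."""
    return sorted(
        {nld for labels in label_to_nld.values() for nld in labels.values()}
    )


def build_prompt(domain, label, annotation, rng):
    """Combine a sampled question about the label's NLD with its annotation."""
    nld = LABEL_TO_NLD[domain][label]
    question = rng.choice(QUESTION_TEMPLATES).format(nld=nld)
    return f"Question: {question}\nAnswer: {annotation}"


if __name__ == "__main__":
    rng = random.Random(0)  # seeded so the sampled template is repeatable
    print(build_unified_label_space(LABEL_TO_NLD))
    # The bounding-box string is an assumed annotation format.
    print(build_prompt("driving", "ped", "[10, 20, 50, 80]", rng))
```

Describing labels in natural language rather than as per-domain codes is what lets labels from different datasets share a single text space in this sketch; in the abstract's pipeline, prompts of this kind would then be tokenized and passed to the LLM.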