Qualcomm incorporated (20240232258). SOUND SEARCH simplified abstract

From WikiPatents
Revision as of 06:39, 11 July 2024 by Wikipatents (talk | contribs) (Creating a new page)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

SOUND SEARCH

Organization Name

qualcomm incorporated

Inventor(s)

Rehana Mahfuz of San Diego CA (US)

Yinyi Guo of San Diego CA (US)

Erik Visser of San Diego CA (US)

SOUND SEARCH - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240232258 titled 'SOUND SEARCH

The device described in the abstract is capable of generating query caption embeddings based on a query and selecting caption embeddings from a set of media files in a file repository, representing sound captions.

  • Processors generate query caption embeddings and select caption embeddings from a set of media files based on similarity metrics.
  • Search results are generated to identify media files associated with selected caption embeddings.
  • Each sound caption includes a natural-language text description of a sound.
  • The technology aims to improve search and retrieval of media files based on sound descriptions.
      1. Potential Applications:

This technology can be used in content management systems, multimedia search engines, and audio recognition applications.

      1. Problems Solved:

This technology addresses the challenge of efficiently searching for media files based on sound descriptions.

      1. Benefits:

- Enhanced search capabilities for media files - Improved user experience in finding specific content - Efficient organization and retrieval of multimedia data

      1. Commercial Applications:

The technology can be utilized in online platforms for audio content search, digital asset management systems, and entertainment industry applications.

      1. Prior Art:

Researchers can explore existing patents related to audio content search, multimedia retrieval systems, and natural language processing technologies.

      1. Frequently Updated Research:

Stay updated on advancements in audio recognition technologies, multimedia search algorithms, and natural language processing techniques.

        1. Questions about the Technology:

1. How does this technology improve the search experience for users? 2. What are the potential limitations of using similarity metrics for selecting caption embeddings?


Original Abstract Submitted

a device includes one or more processors configured to generate one or more query caption embeddings based on a query. the processor(s) are further configured to select one or more caption embeddings from among a set of embeddings associated with a set of media files of a file repository. each caption embedding represents a corresponding sound caption, and each sound caption includes a natural-language text description of a sound. the caption embedding(s) are selected based on a similarity metric indicative of similarity between the caption embedding(s) and the query caption embedding(s). the processor(s) are further configured to generate search results identifying one or more first media files of the set of media files. each of the first media file(s) is associated with at least one of the caption embedding(s).