17967703. VISUAL AND TEXT SEARCH INTERFACE FOR TEXT-BASED VIDEO EDITING simplified abstract (Adobe Inc.)
Contents
- 1 VISUAL AND TEXT SEARCH INTERFACE FOR TEXT-BASED VIDEO EDITING
- 1.1 Organization Name
- 1.2 Inventor(s)
- 1.3 VISUAL AND TEXT SEARCH INTERFACE FOR TEXT-BASED VIDEO EDITING - A simplified explanation of the abstract
- 1.4 Simplified Explanation
- 1.5 Key Features and Innovation
- 1.6 Potential Applications
- 1.7 Problems Solved
- 1.8 Benefits
- 1.9 Commercial Applications
- 1.10 Prior Art
- 1.11 Frequently Updated Research
- 1.12 Questions about Visual and Text Search Interface for Video Transcripts
- 1.13 Original Abstract Submitted
VISUAL AND TEXT SEARCH INTERFACE FOR TEXT-BASED VIDEO EDITING
Organization Name
Inventor(s)
Lubomira Assenova Dontcheva of Seattle WA (US)
Dingzeyu Li of Seattle WA (US)
Kim Pascal Pimmel of Seattle WA (US)
Hijung Shin of Arlington MA (US)
Hanieh Deilamsalehy of Seattle WA (US)
Aseem Omprakash Agarwala of Seattle WA (US)
Joy Oakyung Kim of Sunnyvale CA (US)
Joel Richard Brandt of Venice CA (US)
Cristin Ailidh Fraser of Seattle WA (US)
VISUAL AND TEXT SEARCH INTERFACE FOR TEXT-BASED VIDEO EDITING - A simplified explanation of the abstract
This abstract first appeared for US patent application 17967703 titled 'VISUAL AND TEXT SEARCH INTERFACE FOR TEXT-BASED VIDEO EDITING
Simplified Explanation
The patent application describes a system for searching and navigating video transcripts using both visual and text queries.
- Visual search for frames matching a freeform text query
- Text search for matching words in the transcript or video tags
- Display of search results in tiles for easy navigation
- Selection of a search result tile navigates to the corresponding part of the transcript
Key Features and Innovation
- Integration of visual and text search for video transcripts
- Seamless navigation through search results for efficient browsing
- Enhanced user experience with a combination of visual and text cues
Potential Applications
- Video content management systems
- Educational platforms for searching through video lectures
- Legal and forensic analysis of video evidence
Problems Solved
- Efficient searching and navigation of video transcripts
- Integration of visual and text cues for enhanced user experience
- Streamlining the process of finding specific information within video content
Benefits
- Improved search functionality for video transcripts
- Enhanced user experience with a combination of visual and text search
- Efficient navigation through video content for quick access to relevant information
Commercial Applications
The technology can be applied in video content platforms, educational software, and legal analysis tools to enhance search and navigation capabilities, providing a more efficient and user-friendly experience for users.
Prior Art
No specific information on prior art related to this technology is provided in the abstract.
Frequently Updated Research
There is no information on frequently updated research relevant to this technology in the abstract.
Questions about Visual and Text Search Interface for Video Transcripts
Question 1
How does the system differentiate between visual and text search queries in the video transcript interface?
The system uses frame embeddings for visual search and matches them with the corresponding embedding of the freeform text query. It also conducts a text search for matching words in the transcript or video tags.
Question 2
What are the potential applications of this technology beyond video content management systems?
This technology can also be utilized in educational platforms for searching through video lectures and in legal and forensic analysis of video evidence.
Original Abstract Submitted
Embodiments of the present invention provide systems, methods, and computer storage media for a visual and text search interface used to navigate a video transcript. In an example embodiment, a freeform text query triggers a visual search for frames of a loaded video that match the freeform text query (e.g., frame embeddings that match a corresponding embedding of the freeform query), and triggers a text search for matching words from a corresponding transcript or from tags of detected features from the loaded video. Visual search results are displayed (e.g., in a row of tiles that can be scrolled to the left and right), and textual search results are displayed (e.g., in a row of tiles that can be scrolled up and down). Selecting (e.g., clicking or tapping on) a search result tile navigates a transcript interface to a corresponding portion of the transcript.
- Adobe Inc.
- Lubomira Assenova Dontcheva of Seattle WA (US)
- Dingzeyu Li of Seattle WA (US)
- Kim Pascal Pimmel of Seattle WA (US)
- Hijung Shin of Arlington MA (US)
- Hanieh Deilamsalehy of Seattle WA (US)
- Aseem Omprakash Agarwala of Seattle WA (US)
- Joy Oakyung Kim of Sunnyvale CA (US)
- Joel Richard Brandt of Venice CA (US)
- Cristin Ailidh Fraser of Seattle WA (US)
- G06F16/732
- CPC G06F16/732