US Patent Application 17739253. GENERATION OF CLOSED CAPTIONS BASED ON VARIOUS VISUAL AND NON-VISUAL ELEMENTS IN CONTENT simplified abstract

From WikiPatents

GENERATION OF CLOSED CAPTIONS BASED ON VARIOUS VISUAL AND NON-VISUAL ELEMENTS IN CONTENT

Organization Name

Sony Group Corporation


Inventor(s)

BRANT Candelore of POWAY CA (US)

ADAM Goldberg of FAIRFAX VA (US)

ROBERT Blanchard of ST. GEORGE UT (US)

GENERATION OF CLOSED CAPTIONS BASED ON VARIOUS VISUAL AND NON-VISUAL ELEMENTS IN CONTENT - A simplified explanation of the abstract

This abstract first appeared for US patent application 17739253, titled 'GENERATION OF CLOSED CAPTIONS BASED ON VARIOUS VISUAL AND NON-VISUAL ELEMENTS IN CONTENT'.

Simplified Explanation

- The patent application describes an electronic device and method for generating closed captions for media content.
- The device receives media content that includes video content and its associated audio content.
- The device generates a first text by converting the audio content into text using speech-to-text analysis.
- The device also generates a second text that describes audio elements of a scene in the media content. These audio elements are different from the speech component of the audio.
- Based on the first and second texts, the device generates closed captions for the video content.
- The device controls a display device to show the closed captions.


Original Abstract Submitted

An electronic device and method for generation of closed captions based on various visual and non-visual elements in content is disclosed. The electronic device receives media content including video content and audio content associated with the video content. The electronic device generates a first text based on a speech-to-text analysis of the audio content. The electronic device further generates a second text which describes audio elements of a scene associated with the media content. The audio elements are different from a speech component of the audio content. The electronic device further generates closed captions for the video content, based on the first text and the second text and controls a display device to display the closed captions.
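
The flow in the abstract can be illustrated with a minimal Python sketch. It is not the applicant's implementation: the `TimedText` type, the `merge_captions` function, the use of timestamps, and the square-bracket convention for non-speech sounds are all assumptions made for illustration. The speech-to-text analysis and the audio-element description steps are represented by hard-coded sample data rather than real recognizers.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TimedText:
    """A piece of text tied to a time span (hypothetical type, not from the application)."""
    start: float  # seconds from the start of the content
    end: float
    text: str


def merge_captions(first_text: List[TimedText],
                   second_text: List[TimedText]) -> List[TimedText]:
    """Combine speech text and non-speech descriptions into one caption track.

    first_text  : output of a speech-to-text step (assumed to be timestamped)
    second_text : descriptions of non-speech audio elements of the scene
    Non-speech entries are wrapped in square brackets, an assumed convention.
    """
    # Tag each entry so that speech sorts before a sound with the same start time.
    tagged = [(seg.start, 0, seg) for seg in first_text]
    tagged += [
        (seg.start, 1, TimedText(seg.start, seg.end, f"[{seg.text}]"))
        for seg in second_text
    ]
    tagged.sort(key=lambda item: (item[0], item[1]))
    return [seg for _, _, seg in tagged]


# Stand-in data; a real system would obtain these from audio analysis.
speech = [
    TimedText(0.0, 2.0, "Did you hear that?"),
    TimedText(3.5, 5.0, "Someone is at the door."),
]
sounds = [TimedText(2.2, 3.0, "door creaks")]

for caption in merge_captions(speech, sounds):
    print(f"{caption.start:5.1f}s  {caption.text}")
```

Keeping the two texts as separate inputs mirrors the abstract's distinction between the speech component and the other audio elements. Ordering by start time is only one plausible way to combine them.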