US Patent Application 17739266. DISTRIBUTOR-SIDE GENERATION OF CAPTIONS BASED ON VARIOUS VISUAL AND NON-VISUAL ELEMENTS IN CONTENT simplified abstract

From WikiPatents
Jump to navigation Jump to search

DISTRIBUTOR-SIDE GENERATION OF CAPTIONS BASED ON VARIOUS VISUAL AND NON-VISUAL ELEMENTS IN CONTENT

Organization Name

Sony Group Corporation


Inventor(s)

BRANT Candelore of POWAY CA (US)

ADAM Goldberg of FAIRFAX VA (US)

DISTRIBUTOR-SIDE GENERATION OF CAPTIONS BASED ON VARIOUS VISUAL AND NON-VISUAL ELEMENTS IN CONTENT - A simplified explanation of the abstract

This abstract first appeared for US patent application 17739266 titled 'DISTRIBUTOR-SIDE GENERATION OF CAPTIONS BASED ON VARIOUS VISUAL AND NON-VISUAL ELEMENTS IN CONTENT

Simplified Explanation

The abstract describes a content distribution system and method for generating captions for media content.

  • The system receives video and audio content and analyzes the audio to generate a text based on speech-to-text analysis.
  • The system also generates a second text that describes audio elements of the scene, which are different from the speech component.
  • Captions for the video content are generated based on the first and second texts.
  • The generated captions can be transmitted to electronic devices through various means such as OTA signal, cable, or streaming Internet connection.


Original Abstract Submitted

A content distribution system and method for distribution-side generation of captions is disclosed. The content distribution system receives media content including video content and audio content associated with the video content and generates a first text based on a speech-to-text analysis of the audio content. The content distribution system further generates a second text that describes audio elements of a scene associated with the media content. The audio elements are different from a speech component of the audio content. The content distribution system further generates captions for the video content based on the first text and the second text and transmits the generated captions to an electronic device via an Over-the-Air (OTA) signal, via a cable, or via a streaming Internet connection.