20240046917. INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM FOR GENERATING SYNTHESIZED AUDIO CONTENT FROM TEXT WHEN AUDIO CONTENT IS NOT REPRODUCIBLE simplified abstract (TOYOTA JIDOSHA KABUSHIKI KAISHA)

From WikiPatents
Jump to navigation Jump to search

INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM FOR GENERATING SYNTHESIZED AUDIO CONTENT FROM TEXT WHEN AUDIO CONTENT IS NOT REPRODUCIBLE

Organization Name

TOYOTA JIDOSHA KABUSHIKI KAISHA

Inventor(s)

Jun Tsukamoto of Seto-shi (JP)

INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM FOR GENERATING SYNTHESIZED AUDIO CONTENT FROM TEXT WHEN AUDIO CONTENT IS NOT REPRODUCIBLE - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240046917 titled 'INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM FOR GENERATING SYNTHESIZED AUDIO CONTENT FROM TEXT WHEN AUDIO CONTENT IS NOT REPRODUCIBLE

Simplified Explanation

The patent application describes an information processing device that can receive audio data and corresponding text data. It includes a communication unit, an audio data reproduction unit, a text data reproduction unit, and a controller. The communication unit receives audio data and text data. The audio data reproduction unit plays the audio data, while the text data reproduction unit synthesizes the text data into audio. The controller controls the reproduction of the audio data or the text data. If the audio data reproduction unit is unable to play the audio data, the controller instructs the text data reproduction unit to synthesize and play the text data instead.

  • The device can receive both audio data and corresponding text data.
  • It can reproduce the audio data through audio playback.
  • It can also reproduce the text data by synthesizing it into audio.
  • The controller determines whether to play the audio data or the text data based on the capabilities of the audio data reproduction unit.
  • If the audio data reproduction unit is unable to play the audio data, the text data reproduction unit is instructed to play the synthesized text data.

Potential applications of this technology:

  • Accessibility: This technology can be beneficial for individuals with hearing impairments, as it allows them to access audio content through synthesized text.
  • Language learning: Users can listen to audio content while simultaneously reading the corresponding text, aiding in language learning and comprehension.
  • Multilingual support: The device can provide translations or subtitles by synthesizing the text data in different languages.

Problems solved by this technology:

  • Limited audio playback capabilities: In cases where the audio data reproduction unit is unable to play the audio data, the text data reproduction unit can provide an alternative method of accessing the content.
  • Accessibility barriers: This technology helps overcome barriers for individuals with hearing impairments by providing a text-to-speech synthesis option.

Benefits of this technology:

  • Enhanced accessibility: The device allows individuals with hearing impairments to access audio content through synthesized text.
  • Improved language learning: Users can simultaneously listen to audio and read the corresponding text, facilitating language learning and comprehension.
  • Multilingual support: The technology enables the synthesis of text data in different languages, providing translations or subtitles.


Original Abstract Submitted

an information processing device according to embodiments includes a communication unit configured to receive audio data of content and text data corresponding to the audio data, an audio data reproduction unit configured to perform reproduction of the audio data, a text data reproduction unit configured to perform the reproduction by audio synthesis of the text data, and a controller that controls the reproduction of the audio data or the text data. the controller causes the text data reproduction unit to perform the reproduction of the text data when the audio data reproduction unit is unable to perform the reproduction of the audio data.