Beijing Youzhuju Network Technology Co, Ltd. (20240379116). AUDIO CAPTION ALIGNMENT METHOD AND APPARATUS, MEDIUM, AND ELECTRONIC DEVICE simplified abstract
Contents
AUDIO CAPTION ALIGNMENT METHOD AND APPARATUS, MEDIUM, AND ELECTRONIC DEVICE
Organization Name
Beijing Youzhuju Network Technology Co, Ltd.
Inventor(s)
AUDIO CAPTION ALIGNMENT METHOD AND APPARATUS, MEDIUM, AND ELECTRONIC DEVICE - A simplified explanation of the abstract
This abstract first appeared for US patent application 20240379116 titled 'AUDIO CAPTION ALIGNMENT METHOD AND APPARATUS, MEDIUM, AND ELECTRONIC DEVICE
The disclosure pertains to an audio caption alignment method and apparatus, a medium, and an electronic device. The method involves obtaining a target audio and a target caption text, slicing the target audio into multiple segments, determining audio feature information for each segment, concatenating the audio feature information, and generating caption information based on the target audio and caption text.
- Obtaining target audio and caption text
- Slicing target audio into segments
- Determining audio feature information for each segment
- Concatenating audio feature information
- Generating caption information based on audio and caption text
Potential Applications: - Automated captioning for audio content - Enhancing accessibility for individuals with hearing impairments - Improving searchability and indexing of audio files
Problems Solved: - Efficient alignment of audio and caption text - Streamlining the captioning process - Enhancing user experience for audio content consumers
Benefits: - Increased accessibility for a wider audience - Time-saving in captioning processes - Improved organization and searchability of audio content
Commercial Applications: Title: Automated Audio Captioning Technology for Enhanced Accessibility This technology can be utilized in media production companies, online streaming platforms, educational institutions, and accessibility-focused organizations to automate the captioning process and improve user experience for audio content consumers.
Questions about Audio Caption Alignment Technology: 1. How does this technology improve accessibility for individuals with hearing impairments? - This technology provides accurate and synchronized captions for audio content, making it more accessible for individuals who rely on captions to understand audio information.
2. What are the potential cost-saving benefits for businesses implementing this technology? - By automating the captioning process, businesses can save time and resources that would otherwise be spent on manual captioning, leading to cost efficiencies and improved productivity.
Original Abstract Submitted
the disclosure relates to an audio caption alignment method and apparatus, a medium, and an electronic device. the method includes: obtaining a target audio and a target caption text of the target audio; obtaining a plurality of first target audios by slicing the target audio according to a slicing duration in a case that a duration of the target audio is greater than a first preset duration; determining first audio feature information of each of the first target audios; obtaining target audio feature information of the target audio by concatenating all of the first audio feature information in a case that the duration of the target audio is less than or equal to a second preset duration, where the second preset duration is greater than the first preset duration; and generating caption information corresponding to the target audio according to the target caption text and the target audio feature information.