International Business Machines Corporation (20250014569). STEREOPHONIC AUDIO GENERATION
STEREOPHONIC AUDIO GENERATION
Organization Name
International Business Machines Corporation
Inventor(s)
Yasmin Aumeeruddy of South Woodford (GB)
Thomas Jeffrey Solomon of Fordingbridge (GB)
STEREOPHONIC AUDIO GENERATION
This abstract first appeared for US patent application 20250014569 titled 'STEREOPHONIC AUDIO GENERATION'.
Original Abstract Submitted
Stereophonic audio generation for a video having a monophonic audio track includes using a feature recognition algorithm to identify visual features of interest in the video. For each visual feature of interest, a spatial location of the visual feature is determined in the video. A sound of interest is identified in the monophonic audio track and an audio fingerprint is determined for the sound of interest. The video is analyzed based on the sound of interest and the audio fingerprint to identify if the sound of interest is linked to any of the visual features. Responsive to identifying the sound of interest is linked to a visual feature of interest, the sound of interest is associated with the spatial location of the visual feature in the video. The stereo location of the sound of interest is determined within the stereoscopic audio for the video based on the associated spatial location.
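The final step of the abstract, deriving a stereo location from the spatial location of a linked visual feature, can be illustrated with a minimal Python sketch. The abstract does not say how the mapping is done, so everything here is an assumption made for illustration: the function names (`pan_from_x`, `constant_power_gains`, `place_mono_sound`) are hypothetical, only the horizontal position of the feature is used, and a constant-power pan law is swapped in as one common way to place a mono sound between two channels. It is not presented as the method claimed in the application.

```python
import math
from typing import List, Sequence, Tuple


def pan_from_x(x_center: float, frame_width: float) -> float:
    """Map a horizontal pixel position to a pan value in [-1.0, 1.0].

    -1.0 is fully left, 0.0 is centre, 1.0 is fully right.
    Assumption: only the horizontal position of the feature matters.
    """
    if frame_width <= 0:
        raise ValueError("frame_width must be positive")
    pan = 2.0 * (x_center / frame_width) - 1.0
    # Clamp in case the feature's centre lies slightly outside the frame.
    return max(-1.0, min(1.0, pan))


def constant_power_gains(pan: float) -> Tuple[float, float]:
    """Return (left_gain, right_gain) under a constant-power pan law.

    Assumption: the pan law is an illustrative choice, not taken from
    the patent application. left^2 + right^2 == 1 for every pan value.
    """
    theta = (pan + 1.0) * math.pi / 4.0  # 0 (left) .. pi/2 (right)
    return math.cos(theta), math.sin(theta)


def place_mono_sound(
    samples: Sequence[float], x_center: float, frame_width: float
) -> List[Tuple[float, float]]:
    """Turn mono samples into (left, right) pairs for a feature at x_center."""
    left_gain, right_gain = constant_power_gains(pan_from_x(x_center, frame_width))
    return [(s * left_gain, s * right_gain) for s in samples]
```

For example, a sound linked to a feature centred at `x_center=960` in a 1920-pixel-wide frame gets equal gain in both channels, while a feature at the right edge is sent almost entirely to the right channel. A fuller pipeline would recompute the pan per frame as the feature moves.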