International Business Machines Corporation (20250014569). STEREOPHONIC AUDIO GENERATION

From WikiPatents
Jump to navigation Jump to search

STEREOPHONIC AUDIO GENERATION

Organization Name

International Business Machines Corporation

Inventor(s)

Yasmin Aumeeruddy of South Woodford (GB)

Thomas Jeffrey Solomon of Fordingbridge (GB)

STEREOPHONIC AUDIO GENERATION

This abstract first appeared for US patent application 20250014569 titled 'STEREOPHONIC AUDIO GENERATION



Original Abstract Submitted

stereophonic audio generation for a video having a monophonic audio track includes using a feature recognition algorithm to identify visual features of interest in the video. for each visual feature of interest, a spatial location of the visual feature is determined in the video. a sound of interest is identified in the monophonic audio track and an audio fingerprint is determined for the sound of interest. the video is analyzed based on the sound of interest and the audio fingerprint to identify if the sound of interest is linked to any of the visual features. responsive to identifying the sound of interest is linked to a visual feature of interest, the sound of interest is associated with the spatial location of the visual feature in the video. the stereo location of the sound of interest is determined within the stereoscopic audio for the video based on the associated spatial location.