Jump to content

Google LLC (20250117185). USING AUDIO SEPARATION AND CLASSIFICATION TO ENHANCE AUDIO IN VIDEOS

From WikiPatents

USING AUDIO SEPARATION AND CLASSIFICATION TO ENHANCE AUDIO IN VIDEOS

Organization Name

Google LLC

Inventor(s)

Moonseok Kim of Mountain View CA US

Elliot Patros of Mountain View CA US

Sneh Singaraju of Mountain View CA US

Michelle Ansai of Mountain View CA US

Efthymios Tzinis of Mountain View CA US

USING AUDIO SEPARATION AND CLASSIFICATION TO ENHANCE AUDIO IN VIDEOS

This abstract first appeared for US patent application 20250117185 titled 'USING AUDIO SEPARATION AND CLASSIFICATION TO ENHANCE AUDIO IN VIDEOS

Original Abstract Submitted

a media application obtains a video that includes an audio portion. the media application separates the audio portion into a plurality of channels, where each channel corresponds to a particular audio source. an on-screen classifier model obtains an indication of whether the particular audio source for each channel is depicted in the video. an audio-type classifier model determines, an auditory object classification for each channel. the media application determines a respective gain for each channel based on the indication of whether the particular audio source for the channel is depicted in the video and the auditory object classification for the channel. the media application modifies each channel by applying the respective gain. the media application mixes the modified channels with the audio portion to generate a combined audio.

Cookies help us deliver our services. By using our services, you agree to our use of cookies.