Jump to content

Analog Devices, Inc. (20250061911). METHOD AND SYSTEM FOR MUTLIPLE TIME RESOLUTION AUDIO PROCESSING

From WikiPatents

METHOD AND SYSTEM FOR MUTLIPLE TIME RESOLUTION AUDIO PROCESSING

Organization Name

Analog Devices, Inc.

Inventor(s)

Johannes Traa of Medford MA (US)

Atulya Yellepeddi of Medford MA (US)

Donald F. Porges of Watertown MA (US)

METHOD AND SYSTEM FOR MUTLIPLE TIME RESOLUTION AUDIO PROCESSING

This abstract first appeared for US patent application 20250061911 titled 'METHOD AND SYSTEM FOR MUTLIPLE TIME RESOLUTION AUDIO PROCESSING

Original Abstract Submitted

aspects of the present disclosure provided a method for voice control that includes transforming, using a short-time fourier transform (stft) applied to data in each window aligned across each input channel of the multichannel audio stream, the multichannel audio stream into a complex valued frequency-domain representation. for a current window, the method further includes: updating a first complex-valued covariance matrix corresponding to a slowly-adapting beamformer and forming a single-channel denoised estimate for each frequency band in the stft; calculating a voice activity detection (vad) estimate for each frequency band in the stft by comparing a magnitude of the single-channel denoised estimate to a magnitude of each input channel of the multichannel audio stream; and selectively updating or refraining from updating, responsive to the vad estimate respectively indicating a presence or an absence of speech, a second complex-valued covariance matrix corresponding to a quickly-adapting beamformer.

Cookies help us deliver our services. By using our services, you agree to our use of cookies.