Analog Devices, Inc. (20250061911). METHOD AND SYSTEM FOR MUTLIPLE TIME RESOLUTION AUDIO PROCESSING
METHOD AND SYSTEM FOR MUTLIPLE TIME RESOLUTION AUDIO PROCESSING
Organization Name
Inventor(s)
Johannes Traa of Medford MA (US)
Atulya Yellepeddi of Medford MA (US)
Donald F. Porges of Watertown MA (US)
METHOD AND SYSTEM FOR MUTLIPLE TIME RESOLUTION AUDIO PROCESSING
This abstract first appeared for US patent application 20250061911 titled 'METHOD AND SYSTEM FOR MUTLIPLE TIME RESOLUTION AUDIO PROCESSING
Original Abstract Submitted
aspects of the present disclosure provided a method for voice control that includes transforming, using a short-time fourier transform (stft) applied to data in each window aligned across each input channel of the multichannel audio stream, the multichannel audio stream into a complex valued frequency-domain representation. for a current window, the method further includes: updating a first complex-valued covariance matrix corresponding to a slowly-adapting beamformer and forming a single-channel denoised estimate for each frequency band in the stft; calculating a voice activity detection (vad) estimate for each frequency band in the stft by comparing a magnitude of the single-channel denoised estimate to a magnitude of each input channel of the multichannel audio stream; and selectively updating or refraining from updating, responsive to the vad estimate respectively indicating a presence or an absence of speech, a second complex-valued covariance matrix corresponding to a quickly-adapting beamformer.