THE REGENTS OF THE UNIVERSITY OF CALIFORNIA (20250104727). SINGLE-CHANNEL SPEECH ENHANCEMENT USING ULTRASOUND
SINGLE-CHANNEL SPEECH ENHANCEMENT USING ULTRASOUND
Organization Name
THE REGENTS OF THE UNIVERSITY OF CALIFORNIA
Inventor(s)
Xinyu Zhang of San Diego CA US
SINGLE-CHANNEL SPEECH ENHANCEMENT USING ULTRASOUND
This abstract first appeared for US patent application 20250104727 titled 'SINGLE-CHANNEL SPEECH ENHANCEMENT USING ULTRASOUND
Original Abstract Submitted
in some embodiments, there is provided a method including receiving, by a machine learning model, first data corresponding to noisy audio including audio of a target speaker of interest proximate to a microphone; receiving, by the machine learning model, second data corresponding to articulatory gestures sensed by the microphone which also detected the noisy audio, wherein the second data corresponding to the articulatory gestures comprises one or more doppler data indicative of doppler associated with the articulatory gestures of the target speaker while speaking the audio; combining, by the machine learning model, a first set of features for the first data and a second set of features for the second data to form an output representative of the audio of the target speaker. related systems, methods, and articles of manufacture are also disclosed.