Zoom Video Communications, Inc. (20240404505). SYNTHESIZING MULTI-ACCENT SPEECH USING ADAPTIVE WEIGHTS
Organization Name
Zoom Video Communications, Inc.
Inventor(s)
Tuan Nam Nguyen of Karlsruhe (DE)
Alexander Waibel of Sammamish WA (US)
This abstract first appeared for US patent application 20240404505 titled 'SYNTHESIZING MULTI-ACCENT SPEECH USING ADAPTIVE WEIGHTS'.
Original Abstract Submitted
Techniques for synthesizing multi-accent speech using adaptive weights are provided. A computing system may receive a text input along with first information about a first accent. The computing system may access a first trained machine learning model, the first trained machine learning model trained to synthesize, from inputted text, waveforms representing speech. The computing device may apply one or more adaptive weights to the first trained machine learning model, the one or more adaptive weights characterizing the first accent. The computing device may then synthesize, using the first trained machine learning model with the applied one or more adaptive weights, a first waveform representing the text input, wherein the first waveform is characterized by the first accent.
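
The flow the abstract describes (receive text and accent information, access a trained model, apply accent-specific adaptive weights, synthesize a waveform) can be illustrated with a toy sketch. Nothing below comes from the application itself: ToySynthesizer, ACCENT_WEIGHTS, the accent names, and the choice to treat adaptive weights as multiplicative scales on two base parameters are all assumptions made purely for illustration, and the sine-tone output merely stands in for real speech.

```python
import math
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class ToySynthesizer:
    """Hypothetical stand-in for the trained text-to-waveform model."""
    pitch_hz: float = 120.0         # base "weight": fundamental frequency
    char_duration_s: float = 0.05   # base "weight": seconds per character
    sample_rate: int = 8000

    def with_adaptive_weights(self, weights: Dict[str, float]) -> "ToySynthesizer":
        # Assumption: adaptive weights rescale the base parameters.
        # The base model is left unchanged; an adapted copy is returned.
        return ToySynthesizer(
            pitch_hz=self.pitch_hz * weights.get("pitch_scale", 1.0),
            char_duration_s=self.char_duration_s * weights.get("duration_scale", 1.0),
            sample_rate=self.sample_rate,
        )

    def synthesize(self, text: str) -> List[float]:
        # Emit one short sine tone per character as a placeholder waveform.
        samples_per_char = round(self.char_duration_s * self.sample_rate)
        waveform: List[float] = []
        for i, ch in enumerate(text):
            freq = self.pitch_hz + (ord(ch) % 32)  # vary tone with the character
            for n in range(samples_per_char):
                t = (i * samples_per_char + n) / self.sample_rate
                waveform.append(math.sin(2 * math.pi * freq * t))
        return waveform


# Hypothetical per-accent adaptive weights (values are made up).
ACCENT_WEIGHTS: Dict[str, Dict[str, float]] = {
    "accent_a": {"pitch_scale": 1.0, "duration_scale": 1.0},
    "accent_b": {"pitch_scale": 1.1, "duration_scale": 1.2},
}


def synthesize_with_accent(text: str, accent: str,
                           base: ToySynthesizer = ToySynthesizer()) -> List[float]:
    """Apply the accent's adaptive weights to the base model, then synthesize."""
    adapted = base.with_adaptive_weights(ACCENT_WEIGHTS[accent])
    return adapted.synthesize(text)


if __name__ == "__main__":
    for accent in ACCENT_WEIGHTS:
        wave = synthesize_with_accent("hello", accent)
        print(accent, len(wave), "samples")
```

In this sketch a single base model serves every accent, and only the small set of adaptive weights changes per request, which is the general idea the abstract conveys; how the weights are actually represented and applied is not stated in the abstract.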