NVIDIA Corporation (20240428020). REVERSIBLE SPEECH-TO-SPEECH TRANSLATION FOR CONVERSATIONAL AI SYSTEMS AND APPLICATIONS

From WikiPatents
Revision as of 14:35, 29 December 2024 by Wikipatents

Organization Name

NVIDIA Corporation

Inventor(s)

Xianchao Wu of Tokyo (JP)

Simon See Chong Wee of West Coast Rise (SG)

This abstract first appeared for US patent application 20240428020, titled 'REVERSIBLE SPEECH-TO-SPEECH TRANSLATION FOR CONVERSATIONAL AI SYSTEMS AND APPLICATIONS'.

Original Abstract Submitted

Disclosed are apparatuses, systems, and techniques that may use machine learning for reversible translations of speech utterances. The techniques include training and using duplex neural networks (NNs) having a first subnetwork and a second subnetwork that are mirror images of each other. Training data for training the duplex NNs may include a target output that includes a first speech utterance in a first language, a first training input that includes the target output distorted by a noise, and a second training input that includes a second speech utterance in a second language. The duplex NNs may be trained to identify, using the first training input and the second training input, at least one of the target output or the first noise.
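
The abstract describes two ideas that a small sketch may help make concrete: how one training example is assembled (a target output, a noise-distorted copy of it, and an utterance in a second language), and what it can mean for two subnetworks to be mirror images of each other. The Python below is only an illustration under stated assumptions and is not taken from the patent application. Utterances are stood in for by short lists of floats, the noise is assumed to be additive Gaussian, and the names `DuplexTrainingExample`, `make_example`, and `AdditiveCoupling` are hypothetical. The additive coupling block is one well-known way to build a reversible layer; the abstract does not say which reversible architecture is used.

```python
import random
from dataclasses import dataclass
from typing import Callable, List, Tuple

Vector = List[float]


@dataclass
class DuplexTrainingExample:
    """One training example with the parts named in the abstract (hypothetical layout)."""
    target_output: Vector  # first speech utterance (first language), as toy features
    first_input: Vector    # the target output distorted by noise
    second_input: Vector   # second speech utterance (second language), as toy features
    noise: Vector          # the noise itself, kept so it can also serve as a label


def make_example(first_utt: Vector, second_utt: Vector,
                 noise_std: float, rng: random.Random) -> DuplexTrainingExample:
    """Build one example. Additive Gaussian noise is an assumption of this sketch."""
    noise = [rng.gauss(0.0, noise_std) for _ in first_utt]
    noisy = [x + n for x, n in zip(first_utt, noise)]
    return DuplexTrainingExample(list(first_utt), noisy, list(second_utt), noise)


class AdditiveCoupling:
    """Toy reversible block: the reverse pass undoes the forward pass step by step."""

    def __init__(self, shift_fn: Callable[[Vector], Vector]):
        # shift_fn stands in for a learned subnetwork; it need not be invertible.
        self.shift_fn = shift_fn

    def forward(self, a: Vector, b: Vector) -> Tuple[Vector, Vector]:
        # a passes through unchanged; b is shifted by a function of a.
        shift = self.shift_fn(a)
        return a, [bi + si for bi, si in zip(b, shift)]

    def reverse(self, a: Vector, b: Vector) -> Tuple[Vector, Vector]:
        # Same shift, opposite sign: the mirror image of forward().
        shift = self.shift_fn(a)
        return a, [bi - si for bi, si in zip(b, shift)]


# Usage: build one example, run it forward, then recover the input in reverse.
rng = random.Random(0)
example = make_example([0.1, 0.2, 0.3], [0.5, 0.4, 0.6], noise_std=0.05, rng=rng)
block = AdditiveCoupling(lambda a: [2.0 * x for x in a])
a_out, b_out = block.forward(example.second_input, example.first_input)
_, recovered = block.reverse(a_out, b_out)
```

In this sketch, `recovered` matches `example.first_input` up to floating-point rounding, which is the property that lets a single set of parameters serve both translation directions. A real system would replace the float lists with acoustic features and the fixed `shift_fn` with a trained network.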