Deep Media Inc. (20240420681). SYSTEM AND METHOD OF PREPROCESSING INPUTS FOR CROSS-LANGUAGE VOCAL SYNTHESIS
SYSTEM AND METHOD OF PREPROCESSING INPUTS FOR CROSS-LANGUAGE VOCAL SYNTHESIS
Organization Name
Inventor(s)
Rijul Gupta of Oakland CA (US)
SYSTEM AND METHOD OF PREPROCESSING INPUTS FOR CROSS-LANGUAGE VOCAL SYNTHESIS
This abstract first appeared for US patent application 20240420681 titled 'SYSTEM AND METHOD OF PREPROCESSING INPUTS FOR CROSS-LANGUAGE VOCAL SYNTHESIS
Original Abstract Submitted
a system and method for synthesizing audio for translated text. the system and method include modifying an input text label to improve machine learning model outputs. in some embodiments, the text labels are modified using a phoneme generator configured to convert the raw text to phonemes. in some embodiments, the text labels are modified using a spacing character generator configured to input characters into the text to convey a gap in speech. some embodiments include a pacing character generator to input characters into the text to convey the pace at which a phoneme, word, or sentence is spoken. some embodiments include a non-verbal character generator to input characters into the text to convey when non-verbal speech occurs.