Jump to content

Deep Media Inc. (20240420681). SYSTEM AND METHOD OF PREPROCESSING INPUTS FOR CROSS-LANGUAGE VOCAL SYNTHESIS

From WikiPatents

SYSTEM AND METHOD OF PREPROCESSING INPUTS FOR CROSS-LANGUAGE VOCAL SYNTHESIS

Organization Name

Deep Media Inc.

Inventor(s)

Rijul Gupta of Oakland CA (US)

SYSTEM AND METHOD OF PREPROCESSING INPUTS FOR CROSS-LANGUAGE VOCAL SYNTHESIS

This abstract first appeared for US patent application 20240420681 titled 'SYSTEM AND METHOD OF PREPROCESSING INPUTS FOR CROSS-LANGUAGE VOCAL SYNTHESIS



Original Abstract Submitted

a system and method for synthesizing audio for translated text. the system and method include modifying an input text label to improve machine learning model outputs. in some embodiments, the text labels are modified using a phoneme generator configured to convert the raw text to phonemes. in some embodiments, the text labels are modified using a spacing character generator configured to input characters into the text to convey a gap in speech. some embodiments include a pacing character generator to input characters into the text to convey the pace at which a phoneme, word, or sentence is spoken. some embodiments include a non-verbal character generator to input characters into the text to convey when non-verbal speech occurs.

Cookies help us deliver our services. By using our services, you agree to our use of cookies.