Microsoft Technology Licensing, LLC (20240420404). GENERATING ENHANCED VIDEO MESSAGES FROM CAPTURED SPEECH
GENERATING ENHANCED VIDEO MESSAGES FROM CAPTURED SPEECH
Organization Name
Microsoft Technology Licensing, LLC
Inventor(s)
Mustafa Kasap of Kenmore WA (US)
Jason Hogg of Kirkland WA (US)
GENERATING ENHANCED VIDEO MESSAGES FROM CAPTURED SPEECH
This abstract first appeared for US patent application 20240420404 titled 'GENERATING ENHANCED VIDEO MESSAGES FROM CAPTURED SPEECH
Original Abstract Submitted
this disclosure describes a speech-to-video system that automatically generates enhanced short-form videos from speech. for example, the speech-to-video system utilizes different speech processing models to analyze speech in audio input and determine contextual features. additionally, the speech-to-video system utilizes various video generation models to create enhanced short-form videos using text summaries, audio contexts, user information, video parameter inputs, and/or other user contexts. in some cases, the speech-to-video system leverages components of a mobile core network to efficiently generate and deliver these features to mobile devices.