
Microsoft Technology Licensing, LLC (20240420404). GENERATING ENHANCED VIDEO MESSAGES FROM CAPTURED SPEECH

From WikiPatents

GENERATING ENHANCED VIDEO MESSAGES FROM CAPTURED SPEECH

Organization Name

Microsoft Technology Licensing, LLC

Inventor(s)

Mustafa Kasap of Kenmore WA (US)

Jason Hogg of Kirkland WA (US)

This abstract first appeared for US patent application 20240420404, titled 'GENERATING ENHANCED VIDEO MESSAGES FROM CAPTURED SPEECH'.



Original Abstract Submitted

This disclosure describes a speech-to-video system that automatically generates enhanced short-form videos from speech. For example, the speech-to-video system utilizes different speech processing models to analyze speech in audio input and determine contextual features. Additionally, the speech-to-video system utilizes various video generation models to create enhanced short-form videos using text summaries, audio contexts, user information, video parameter inputs, and/or other user contexts. In some cases, the speech-to-video system leverages components of a mobile core network to efficiently generate and deliver these features to mobile devices.
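
As a rough illustration of the two-stage flow the abstract describes (speech analysis feeding video generation), the following Python sketch may help. It is not taken from the patent application: every name in it (`analyze_speech`, `build_video_request`, `generate_video`, the dataclasses, and the default parameters) is hypothetical, and the "models" are trivial stand-ins that work on an already-transcribed string rather than on audio. The mobile core network delivery path is not modeled.

```python
from dataclasses import dataclass, field


@dataclass
class SpeechAnalysis:
    """Output of the hypothetical speech-processing stage."""
    transcript: str
    summary: str
    audio_context: dict


@dataclass
class VideoRequest:
    """Inputs passed to the hypothetical video-generation stage."""
    summary: str
    audio_context: dict
    user_info: dict = field(default_factory=dict)
    video_params: dict = field(default_factory=dict)


def analyze_speech(transcript: str, max_words: int = 12) -> SpeechAnalysis:
    # Stand-in for speech processing models: a real system would run
    # recognition and context models on audio. Here the "summary" is just
    # the first max_words words, and the "context" is two toy features.
    words = transcript.split()
    summary = " ".join(words[:max_words])
    if len(words) > max_words:
        summary += "..."
    context = {
        "word_count": len(words),
        "is_question": transcript.strip().endswith("?"),
    }
    return SpeechAnalysis(transcript, summary, context)


def build_video_request(analysis, user_info=None, video_params=None) -> VideoRequest:
    # Merge caller-supplied video parameters over assumed defaults.
    params = {"duration_s": 15, "aspect_ratio": "9:16"}
    params.update(video_params or {})
    return VideoRequest(
        summary=analysis.summary,
        audio_context=analysis.audio_context,
        user_info=dict(user_info or {}),
        video_params=params,
    )


def generate_video(request: VideoRequest) -> dict:
    # Stand-in for video generation models: returns a descriptor of the
    # video that would be rendered instead of producing any frames.
    style = "question" if request.audio_context.get("is_question") else "statement"
    return {"caption": request.summary, "style": style, **request.video_params}


if __name__ == "__main__":
    analysis = analyze_speech("Are we still meeting for lunch tomorrow?")
    request = build_video_request(analysis, {"name": "Ada"}, {"duration_s": 10})
    print(generate_video(request))
```

Keeping the analysis result and the video request as separate structures mirrors the abstract's split between models that determine contextual features and models that consume text summaries, audio contexts, user information, and video parameter inputs.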
