Ford global technologies, llc (20240202533). GENERATING ARTIFICIAL VIDEO WITH CHANGED DOMAIN simplified abstract

From WikiPatents
Jump to navigation Jump to search

GENERATING ARTIFICIAL VIDEO WITH CHANGED DOMAIN

Organization Name

ford global technologies, llc

Inventor(s)

Akhil Perincherry of Dearborn MI (US)

Arpita Chand of Dearborn MI (US)

GENERATING ARTIFICIAL VIDEO WITH CHANGED DOMAIN - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240202533 titled 'GENERATING ARTIFICIAL VIDEO WITH CHANGED DOMAIN

Simplified Explanation

A computer system is described that can generate an output video of a scene from an input video and audio data by mapping them to a lower-dimensional latent space and maintaining temporal consistency between the input and output videos using the audio data.

Key Features and Innovation

  • The computer system includes an encoder to map the input video and audio data to a latent vector and a generator to generate an output video from the latent vector.
  • The encoder and generator are trained to ensure temporal consistency between the input and output videos by utilizing the audio data.
  • The output video is generated in a second domain, different from the first domain of the input video.

Potential Applications

This technology can be used in video editing, virtual reality applications, and content creation for movies and video games.

Problems Solved

  • Maintaining temporal consistency between input and output videos.
  • Generating high-quality output videos from input videos and audio data.

Benefits

  • Improved video generation process.
  • Enhanced user experience in virtual reality environments.
  • Efficient content creation for various media platforms.

Commercial Applications

Potential commercial uses include video editing software, virtual reality content creation tools, and movie production software. This technology can have significant market implications in the entertainment industry.

Questions about the Technology

How does this technology improve the video generation process?

This technology improves the video generation process by mapping input videos and audio data to a lower-dimensional latent space, ensuring temporal consistency between input and output videos.

What are the potential commercial applications of this technology?

The potential commercial applications of this technology include video editing software, virtual reality content creation tools, and movie production software.


Original Abstract Submitted

a computer includes a processor and a memory, and the memory stores instructions executable by the processor to receive an input video of a scene and audio data associated with the input video, the input video being in a first domain; execute an encoder to map the input video and the audio data to a latent vector in a lower-dimensional latent space; and execute a generator to generate an output video of the scene from the latent vector, the output video being in a second domain. the encoder and the generator are trained to maintain temporal consistency between the input video and the output video by using the audio data.