FEATURE RECONSTRUCTION USING NEURAL NETWORKS FOR VIDEO STREAMING SYSTEMS AND APPLICATIONS

Organization Name

Inventor(s)

FEATURE RECONSTRUCTION USING NEURAL NETWORKS FOR VIDEO STREAMING SYSTEMS AND APPLICATIONS - A simplified explanation of the abstract

This abstract first appeared for US patent application 17955754 titled 'FEATURE RECONSTRUCTION USING NEURAL NETWORKS FOR VIDEO STREAMING SYSTEMS AND APPLICATIONS

Simplified Explanation

The patent application relates to facial video encoding and reconstruction in ultra-low bandwidth settings, using automatically tracked feature cropping information to dynamically determine the size of a bounding shape for maintaining proportion in feature reconstruction.

Automatically tracked feature cropping information is used in video conferencing or streaming applications.
The size of the bounding shape for cropped region varies dynamically to maintain proportion for feature reconstruction.
The tracking scheme smooths sudden movements for more natural transitions between frames.
Tracking and cropping information can be embedded in the encoded bitstream as supplemental enhancement information for eventual decoding by a receiver.

Potential Applications

This technology can be applied in video conferencing, live streaming, surveillance systems, and virtual reality applications.

Problems Solved

1. Maintaining proportion in feature reconstruction in ultra-low bandwidth settings. 2. Smoothing sudden movements for natural transitions between frames.

Benefits

1. Improved video quality in low bandwidth environments. 2. Enhanced user experience in video communication. 3. Efficient use of bandwidth resources.

Potential Commercial Applications

"Facial Video Encoding and Reconstruction Technology for Ultra-Low Bandwidth Settings" can be utilized in telecommunication systems, video conferencing platforms, security and surveillance systems, and virtual reality applications.

Possible Prior Art

There may be prior art related to video encoding and tracking technologies used in video conferencing and streaming applications.

Unanswered Questions

How does this technology handle different lighting conditions during video encoding and reconstruction?

The patent abstract does not mention how the technology adapts to varying lighting conditions during the encoding and reconstruction process.

What impact does this technology have on the overall latency of the video streaming process?

The abstract does not address how this technology affects the latency of the video streaming process and whether it introduces any delays in transmitting the reconstructed video frames.

Original Abstract Submitted

Systems and methods relate to facial video encoding and reconstruction, particularly in ultra-low bandwidth settings. In embodiments, a video conferencing or other streaming application uses automatically tracked feature cropping information. A bounding shape size—used to identify the cropped region—varies and is dynamically determined to maintain a proportion for feature reconstruction, such as resizing in the event of a zoom-in on a face (or other feature of interest) or a zoom-out. The tracking scheme may be used to smooth sudden movements, including lateral ones, to generate more natural transitions between frames. Tracking and cropping information (e.g., size and position of the cropped region) may be embedded within an encoded bitstream as supplemental enhancement information (“SEI”), for eventual decoding by a receiver and for compositing a decoded face at a proper location in the applicable stream.

17955754. FEATURE RECONSTRUCTION USING NEURAL NETWORKS FOR VIDEO STREAMING SYSTEMS AND APPLICATIONS simplified abstract (NVIDIA Corporation)

Contents

FEATURE RECONSTRUCTION USING NEURAL NETWORKS FOR VIDEO STREAMING SYSTEMS AND APPLICATIONS

Organization Name

Inventor(s)