18107173. TEMPORAL SEMANTIC BOUNDARY LOSS FOR VIDEO SEMANTIC SEGMENTATION NETWORKS simplified abstract (Samsung Electronics Co., Ltd.)

From WikiPatents
Revision as of 08:41, 31 May 2024 by Wikipatents (talk | contribs) (Creating a new page)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

TEMPORAL SEMANTIC BOUNDARY LOSS FOR VIDEO SEMANTIC SEGMENTATION NETWORKS

Organization Name

Samsung Electronics Co., Ltd.

Inventor(s)

Mostafa El-khamy of San Diego CA (US)

Hai Su of San Diego CA (US)

Qingfeng Liu of San Diego CA (US)

TEMPORAL SEMANTIC BOUNDARY LOSS FOR VIDEO SEMANTIC SEGMENTATION NETWORKS - A simplified explanation of the abstract

This abstract first appeared for US patent application 18107173 titled 'TEMPORAL SEMANTIC BOUNDARY LOSS FOR VIDEO SEMANTIC SEGMENTATION NETWORKS

Simplified Explanation

The abstract describes a method for semantic segmentation in a network using input data from multiple frames to generate ground truth temporal semantic boundary maps and predicted temporal semantic boundary maps.

  • Receiving input data from multiple frames in a semantic segmentation network
  • Computing ground truth labels on the frames
  • Generating ground truth temporal semantic boundary maps from the labels
  • Generating predicted temporal semantic boundary maps based on the input data output
  • Determining loss based on the ground truth and predicted temporal semantic boundary maps

Potential Applications

This technology could be applied in various fields such as autonomous driving, video surveillance, and medical imaging for accurate object detection and tracking.

Problems Solved

This technology solves the problem of accurately segmenting and tracking objects over time in video data, which is crucial for tasks like object recognition and scene understanding.

Benefits

The benefits of this technology include improved accuracy in semantic segmentation, better object tracking over time, and enhanced performance in various computer vision applications.

Potential Commercial Applications

Potential commercial applications of this technology include video analytics software for security systems, autonomous vehicle systems, and medical imaging software for diagnostic purposes.

Possible Prior Art

One possible prior art for this technology could be the use of recurrent neural networks for temporal segmentation in video data.

Unanswered Questions

How does this method compare to existing techniques in terms of accuracy and efficiency?

The article does not provide a comparison with existing techniques in terms of accuracy and efficiency.

What are the limitations of this method in real-world applications?

The article does not discuss the limitations of this method in real-world applications.


Original Abstract Submitted

Disclosed is a method including receiving, in a semantic segmentation network, input data from a plurality of frames, computing a ground truth label on the plurality of frames, generating a ground truth temporal semantic boundary map from the ground truth label on the plurality of frames, generating a predicted temporal semantic boundary map based on an output of the input data, and determining a loss based on the ground truth temporal semantic boundary map and the predicted temporal semantic boundary map.