18047572. MEDIA SEGMENT PREDICTION FOR MEDIA GENERATION simplified abstract (QUALCOMM Incorporated)

From WikiPatents

MEDIA SEGMENT PREDICTION FOR MEDIA GENERATION

Organization Name

QUALCOMM Incorporated

Inventor(s)

Stephane Villette of San Diego CA (US)

Sen Li of San Diego CA (US)

Pravin Kumar Ramadas of San Diego CA (US)

Daniel Jared Sinder of San Diego CA (US)

MEDIA SEGMENT PREDICTION FOR MEDIA GENERATION - A simplified explanation of the abstract

This abstract first appeared for US patent application 18047572, titled 'MEDIA SEGMENT PREDICTION FOR MEDIA GENERATION'.

Simplified Explanation

The patent application describes a device that processes an input media stream in three stages: it extracts features from segments of the stream, classifies utterances, and matches segments to produce identifiers for media output segments.

  • The device includes processors that input segments of an input media stream into a feature extractor.
  • The processors then pass the output of the feature extractor to an utterance classifier to generate representations of utterance classes.
  • The output of the feature extractor and the utterance representations are passed to a segment matcher to produce media output segment identifiers.
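
The three stages above can be sketched as a small pipeline. This is only an illustrative sketch, not the method claimed in the application: the abstract names the stages but does not say which features, classifier, or matching rule are used. Every function name, the two toy features, the prototype-based classifier, the nearest-neighbor matcher, and the sample data below are assumptions made for illustration.

```python
import math
from typing import Dict, List, Sequence

# Hypothetical sketch: all names and modelling choices here are assumptions.
# The abstract only states the data flow between the three stages.


def extract_features(segment: Sequence[float]) -> List[float]:
    """Toy feature extractor: mean and mean absolute amplitude of a segment."""
    if not segment:
        raise ValueError("segment must contain at least one sample")
    n = len(segment)
    mean = sum(segment) / n
    mean_abs = sum(abs(x) for x in segment) / n
    return [mean, mean_abs]


def classify_utterance(
    features: Sequence[float], class_prototypes: Sequence[Sequence[float]]
) -> List[float]:
    """Toy utterance classifier: softmax over negative squared distances
    to one prototype per class. The returned probability vector stands in
    for the 'representation' of the utterance classes."""
    scores = [
        -sum((f - p) ** 2 for f, p in zip(features, proto))
        for proto in class_prototypes
    ]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def match_segment(
    features: Sequence[float],
    class_repr: Sequence[float],
    catalog: Dict[str, Sequence[float]],
) -> str:
    """Toy segment matcher: receives BOTH the extractor output and the class
    representation, and returns the identifier of the nearest catalog entry."""
    query = list(features) + list(class_repr)

    def squared_distance(key: Sequence[float]) -> float:
        return sum((q - k) ** 2 for q, k in zip(query, key))

    return min(catalog, key=lambda seg_id: squared_distance(catalog[seg_id]))


def predict_segment_id(
    segment: Sequence[float],
    class_prototypes: Sequence[Sequence[float]],
    catalog: Dict[str, Sequence[float]],
) -> str:
    """Run the three stages in the order the abstract describes."""
    features = extract_features(segment)
    class_repr = classify_utterance(features, class_prototypes)
    return match_segment(features, class_repr, catalog)


# Made-up sample data: two utterance classes ("quiet", "loud") and a catalog
# of two output segments keyed by [features..., class probabilities...].
CLASS_PROTOTYPES = [[0.0, 0.1], [0.0, 0.9]]
CATALOG = {
    "seg_quiet": [0.0, 0.1, 1.0, 0.0],
    "seg_loud": [0.0, 0.9, 0.0, 1.0],
}
QUIET_SEGMENT = [0.1, -0.1, 0.1, -0.1]
LOUD_SEGMENT = [0.9, -0.9, 0.8, -0.8]

if __name__ == "__main__":
    print(predict_segment_id(QUIET_SEGMENT, CLASS_PROTOTYPES, CATALOG))
    print(predict_segment_id(LOUD_SEGMENT, CLASS_PROTOTYPES, CATALOG))
```

The point the sketch preserves from the abstract is the data flow: the segment matcher takes the feature extractor's output as well as the utterance class representation, rather than the class representation alone.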

Potential Applications

This technology could be applied in speech recognition systems, language translation tools, and content recommendation engines.

Problems Solved

This technology aims to identify and categorize utterances within media streams and to map them to media output segments. The abstract does not quantify any resulting gain in accuracy or efficiency.

Benefits

The benefits of this technology include enhanced media stream processing, improved accuracy in utterance classification, and streamlined content identification in various applications.

Potential Commercial Applications

Potential commercial applications of this technology include speech-to-text services, virtual assistants, and personalized content delivery platforms.

Possible Prior Art

Possible prior art includes existing speech recognition systems that use similar feature extraction and utterance classification stages.

Unanswered Questions

How does this technology handle different languages in media streams?

The patent abstract does not mention language at all, so it is unclear how the device would process and classify utterances across different languages. Further details on language adaptation and multilingual support would be beneficial.

What is the computational efficiency of this device when processing large media streams?

The abstract does not mention the computational resources required for processing extensive media streams. Understanding the device's performance in handling large volumes of data would be essential for practical implementation.


Original Abstract Submitted

A device includes one or more processors configured to input one or more segments of an input media stream into a feature extractor. The one or more processors are further configured to pass an output of the feature extractor into an utterance classifier to produce at least one representation of at least one utterance class of a plurality of utterance classes. The one or more processors are further configured to pass the output of the feature extractor and the at least one representation into a segment matcher to produce a media output segment identifier.