17969782. METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR PROCESSING TARGET OBJECT simplified abstract (Dell Products L.P.)

From WikiPatents
Jump to navigation Jump to search

METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR PROCESSING TARGET OBJECT

Organization Name

Dell Products L.P.

Inventor(s)

Zijia Wang of WeiFang (CN)

Zhisong Liu of Shenzhen (CN)

Zhen Jia of Shanghai (CN)

METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR PROCESSING TARGET OBJECT - A simplified explanation of the abstract

This abstract first appeared for US patent application 17969782 titled 'METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR PROCESSING TARGET OBJECT

Simplified Explanation

The abstract describes a method, electronic device, and computer program product for processing a target object by acquiring non-video feature vectors based on speech and text inputs, generating video, speech, and text features, and generating processing parameters for the target object.

  • The method involves acquiring an initial non-video feature vector based on speech and text inputs.
  • If no video input is received, a default feature vector is taken as an initial video feature vector.
  • Video, speech, and text features are generated based on the initial non-video feature vector and the initial video feature vector.
  • A processing parameter for the target object is generated based on the video, speech, and text features, including emotion, attribute, and pose parameters.

Potential Applications

This technology could be applied in various fields such as:

  • Speech recognition systems
  • Video analysis software
  • Emotion detection in human-computer interaction

Problems Solved

This technology helps in:

  • Integrating multiple types of inputs for better object processing
  • Enhancing the accuracy of feature extraction in multimedia data analysis

Benefits

The benefits of this technology include:

  • Improved object processing efficiency
  • Enhanced performance in recognizing and analyzing multimedia data

Potential Commercial Applications

Potential commercial applications of this technology could include:

  • Multimedia content analysis tools
  • Virtual assistant systems
  • Emotion recognition software

Possible Prior Art

One possible prior art could be the use of machine learning algorithms for feature extraction in multimedia data analysis.

Unanswered Questions

How does this technology handle privacy concerns related to processing sensitive data inputs?

This technology may raise concerns about privacy and data security, especially when processing sensitive information such as personal speech inputs. Implementing robust encryption and data anonymization techniques could address these concerns.

What are the limitations of this technology in terms of scalability and real-time processing?

Scalability and real-time processing capabilities are crucial for applications requiring quick and efficient object processing. It is essential to evaluate the performance of this technology in handling large datasets and time-sensitive tasks to determine its limitations in scalability and real-time processing.


Original Abstract Submitted

Embodiments of the present disclosure relate to a method, an electronic device, and a computer program product for processing a target object. The method includes acquiring an initial non-video feature vector on the basis of at least one input of a received speech input and text input. The method further includes taking, in response to not receiving a video input, a default feature vector as an initial video feature vector corresponding to the video input. The method further includes generating a video feature, a speech feature, and a text feature on the basis of the initial non-video feature vector and the initial video feature vector. The method further includes generating a processing parameter for a target object on the basis of the video feature, the speech feature, and the text feature, wherein the processing parameter includes at least one of an emotion parameter, an attribute parameter, and a pose parameter.