17884118. METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR DISTRIBUTED DATA PROCESSING simplified abstract (Dell Products L.P.)

From WikiPatents

METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR DISTRIBUTED DATA PROCESSING

Organization Name

Dell Products L.P.

Inventor(s)

Jinpeng Liu of Shanghai (CN)

Zijia Wang of Weifang (CN)

Zhen Jia of Shanghai (CN)

Jiacheng Ni of Shanghai (CN)

METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR DISTRIBUTED DATA PROCESSING - A simplified explanation of the abstract

This abstract first appeared for US patent application 17884118, titled 'METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR DISTRIBUTED DATA PROCESSING'.

Simplified Explanation

Embodiments of the present disclosure provide a method, an electronic device, and a computer program product for distributed data processing. The method obtains an input for a data processing task that is based on a multi-head attention mechanism. The task consists of a first subtask and a second subtask, each corresponding to a different attention head in that mechanism. The input is transmitted to two dedicated computing resources, one assigned to each subtask. The first subtask is performed on the first dedicated computing resource and the second subtask on the second, and the output of the data processing task is obtained from performing both subtasks on the input.

  • The method applies to data processing tasks built on a multi-head attention mechanism.
  • The task, not the input, is divided into subtasks, one per attention head; the same input is sent to every subtask.
  • A dedicated computing resource is assigned to each subtask.
  • Because each subtask has its own computing resource, the subtasks can be performed in parallel.
  • The output of the data processing task is obtained from the results of both subtasks.
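
The steps above can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `attention_head` and `run_distributed` are hypothetical, worker threads stand in for the dedicated computing resources, and the per-head results are combined by simple concatenation, which the abstract does not specify. The final output projection used in many multi-head attention models is left out for brevity.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention_head(x, w_q, w_k, w_v):
    """One subtask: scaled dot-product attention for a single head."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    scale = math.sqrt(len(k[0]))
    k_t = [list(col) for col in zip(*k)]
    scores = matmul(q, k_t)
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, v)


def run_distributed(x, head_params):
    """Send the same input to one worker per head, then join the results.

    Assumption: a thread per head is only a stand-in for a dedicated
    computing resource such as a separate accelerator or node.
    """
    with ThreadPoolExecutor(max_workers=len(head_params)) as pool:
        futures = [pool.submit(attention_head, x, *params) for params in head_params]
        head_outputs = [f.result() for f in futures]
    # Concatenate the head outputs along the feature axis, row by row.
    return [sum((out[i] for out in head_outputs), []) for i in range(len(x))]


# Illustrative data: 3 tokens with 2 features each, and two heads.
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]
head_1 = (identity, identity, identity)
head_2 = (identity, identity, [[2.0, 0.0], [0.0, 2.0]])
output = run_distributed(x, [head_1, head_2])
```

Each call to `attention_head` receives the full input `x`, which mirrors the abstract's statement that the input is transmitted to both dedicated computing resources rather than being split between them.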

Potential applications of this technology:

  • Natural language processing tasks that require attention mechanisms.
  • Machine translation and language generation tasks.
  • Speech recognition and sentiment analysis applications.
  • Image and video processing tasks that involve attention-based models.

Problems solved by this technology:

  • Efficient handling of complex data processing tasks.
  • Improved performance of attention-based models by spreading attention heads across computing resources.
  • Scalability and parallel processing of subtasks.
  • Resource allocation and optimization in distributed computing environments.

Benefits of this technology:

  • Faster and more efficient data processing.
  • Improved performance of attention-based models.
  • Scalability and flexibility in distributed computing environments.
  • Enhanced resource utilization and optimization.


Original Abstract Submitted

Embodiments of the present disclosure provide a method, an electronic device, and a computer program product for distributed data processing. A method in one embodiment comprises obtaining an input for a data processing task based on a multi-head attention mechanism, the data processing task comprising a first subtask and a second subtask, the first subtask corresponding to a first attention head in the multi-head attention mechanism, and the second subtask corresponding to a second attention head in the multi-head attention mechanism. The method further comprises transmitting the input to a first dedicated computing resource and a second dedicated computing resource, the first dedicated computing resource corresponding to the first subtask, and the second dedicated computing resource corresponding to the second subtask, and performing the first subtask and the second subtask on the input for obtaining an output of the data processing task.
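
For readers unfamiliar with the term, the following is the standard textbook formulation of two-head attention, given here as background rather than as text from the application. The symbols are assumptions for illustration: X is the input, W_i^Q, W_i^K, and W_i^V are the projection matrices of head i, d_k is the key dimension, and W^O is the usual output projection. Each head_i depends only on X and its own weights, which is what allows each subtask to run on its own computing resource.

```latex
\begin{aligned}
\mathrm{head}_i &= \operatorname{softmax}\!\left(\frac{(X W_i^{Q})(X W_i^{K})^{\top}}{\sqrt{d_k}}\right) X W_i^{V}, \qquad i \in \{1, 2\},\\
\mathrm{Output} &= \operatorname{Concat}(\mathrm{head}_1, \mathrm{head}_2)\, W^{O}.
\end{aligned}
```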