US Patent Application 17611436. SPEECH INSTRUCTION RECOGNITION METHOD, ELECTRONIC DEVICE, AND NON-TRANSIENT COMPUTER READABLE STORAGE MEDIUM simplified abstract


SPEECH INSTRUCTION RECOGNITION METHOD, ELECTRONIC DEVICE, AND NON-TRANSIENT COMPUTER READABLE STORAGE MEDIUM

Organization Name

BOE Technology Group Co., Ltd.

Inventor(s)

Shaoxun Su of Beijing (CN)

SPEECH INSTRUCTION RECOGNITION METHOD, ELECTRONIC DEVICE, AND NON-TRANSIENT COMPUTER READABLE STORAGE MEDIUM - A simplified explanation of the abstract

This abstract first appeared for US patent application 17611436, titled 'SPEECH INSTRUCTION RECOGNITION METHOD, ELECTRONIC DEVICE, AND NON-TRANSIENT COMPUTER READABLE STORAGE MEDIUM'.

Simplified Explanation

The patent application describes a method for recognizing speech instructions, together with a corresponding electronic device and non-transient computer readable storage medium. Here are the key points:

  • The method involves acquiring a target speech and processing it to obtain a target speech vector.
  • Speech recognition is performed on the target speech to obtain a target speech text.
  • The target speech text is processed to obtain a target text vector.
  • The target speech vector and the target text vector are both fed into a pre-trained instruction recognition model.
  • The output of the model is an instruction category corresponding to the target speech.
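
The steps above can be illustrated with a minimal Python sketch. It is not the applicant's implementation: the abstract does not say how the vectors are computed, how the two vectors are combined, or what kind of model is used. Everything named here is a hypothetical stand-in chosen for illustration: the toy acoustic features (mean energy and zero-crossing rate), the bag-of-words text vector, the concatenation of the two vectors, the hand-set linear weights that play the role of the "pre-trained" model, and the names VOCAB, CATEGORIES, WEIGHTS, and BIAS. The speech recognizer is passed in as a plain function so that any real recognizer could be substituted.

```python
from typing import Callable, List, Sequence

# Hypothetical vocabulary and instruction categories, for illustration only.
VOCAB = ["turn", "on", "off", "light", "music", "play"]
CATEGORIES = ["light_on", "light_off", "play_music"]

# Stand-in for a pre-trained model: one weight row per category over the
# concatenated features [energy, zero_crossing_rate, *bag_of_words].
# The speech-feature weights are zero here only to keep the toy example
# easy to follow; a trained model would normally learn non-zero values.
WEIGHTS = [
    [0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0],  # light_on
    [0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0],  # light_off
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 1.0],  # play_music
]
BIAS = [0.0, 0.0, 0.0]


def speech_to_vector(samples: Sequence[float]) -> List[float]:
    """Toy target speech vector: mean energy and zero-crossing rate."""
    if not samples:
        return [0.0, 0.0]
    energy = sum(s * s for s in samples) / len(samples)
    crossings = sum(
        1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0)
    )
    return [energy, crossings / len(samples)]


def text_to_vector(text: str) -> List[float]:
    """Toy target text vector: word counts over VOCAB."""
    words = text.lower().split()
    return [float(words.count(w)) for w in VOCAB]


def classify(
    speech_vec: Sequence[float],
    text_vec: Sequence[float],
    weights: Sequence[Sequence[float]],
    bias: Sequence[float],
) -> str:
    """Score each category on the concatenated vectors and pick the best."""
    features = list(speech_vec) + list(text_vec)  # assumed fusion: concatenation
    scores = [
        sum(w * x for w, x in zip(row, features)) + b
        for row, b in zip(weights, bias)
    ]
    return CATEGORIES[scores.index(max(scores))]


def recognize_instruction(
    samples: Sequence[float],
    recognizer: Callable[[Sequence[float]], str],
    weights: Sequence[Sequence[float]] = WEIGHTS,
    bias: Sequence[float] = BIAS,
) -> str:
    """Run the four steps: speech vector, recognized text, text vector, model."""
    speech_vec = speech_to_vector(samples)
    text = recognizer(samples)
    text_vec = text_to_vector(text)
    return classify(speech_vec, text_vec, weights, bias)


if __name__ == "__main__":
    # Hypothetical recognizer stub that always returns the same transcript.
    def fake_recognizer(_samples: Sequence[float]) -> str:
        return "turn on the light"

    print(recognize_instruction([0.1, -0.2, 0.3, -0.1], fake_recognizer))
```

Passing the recognizer as a function keeps the sketch independent of any particular speech recognition library. A practical system would presumably replace the toy feature functions with learned audio and text embeddings and the weight table with a trained classifier.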


Original Abstract Submitted

A speech instruction recognition method, an electronic device, and a non-transient computer readable storage medium. The speech instruction recognition method comprises: acquiring a target speech; processing the target speech to obtain a target speech vector corresponding to the target speech; performing speech recognition on the target speech to obtain a target speech text of the target speech, and processing the target speech text to obtain a target text vector corresponding to the target speech text; and inputting the target speech vector and the target text vector to a pre-trained instruction recognition model to obtain an instruction category corresponding to the target speech.