BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD. (20250078839). SPEECH RECOGNITION
Contents
SPEECH RECOGNITION
Organization Name
BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Inventor(s)
SPEECH RECOGNITION
This abstract first appeared for US patent application 20250078839 titled 'SPEECH RECOGNITION
Original Abstract Submitted
a speech recognition method and a method for training a deep learning model are provided. the speech recognition method includes: obtaining a first speech feature of a speech to-be-recognized, which includes a plurality of speech segment features corresponding to a plurality of speech segments; decoding the first speech feature using a first decoder to obtain a plurality of first decoding results corresponding to a plurality of the words, indicating a first recognition result of words; extracting a second speech feature from the first speech feature based on first a priori information, which includes the plurality of first decoding results, and the second speech feature includes first word-level audio features corresponding to the plurality of words; and decoding the second speech feature using a second decoder to obtain a plurality of second decoding results corresponding to the plurality of words, indicating a second recognition result of the word.