Jump to content

BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD. (20250078839). SPEECH RECOGNITION

From WikiPatents

SPEECH RECOGNITION

Organization Name

BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor(s)

Xiaoyin Fu of BEIJING (CN)

Qiguang Zang of BEIJING (CN)

Fenfen Sheng of BEIJING (CN)

Haifeng Wang of BEIJING (CN)

Lei Jia of BEIJING (CN)

SPEECH RECOGNITION

This abstract first appeared for US patent application 20250078839 titled 'SPEECH RECOGNITION

Original Abstract Submitted

a speech recognition method and a method for training a deep learning model are provided. the speech recognition method includes: obtaining a first speech feature of a speech to-be-recognized, which includes a plurality of speech segment features corresponding to a plurality of speech segments; decoding the first speech feature using a first decoder to obtain a plurality of first decoding results corresponding to a plurality of the words, indicating a first recognition result of words; extracting a second speech feature from the first speech feature based on first a priori information, which includes the plurality of first decoding results, and the second speech feature includes first word-level audio features corresponding to the plurality of words; and decoding the second speech feature using a second decoder to obtain a plurality of second decoding results corresponding to the plurality of words, indicating a second recognition result of the word.

(Ad) Transform your business with AI in minutes, not months

Custom AI strategy tailored to your specific industry needs
Step-by-step implementation with measurable ROI
5-minute setup that requires zero technical skills
Get your AI playbook

Trusted by 1,000+ companies worldwide

Cookies help us deliver our services. By using our services, you agree to our use of cookies.