Microsoft technology licensing, llc (20250006178). DIAGNOSTIC SERVICE IN SPEECH RECOGNITION

From WikiPatents
Jump to navigation Jump to search

DIAGNOSTIC SERVICE IN SPEECH RECOGNITION

Organization Name

microsoft technology licensing, llc

Inventor(s)

Haoxuan Li of Beijing (CN)

Rui Jiang of Beijing (CN)

Yang Liu of Beijing (CN)

Edward C Lin of Beijing (CN)

Lei Sun of Beijing (CN)

Che Zhao of Beijing (CN)

DIAGNOSTIC SERVICE IN SPEECH RECOGNITION

This abstract first appeared for US patent application 20250006178 titled 'DIAGNOSTIC SERVICE IN SPEECH RECOGNITION



Original Abstract Submitted

systems and methods are provided for identifying targeted datasets that are configured to facilitate an improvement in the accuracy of an acoustic model included in the automatic speech recognition system. systems obtain a obtain a test dataset comprising (i) audio data having natural speech utterances and (ii) a transcription of the natural speech utterances. systems generate a text-to-speech dataset comprising audio data having synthesized speech utterances based on the transcription of the natural speech utterances. systems apply the test dataset and the text-to-speech dataset to the acoustic model to obtain a first acoustic model output and a second acoustic model output, respectively. systems identify a first set of errors in the first acoustic model output and a second set of errors in the second acoustic model output. finally, based on comparing the first set of errors and the second set of errors, an acoustic model error ratio is generated.