Microsoft Technology Licensing, LLC (20250006178). DIAGNOSTIC SERVICE IN SPEECH RECOGNITION
DIAGNOSTIC SERVICE IN SPEECH RECOGNITION
Organization Name
Microsoft Technology Licensing, LLC
Inventor(s)
DIAGNOSTIC SERVICE IN SPEECH RECOGNITION
This abstract first appeared for US patent application 20250006178 titled 'DIAGNOSTIC SERVICE IN SPEECH RECOGNITION'.
Original Abstract Submitted
Systems and methods are provided for identifying targeted datasets that are configured to facilitate an improvement in the accuracy of an acoustic model included in an automatic speech recognition system. Systems obtain a test dataset comprising (i) audio data having natural speech utterances and (ii) a transcription of the natural speech utterances. Systems generate a text-to-speech dataset comprising audio data having synthesized speech utterances based on the transcription of the natural speech utterances. Systems apply the test dataset and the text-to-speech dataset to the acoustic model to obtain a first acoustic model output and a second acoustic model output, respectively. Systems identify a first set of errors in the first acoustic model output and a second set of errors in the second acoustic model output. Finally, based on comparing the first set of errors and the second set of errors, an acoustic model error ratio is generated.
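The comparison step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how errors are counted or how the acoustic model error ratio is defined, so everything here is an assumption made for illustration: errors are counted as word-level edit distance against the reference transcription, and the ratio is taken as synthesized-speech errors divided by natural-speech errors. The function names `word_errors` and `acoustic_model_error_ratio` are hypothetical, and the model outputs are assumed to be already decoded to text.

```python
from typing import Sequence


def word_errors(reference: str, hypothesis: str) -> int:
    """Count word-level errors as Levenshtein distance
    (substitutions + insertions + deletions). Assumed error metric."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    # prev[j] holds the distance between the first i-1 reference words
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


def acoustic_model_error_ratio(
    references: Sequence[str],
    natural_outputs: Sequence[str],
    tts_outputs: Sequence[str],
) -> float:
    """Compare errors on natural speech with errors on synthesized speech.

    Assumed definition (not taken from the abstract):
    ratio = total TTS errors / total natural-speech errors.
    """
    natural_errors = sum(
        word_errors(ref, hyp) for ref, hyp in zip(references, natural_outputs)
    )
    tts_errors = sum(
        word_errors(ref, hyp) for ref, hyp in zip(references, tts_outputs)
    )
    if natural_errors == 0:
        # No natural-speech errors: the ratio is undefined unless both are zero.
        return float("inf") if tts_errors else 0.0
    return tts_errors / natural_errors


if __name__ == "__main__":
    refs = ["turn on the lights", "play some music"]
    natural = ["turn on the light", "play sum music"]    # first model output
    synthetic = ["turn on the lights", "play sum music"]  # second model output
    print(acoustic_model_error_ratio(refs, natural, synthetic))
```

Under this assumed definition, a ratio well below 1 would suggest that most errors appear only on natural speech, while a ratio near 1 would suggest the errors persist on clean synthesized audio of the same text. How the filing itself interprets the ratio is not stated in the abstract.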