GOOGLE LLC (20240420692). Multilingual Re-Scoring Models for Automatic Speech Recognition
Organization Name
GOOGLE LLC
Inventor(s)
Neeraj Gaur of Mountain View CA (US)
Tongzhou Chen of Mountain View CA (US)
Ehsan Variani of Mountain View CA (US)
Bhuvana Ramabhadran of Mt. Kisco NY (US)
Parisa Haghani of Mountain View CA (US)
Pedro J. Moreno Mengibar of Jersey City NJ (US)
This abstract first appeared for US patent application 20240420692 titled 'Multilingual Re-Scoring Models for Automatic Speech Recognition'.
Original Abstract Submitted
A method includes receiving a sequence of acoustic frames extracted from audio data corresponding to an utterance. During a first pass, the method includes processing the sequence of acoustic frames to generate N candidate hypotheses for the utterance. During a second pass, and for each candidate hypothesis, the method includes: generating a respective un-normalized likelihood score; generating a respective external language model score; generating a standalone score that models prior statistics of the corresponding candidate hypothesis; and generating a respective overall score for the candidate hypothesis based on the un-normalized likelihood score, the external language model score, and the standalone score. The method also includes selecting the candidate hypothesis having the highest respective overall score from among the N candidate hypotheses as a final transcription of the utterance.
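The second-pass selection step described in the abstract can be illustrated with a minimal Python sketch. The abstract says only that the overall score is "based on" the three component scores, so the weighted sum used here, the weight values, and every name (`ScoredHypothesis`, `overall_score`, `select_best`, `lm_weight`, `standalone_weight`) are assumptions made for illustration, not the formula or API of the application. The component scores are taken as already computed, log-domain numbers.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class ScoredHypothesis:
    """One first-pass candidate with its three second-pass scores (hypothetical container)."""
    text: str
    likelihood: float   # un-normalized likelihood score
    lm: float           # external language model score
    standalone: float   # standalone score modeling prior statistics


def overall_score(hyp: ScoredHypothesis,
                  lm_weight: float = 0.5,
                  standalone_weight: float = 0.5) -> float:
    # Assumption: a weighted sum of the three scores. The abstract does not
    # state how they are combined; the weights (and their signs) are illustrative.
    return hyp.likelihood + lm_weight * hyp.lm + standalone_weight * hyp.standalone


def select_best(hyps: Sequence[ScoredHypothesis],
                lm_weight: float = 0.5,
                standalone_weight: float = 0.5) -> ScoredHypothesis:
    """Return the candidate with the highest overall score among the N hypotheses."""
    if not hyps:
        raise ValueError("at least one candidate hypothesis is required")
    return max(hyps, key=lambda h: overall_score(h, lm_weight, standalone_weight))


if __name__ == "__main__":
    # Made-up scores for two candidates, purely to exercise the functions.
    candidates = [
        ScoredHypothesis("play some jazz", likelihood=-1.0, lm=-2.0, standalone=-1.0),
        ScoredHypothesis("play sum jazz", likelihood=-1.5, lm=-0.5, standalone=-0.5),
    ]
    print(select_best(candidates).text)
```

In this sketch the weights are plain function arguments so that different combinations can be tried; how the application actually weights or normalizes the scores is not stated in the abstract.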