Google llc (20240428786). Transducer-Based Streaming Deliberation for Cascaded Encoders
Contents
Transducer-Based Streaming Deliberation for Cascaded Encoders
Organization Name
Inventor(s)
Tara N. Sainath of Jersey City NJ (US)
Arun Narayanan of Milpitas CA (US)
Ruoming Pang of New York NY (US)
Trevor Strohman of Mountain View CA (US)
Transducer-Based Streaming Deliberation for Cascaded Encoders
This abstract first appeared for US patent application 20240428786 titled 'Transducer-Based Streaming Deliberation for Cascaded Encoders
Original Abstract Submitted
a method includes receiving a sequence of acoustic frames and generating, by a first encoder, a first higher order feature representation for a corresponding acoustic frame in the sequence of acoustic frames. the method also includes generating, by a first pass transducer decoder, a first pass speech recognition hypothesis for a corresponding first higher order feature representation and generating, by a text encoder, a text encoding for a corresponding first pass speech recognition hypothesis. the method also includes generating, by a second encoder, a second higher order feature representation for a corresponding first higher order feature representation. the method also includes generating, by a second pass transducer decoder, a second pass speech recognition hypothesis using a corresponding second higher order feature representation and a corresponding text encoding.