Google llc (20240428786). Transducer-Based Streaming Deliberation for Cascaded Encoders

From WikiPatents
Jump to navigation Jump to search

Transducer-Based Streaming Deliberation for Cascaded Encoders

Organization Name

google llc

Inventor(s)

Ke Hu of Stony Brook NY (US)

Tara N. Sainath of Jersey City NJ (US)

Arun Narayanan of Milpitas CA (US)

Ruoming Pang of New York NY (US)

Trevor Strohman of Mountain View CA (US)

Transducer-Based Streaming Deliberation for Cascaded Encoders

This abstract first appeared for US patent application 20240428786 titled 'Transducer-Based Streaming Deliberation for Cascaded Encoders



Original Abstract Submitted

a method includes receiving a sequence of acoustic frames and generating, by a first encoder, a first higher order feature representation for a corresponding acoustic frame in the sequence of acoustic frames. the method also includes generating, by a first pass transducer decoder, a first pass speech recognition hypothesis for a corresponding first higher order feature representation and generating, by a text encoder, a text encoding for a corresponding first pass speech recognition hypothesis. the method also includes generating, by a second encoder, a second higher order feature representation for a corresponding first higher order feature representation. the method also includes generating, by a second pass transducer decoder, a second pass speech recognition hypothesis using a corresponding second higher order feature representation and a corresponding text encoding.