SPEECH RECOGNITION WITH SEQUENCE-TO-SEQUENCE MODELS

A method for performing speech recognition using sequence-to-sequence models includes receiving audio data for an utterance and providing features indicative of acoustic characteristics of the utterance as input to an encoder. The method also includes processing an output of the encoder using an attender to generate a context vector, generating speech recognition scores using the context vector and a decoder trained using a training process, and generating a transcription for the utterance using word elements selected based on the speech recognition scores. The transcription is provided as an output of the ASR system.

18815200. SPEECH RECOGNITION WITH SEQUENCE-TO-SEQUENCE MODELS (GOOGLE LLC)

Contents

SPEECH RECOGNITION WITH SEQUENCE-TO-SEQUENCE MODELS

Organization Name

Inventor(s)

SPEECH RECOGNITION WITH SEQUENCE-TO-SEQUENCE MODELS

Original Abstract Submitted

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools