Difference between revisions of "MULTI-TASK LEARNING FOR PERSONALIZED KEYWORD SPOTTING: abstract simplified (18153932)"
Jump to navigation
Jump to search
Wikipatents (talk | contribs) (Creating a new page) |
Wikipatents (talk | contribs) (Creating a new page) |
||
Line 1: | Line 1: | ||
− | Systems and techniques are described for processing audio data using personalized keyword spotting through multi-task learning (PK-MTL). This involves obtaining an audio sample and generating representations of both a keyword and a speaker based on the sample. | + | Systems and techniques are described for processing audio data using personalized keyword spotting through multi-task learning (PK-MTL). This involves obtaining an audio sample and generating representations of both a keyword and a speaker based on the sample. These representations are then used to determine a similarity score against a reference representation. Based on this score and a threshold, a keyword spotting output is generated, indicating whether the audio sample includes a target keyword or not. |
Revision as of 16:12, 1 October 2023
Systems and techniques are described for processing audio data using personalized keyword spotting through multi-task learning (PK-MTL). This involves obtaining an audio sample and generating representations of both a keyword and a speaker based on the sample. These representations are then used to determine a similarity score against a reference representation. Based on this score and a threshold, a keyword spotting output is generated, indicating whether the audio sample includes a target keyword or not.