Difference between revisions of "MULTI-TASK LEARNING FOR PERSONALIZED KEYWORD SPOTTING: abstract simplified (18153932)"

Revision as of 16:13, 1 October 2023

Systems and techniques are described for processing audio data using personalized keyword spotting through multi-task learning (PK-MTL). This involves obtaining an audio sample and generating representations of both a keyword and a speaker based on the sample. The speaker is associated with the keyword. A similarity score is calculated based on a reference representation and either the keyword representation or the speaker representation. This score is then analyzed against a threshold to determine if the audio sample includes the target keyword.

Revision as of 16:12, 1 October 2023 (view source) Wikipatents (talk \| contribs) (Creating a new page) ← Older edit		Revision as of 16:13, 1 October 2023 (view source) Wikipatents (talk \| contribs) (Creating a new page) Newer edit →
Line 1:		Line 1:
−	Systems and techniques are described for processing audio data using personalized keyword spotting through multi-task learning (PK-MTL). This involves obtaining an audio sample and generating representations of both a keyword and a speaker based on the sample. ~~These representations are then used to determine a~~ similarity score ~~against~~ a reference representation. ~~Based on this~~ score ~~and~~ a threshold~~, a keyword spotting output is generated, indicating whether~~ the audio sample includes a target keyword ~~or not~~.	+	Systems and techniques are described for processing audio data using personalized keyword spotting through multi-task learning (PK-MTL). This involves obtaining an audio sample and generating representations of both a keyword and a speaker based on the sample. The speaker is associated with the keyword. A similarity score is calculated based on a reference representation and either the keyword representation or the speaker representation. This score is then analyzed against a threshold to determine if the audio sample includes the target keyword.

Difference between revisions of "MULTI-TASK LEARNING FOR PERSONALIZED KEYWORD SPOTTING: abstract simplified (18153932)"

Revision as of 16:13, 1 October 2023

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools