Difference between revisions of "MULTI-TASK LEARNING FOR PERSONALIZED KEYWORD SPOTTING: abstract simplified (18153932)"

Revision as of 16:12, 1 October 2023

Systems and techniques are described for processing audio data using personalized keyword spotting through multi-task learning (PK-MTL). This involves obtaining an audio sample and generating representations of both a keyword and a speaker based on the sample. These representations are then used to determine a similarity score against a reference representation. Based on this score and a threshold, a keyword spotting output is generated, indicating whether the audio sample includes a target keyword or not.

Revision as of 15:59, 1 October 2023 (view source) Wikipatents (talk \| contribs) (Creating a new page)		Revision as of 16:12, 1 October 2023 (view source) Wikipatents (talk \| contribs) (Creating a new page) Newer edit →
Line 1:		Line 1:
−	Systems and techniques are described for processing audio data using personalized keyword spotting through multi-task learning (PK-MTL). This involves obtaining an audio sample and generating representations of both a keyword and a speaker based on the sample. A similarity score ~~is then calculated based on~~ a reference representation ~~and the representations of the keyword and speaker~~. ~~This~~ score ~~is compared against~~ a threshold ~~to determine if~~ the audio sample ~~contains the~~ target keyword.	+	Systems and techniques are described for processing audio data using personalized keyword spotting through multi-task learning (PK-MTL). This involves obtaining an audio sample and generating representations of both a keyword and a speaker based on the sample. These representations are then used to determine a similarity score against a reference representation. Based on this score and a threshold, a keyword spotting output is generated, indicating whether the audio sample includes a target keyword or not.

Difference between revisions of "MULTI-TASK LEARNING FOR PERSONALIZED KEYWORD SPOTTING: abstract simplified (18153932)"

Revision as of 16:12, 1 October 2023

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools