Phrase Extraction for ASR Models

Organization Name

google llc

Inventor(s)

Ehsan Amid of Mountain View CA (US)

Om Dipakbhai Thakkar of Sunnyvale CA (US)

Rajiv Mathews of Sunnyvale CA (US)

Francoise Beaufays of Mountain View CA (US)

Phrase Extraction for ASR Models - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240221772 titled 'Phrase Extraction for ASR Models

Simplified Explanation: The patent application describes a method for extracting phrases from audio data using an ASR model by modifying the audio data to obfuscate a specific phrase, then comparing the predicted transcription with the ground-truth transcription to identify if the phrase was leaked from the training data.

Key Features and Innovation:

Phrase extraction method for ASR models
Modification of audio data to obfuscate a particular phrase
Comparison of predicted transcription with ground-truth transcription
Detection of leaked phrases from training data

Potential Applications: This technology can be used in speech recognition systems to improve privacy and security by preventing the leakage of sensitive information during training.

Problems Solved: This technology addresses the issue of unintentional leakage of sensitive phrases from training data used to train ASR models.

Benefits:

Enhanced privacy and security in speech recognition systems
Prevention of sensitive information leakage
Improved accuracy of ASR models

Commercial Applications: Potential commercial applications include secure voice assistants, confidential speech-to-text services, and privacy-focused transcription software.

Prior Art: Prior research in the field of privacy-preserving machine learning and secure speech recognition systems may provide insights into similar methods and technologies.

Frequently Updated Research: Stay updated on advancements in privacy-preserving machine learning, secure speech recognition, and data obfuscation techniques.

Questions about Phrase Extraction Technology: 1. How does the method ensure that the obfuscated phrase is not leaked during the training of the ASR model? 2. What are the potential implications of leaked phrases from training data in ASR models?

Original Abstract Submitted

a method of phrase extraction for asr models includes obtaining audio data characterizing an utterance and a corresponding ground-truth transcription of the utterance and modifying the audio data to obfuscate a particular phrase recited in the utterance. the method also includes processing, using a trained asr model, the modified audio data to generate a predicted transcription of the utterance, and determining whether the predicted transcription includes the particular phrase by comparing the predicted transcription of the utterance to the ground-truth transcription of the utterance. when the predicted transcription includes the particular phrase, the method includes generating an output indicating that the trained asr model leaked the particular phrase from a training data set used to train the asr model.

Google llc (20240221772). Phrase Extraction for ASR Models simplified abstract

Contents

Phrase Extraction for ASR Models

Organization Name

Inventor(s)

Phrase Extraction for ASR Models - A simplified explanation of the abstract

Original Abstract Submitted

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools