18054670. GENERATION OF TRAINING EXAMPLES FOR TRAINING AUTOMATIC SPEECH RECOGNIZERS simplified abstract (INTERNATIONAL BUSINESS MACHINES CORPORATION)

From WikiPatents
Revision as of 03:15, 24 May 2024 by Wikipatents (Creating a new page)

GENERATION OF TRAINING EXAMPLES FOR TRAINING AUTOMATIC SPEECH RECOGNIZERS

Organization Name

INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor(s)

Ngoc Minh Tran of Dublin (IE)

Hessel Tuinhof of Dublin (IE)

Beat Buesser of Dublin (IE)

GENERATION OF TRAINING EXAMPLES FOR TRAINING AUTOMATIC SPEECH RECOGNIZERS - A simplified explanation of the abstract

This abstract first appeared for US patent application 18054670, titled 'GENERATION OF TRAINING EXAMPLES FOR TRAINING AUTOMATIC SPEECH RECOGNIZERS'.

Simplified Explanation

The present invention involves a method, computer program product, and computer system for generating training examples for training an automatic speech recognizer. This includes receiving a training dataset of original audio signals and generating training examples based on imperceptible spaces and adversarial audio examples.

  • Training examples are generated for training an automatic speech recognizer.
  • The method involves constructing imperceptible spaces for original audio signals.
  • Adversarial audio examples are generated in the constructed imperceptible space.
  • Imperceptible and adversarial audio examples are provided to an adversarial trainer for the automatic speech recognizer.
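
The steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method claimed in the application: the abstract does not say how the imperceptible space is constructed, so the sketch substitutes a simple per-sample amplitude bound that follows the local signal envelope, and it replaces the speech recognizer with a toy linear classifier. All function names and parameters (imperceptible_bounds, adversarial_example, adversarial_training_step, rel_eps, floor, win) are hypothetical.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def loss(weights, signal, label):
    """Binary cross-entropy of a toy linear 'recognizer' on one signal."""
    p = np.clip(sigmoid(weights @ signal), 1e-12, 1.0 - 1e-12)
    return -(label * np.log(p) + (1.0 - label) * np.log(1.0 - p))


def imperceptible_bounds(signal, rel_eps=0.01, floor=1e-4, win=64):
    # ASSUMPTION: the "imperceptible space" is modeled as a per-sample
    # amplitude bound proportional to the local envelope of the signal.
    # Requires len(signal) >= win so the output matches the signal length.
    envelope = np.convolve(np.abs(signal), np.ones(win) / win, mode="same")
    return np.maximum(rel_eps * envelope, floor)


def adversarial_example(signal, label, weights, bounds, steps=10):
    # Projected sign-gradient ascent on the loss; after every step the
    # perturbation is clipped back into [-bounds, bounds].
    delta = np.zeros_like(signal)
    for _ in range(steps):
        p = sigmoid(weights @ (signal + delta))
        grad = (p - label) * weights  # d(loss)/d(input) for the toy model
        delta = np.clip(delta + 0.25 * bounds * np.sign(grad), -bounds, bounds)
    return signal + delta


def adversarial_training_step(weights, signals, labels, lr=0.1):
    # Toy "adversarial trainer": one gradient step per signal, computed on
    # the bounded adversarial example instead of the original audio.
    for x, y in zip(signals, labels):
        x_adv = adversarial_example(x, y, weights, imperceptible_bounds(x))
        p = sigmoid(weights @ x_adv)
        weights = weights - lr * (p - y) * x_adv
    return weights


# Usage with synthetic data (a sine wave standing in for an audio signal).
rng = np.random.default_rng(0)
audio = np.sin(np.linspace(0.0, 20.0, 400))
w = 0.05 * rng.normal(size=400)
w = adversarial_training_step(w, [audio], [1.0])
```

A real system would presumably derive the bound from a perceptual model of hearing and compute gradients through the full recognizer; the sketch only shows how the three stages (bound construction, bounded attack, training on the result) fit together.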

Potential Applications

This technology can be applied in:

  • Speech recognition systems
  • Voice-controlled devices
  • Language translation tools

Problems Solved

This technology helps in:

  • Hardening automatic speech recognizers against adversarial audio that sounds unchanged to human listeners
  • Enhancing the performance of speech-to-text systems

Benefits

The benefits of this technology include:

  • Enhanced training examples for automatic speech recognition
  • Increased efficiency in speech recognition systems

Potential Commercial Applications

This technology can commercially benefit:

  • Speech recognition software companies
  • Virtual assistant developers

Possible Prior Art

Possible prior art includes the use of adversarial training examples in machine learning to improve model performance.

Unanswered Questions

How does this technology handle different accents and dialects in speech recognition?

This article does not address how the generated training examples account for variations in accents and dialects that may affect speech recognition accuracy.

What impact does this technology have on the privacy and security of audio data used for training?

The article does not discuss the potential privacy and security implications of using imperceptible spaces and adversarial audio examples in training automatic speech recognizers.


Original Abstract Submitted

A method, computer program product, and computer system for generation of training examples for training an automatic speech recognizer. Embodiments of the present invention can receive a training dataset of original audio signals and generate training examples for training an automatic speech recognizer based, at least in part, on a constructed imperceptible space for an original audio signal of the original audio signals and adversarial audio examples in the constructed imperceptible space. Embodiments of the present invention can then generate an imperceptible and adversarial audio example to an adversarial trainer for the automatic speech recognizer.