20240013775. PATCHED MULTI-CONDITION TRAINING FOR ROBUST SPEECH RECOGNITION simplified abstract (Samsung Electronics Co., Ltd.)

From WikiPatents

PATCHED MULTI-CONDITION TRAINING FOR ROBUST SPEECH RECOGNITION

Organization Name

Samsung Electronics Co., Ltd.

Inventor(s)

Pablo Peso Parada of Staines (GB)

Agnieszka Dobrowolska of Staines (GB)

Karthikeyan Saravanan of Staines (GB)

Mete Ozay of Staines (GB)

PATCHED MULTI-CONDITION TRAINING FOR ROBUST SPEECH RECOGNITION - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240013775, titled 'PATCHED MULTI-CONDITION TRAINING FOR ROBUST SPEECH RECOGNITION'.

Simplified Explanation

The abstract describes a method for building a "patched" signal that is used to train a speech or audio recognition model. A first signal (speech or audio) is modified to produce at least one second signal. The first signal and each second signal are then divided into patches, where each patch contains a respective part of its signal. Finally, selected patches from the first and second signals are mixed to form the patched signal.

  • A first signal is modified to obtain at least one second signal.
  • The first signal and each second signal are divided into multiple patches.
  • Selected patches from the first and second signals are mixed to obtain a patched signal.
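
The steps above can be illustrated with a minimal sketch. The abstract does not say how the signal is modified, how patches are sized, or how they are selected and mixed, so everything concrete here is an assumption made for illustration: additive Gaussian noise as the modification, fixed-length consecutive patches, a uniform random choice of source per patch position, and "mixing" interpreted as placing each chosen patch at its original position. The function names `modify_signal`, `split_into_patches`, and `patched_mix` are hypothetical.

```python
import numpy as np


def modify_signal(signal, rng, noise_std=0.05):
    # Assumed modification: additive Gaussian noise.
    # The abstract only says the first signal is "modified".
    return signal + rng.normal(0.0, noise_std, size=signal.shape)


def split_into_patches(signal, patch_len):
    # Consecutive, non-overlapping patches; the last one may be shorter.
    return [signal[i:i + patch_len] for i in range(0, len(signal), patch_len)]


def patched_mix(first, seconds, patch_len, rng):
    # Source 0 is the first signal; sources 1..M are the second signals.
    sources = [first] + list(seconds)
    patch_sets = [split_into_patches(s, patch_len) for s in sources]
    n_patches = len(patch_sets[0])
    # Assumed selection rule: pick one source uniformly at random per position.
    choices = rng.integers(0, len(sources), size=n_patches)
    patched = np.concatenate([patch_sets[c][k] for k, c in enumerate(choices)])
    return patched, choices


rng = np.random.default_rng(0)
sample_rate = 16000
# A 1-second sine wave stands in for a speech or audio signal.
first = np.sin(2 * np.pi * 440 * np.arange(sample_rate) / sample_rate)
second = modify_signal(first, rng)
patched, choices = patched_mix(first, [second], patch_len=1600, rng=rng)
```

Here `patched` has the same length as `first`, and `choices` records which source each of the ten patch positions was drawn from. A real system might use a different modification (for example reverberation or recorded noise) and a different selection rule; the abstract leaves both open.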

Potential applications of this technology:

  • Training models for speech recognition.
  • Training models for audio recognition.

Problems solved by this technology:

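  • As the title indicates, the aim is speech recognition that remains robust across conditions; the method supplies training signals for that purpose.
  • A single patched training signal can contain parts of both the original signal and its modified versions, rather than only one of them.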

Benefits of this technology:

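  • Potentially more robust and more accurate speech and audio recognition models, although the abstract itself reports no measured results.
  • Additional training signals obtained from an existing signal by modifying it and mixing patches.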


Original Abstract Submitted

A method of obtaining a patched signal for training a model for use in at least one of a speech and an audio recognition is disclosed. The method comprises obtaining a first signal, wherein the first signal is at least one of a speech and an audio signal, modifying the first signal to obtain at least one second signal, dividing the first signal and the at least one second signal respectively into a plurality of first patches and a plurality of second patches, wherein each one of the plurality of first patches comprises a respective part of the first signal and each one of the plurality of second patches comprises a respective part of the at least one second signal, and mixing selected ones of the plurality of first patches and the plurality of second patches to obtain a patched signal.
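
One possible way to write the abstract's steps in notation is given below. The symbols are introduced here for illustration and do not come from the application; in particular, the assumptions that every signal is split into the same number of patches $K$ and that the patched signal keeps one patch per position are one reading of "mixing selected ones", not something the abstract states.

```latex
% Assumed notation: x^{(0)} is the first signal; x^{(1)}, \dots, x^{(M)} are
% the second signals obtained by modifying x^{(0)}.
% Each signal is divided into K patches, each a respective part of its signal:
\[
  x^{(m)} = \bigl[\, p^{(m)}_1,\; p^{(m)}_2,\; \dots,\; p^{(m)}_K \,\bigr],
  \qquad m = 0, 1, \dots, M .
\]
% A selection s_k \in \{0, 1, \dots, M\} names the source of patch k,
% and the patched signal is assembled from the selected patches:
\[
  \tilde{x} = \bigl[\, p^{(s_1)}_1,\; p^{(s_2)}_2,\; \dots,\; p^{(s_K)}_K \,\bigr].
\]
```

With $M = 1$ this reduces to choosing, for each patch position, between the original patch and its modified counterpart.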