Samsung Electronics Co., Ltd. (20240135925). ELECTRONIC DEVICE FOR PERFORMING SPEECH RECOGNITION AND OPERATION METHOD THEREOF simplified abstract

ELECTRONIC DEVICE FOR PERFORMING SPEECH RECOGNITION AND OPERATION METHOD THEREOF

Organization Name

Samsung Electronics Co., Ltd.

Inventor(s)

Gilho Lee of Suwon-si (KR)

Gajin Song of Suwon-si (KR)

Hoseon Shin of Suwon-si (KR)

Jungin Lee of Suwon-si (KR)

Seokyeong Jeong of Suwon-si (KR)

ELECTRONIC DEVICE FOR PERFORMING SPEECH RECOGNITION AND OPERATION METHOD THEREOF - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240135925, titled 'ELECTRONIC DEVICE FOR PERFORMING SPEECH RECOGNITION AND OPERATION METHOD THEREOF'.

Simplified Explanation

The abstract describes an electronic device that performs speech recognition. It applies automatic speech recognition and/or natural language understanding to a user's speech, then outputs either the recognized text or a related text stored in memory as the recognition result.

  • The device includes a microphone, memory, and at least one processor.
  • The processor acquires speech data from the user via the microphone.
  • It at least partially performs automatic speech recognition and/or natural language understanding to obtain a first text recognized from the speech data.
  • Based on the first text, the processor identifies a second text stored in the memory.
  • The processor outputs either the first text or the second text as the speech recognition result, depending on the difference between the two.
  • Training data for recognizing the user's speech is acquired based on the relevance between the first text and the second text with respect to the speech data.
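
The selection step in the list above can be illustrated with a minimal Python sketch. The abstract does not say how the difference or the relevance is measured, nor which text is output in which case, so everything here is an assumption made for illustration: the function names `similarity` and `resolve`, the `Result` type, the use of a character-level similarity ratio from Python's standard `difflib` module, the 0.8 threshold, and the rule that a close match in memory replaces the recognized text and is kept as a training label.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Optional, Tuple


def similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio between two strings, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


@dataclass
class Result:
    output_text: str                          # text output as the recognition result
    training_pair: Optional[Tuple[str, str]]  # (speech_id, label), or None


def resolve(speech_id: str, first_text: str, stored_texts: List[str],
            threshold: float = 0.8) -> Result:
    """Choose between the recognized (first) text and a stored (second) text.

    Hypothetical rule, not taken from the patent: if the closest stored text
    is similar enough to the recognized text, output the stored text and keep
    the (speech, stored text) pair as training data.
    """
    # Identify the "second text": the stored entry closest to the first text.
    second_text = max(stored_texts,
                      key=lambda s: similarity(first_text, s),
                      default=None)
    if second_text is None:
        # Nothing stored in memory, so the recognized text is output as-is.
        return Result(first_text, None)

    if similarity(first_text, second_text) >= threshold:
        # Small difference: assume the stored text is what the user meant.
        return Result(second_text, (speech_id, second_text))

    # Large difference: keep the recognized text and collect no training data.
    return Result(first_text, None)


if __name__ == "__main__":
    memory = ["call Seokyeong", "play jazz music"]
    print(resolve("utt-1", "call Seokyung", memory))
    print(resolve("utt-2", "what is the weather", memory))
```

In this sketch a single similarity score stands in for both the "difference" and the "relevance" named in the abstract. A real system could measure them separately, for example by comparing pronunciations rather than spellings.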

Potential Applications

This technology can be applied in various fields such as:

  • Voice-controlled devices
  • Virtual assistants
  • Speech-to-text transcription services

Problems Solved

This technology addresses the following issues:

  • Improving speech recognition accuracy
  • Enhancing user experience with voice-controlled devices
  • Streamlining speech-to-text conversion processes

Benefits

The benefits of this technology include:

  • Increased efficiency in speech recognition
  • Enhanced user interaction with electronic devices
  • Improved accuracy in speech-to-text conversion

Potential Commercial Applications

This technology can be utilized in commercial applications such as:

  • Customer service chatbots
  • Voice-activated smart home devices
  • Transcription services

Possible Prior Art

Possible prior art includes the speech recognition systems used in virtual assistants such as Siri, Alexa, and Google Assistant.

Unanswered Questions

How does this technology handle different accents and speech patterns?

The abstract does not mention how the device deals with variations in speech patterns and accents that may affect speech recognition accuracy.

What is the processing speed of the device in recognizing and outputting speech data?

The abstract does not provide information on the processing speed of the device in recognizing and outputting speech data.


Original Abstract Submitted

An electronic device according to an embodiment may include a microphone, a memory, and at least one processor(s). According to an embodiment, the at least one processor may be configured to acquire speech data corresponding to a user's speech via the microphone. The at least one processor according to an embodiment may be configured to acquire first text recognized on speech data by at least partially performing automatic speech recognition and/or natural language understanding. The at least one processor according to an embodiment may be configured to identify, based on the first text, second text stored in the memory. The at least one processor according to an embodiment may be configured to control to output the first text or the second text as a speech recognition result of the speech data, based on a difference between the first text and the second text. The at least one processor according to an embodiment may be configured to acquire training data for recognition of the user's speech, based on relevance between the first text and the second text with respect to the speech data.