20240021202. METHOD AND APPARATUS FOR RECOGNIZING VOICE, ELECTRONIC DEVICE AND MEDIUM simplified abstract (Beijing Youzhuju Network Technology Co., Ltd.)

From WikiPatents

METHOD AND APPARATUS FOR RECOGNIZING VOICE, ELECTRONIC DEVICE AND MEDIUM

Organization Name

Beijing Youzhuju Network Technology Co., Ltd.

Inventor(s)

Ling Xu of Beijing (CN)

Yi He of Beijing (CN)

METHOD AND APPARATUS FOR RECOGNIZING VOICE, ELECTRONIC DEVICE AND MEDIUM - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240021202, titled 'METHOD AND APPARATUS FOR RECOGNIZING VOICE, ELECTRONIC DEVICE AND MEDIUM'.

Simplified Explanation

The disclosed patent application describes a method and apparatus for speech recognition, as well as an electronic device and a medium. The method involves the following steps:

  • Acquiring audio data to be recognized, which includes a speech segment.
  • Determining the start and end times of the speech segment within the audio data.
  • Extracting at least one speech segment from the audio data based on the determined start and end times.
  • Performing speech recognition on the extracted speech segment to generate recognition text corresponding to the audio data.
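
The four steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method claimed in the application: the abstract does not say how the start and end times are determined, so a simple frame-energy threshold stands in for that step, and the recognizer is a caller-supplied function. All names (find_speech_segments, extract_segments, recognize_audio, the 20 ms frame size and the 0.01 threshold) are hypothetical.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Segment = Tuple[float, float]  # (start_seconds, end_seconds)


def find_speech_segments(
    samples: Sequence[float],
    sample_rate: int,
    frame_ms: int = 20,
    energy_threshold: float = 0.01,
) -> List[Segment]:
    """Step 2: determine start and end times of speech.

    Assumption: a frame counts as speech when its mean squared amplitude
    reaches energy_threshold. The patent abstract does not specify a detector.
    """
    frame_len = max(1, sample_rate * frame_ms // 1000)
    segments: List[Segment] = []
    start: Optional[int] = None  # sample index where the current run began
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        energy = sum(x * x for x in frame) / len(frame)
        if energy >= energy_threshold:
            if start is None:
                start = i
        elif start is not None:
            segments.append((start / sample_rate, i / sample_rate))
            start = None
    if start is not None:  # speech runs to the end of the audio
        segments.append((start / sample_rate, len(samples) / sample_rate))
    return segments


def extract_segments(
    samples: Sequence[float], sample_rate: int, segments: Sequence[Segment]
) -> List[Sequence[float]]:
    """Step 3: cut the audio at the determined start and end times."""
    return [
        samples[int(round(s * sample_rate)):int(round(e * sample_rate))]
        for s, e in segments
    ]


def recognize_audio(
    samples: Sequence[float],
    sample_rate: int,
    recognize: Callable[[Sequence[float]], str],
) -> str:
    """Steps 1-4: take audio, locate speech, extract it, recognize each clip."""
    segments = find_speech_segments(samples, sample_rate)
    clips = extract_segments(samples, sample_rate, segments)
    # Hypothetical recognizer: any function mapping a clip to text.
    return " ".join(recognize(clip) for clip in clips)


if __name__ == "__main__":
    rate = 1000  # samples per second, chosen small for the demo
    audio = [0.0] * 100 + [0.5] * 200 + [0.0] * 100 + [0.5] * 100
    print(find_speech_segments(audio, rate))
    print(recognize_audio(audio, rate, lambda clip: f"<{len(clip)} samples>"))
```

In the demo, the placeholder recognizer only reports the clip length; in practice it would be replaced by a call to a real speech recognition engine.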

Potential applications of this technology:

  • Voice assistants: The method can be used in voice assistants like Siri or Alexa to accurately recognize and understand user commands or queries.
  • Transcription services: The technology can be applied in transcription services to convert spoken language into written text.
  • Call center automation: Speech recognition can be utilized in call centers to automatically transcribe customer conversations and assist agents in real-time.

Problems solved by this technology:

  • Accurate speech recognition: The method improves the accuracy of speech recognition by extracting and analyzing specific speech segments, reducing errors caused by background noise or other non-speech elements.
  • Efficient processing: By extracting and analyzing only the relevant speech segments, the method reduces the computational resources required for speech recognition tasks.
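
To make the efficiency point concrete, the share of a recording that actually reaches the recognizer can be computed from the segment times. The sketch below assumes non-overlapping segments and a hypothetical helper name, speech_fraction. How much computation is saved in practice depends on the recognizer and is not stated in the abstract.

```python
from typing import Sequence, Tuple


def speech_fraction(
    segments: Sequence[Tuple[float, float]], total_seconds: float
) -> float:
    """Fraction of the recording's duration covered by the speech segments.

    Assumption: segments are (start, end) pairs in seconds and do not overlap.
    """
    if total_seconds <= 0:
        raise ValueError("total_seconds must be positive")
    speech_seconds = sum(end - start for start, end in segments)
    return speech_seconds / total_seconds


if __name__ == "__main__":
    # A 60-second recording with 8 s and 22 s of speech.
    print(speech_fraction([(2.0, 10.0), (20.0, 42.0)], 60.0))  # 0.5
```

Here only half of the recording would be passed on for recognition; the rest is skipped.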

Benefits of this technology:

  • Improved user experience: Accurate speech recognition enhances the user experience by enabling more precise and reliable voice commands and interactions.
  • Time-saving: The method allows for faster and more efficient processing of speech recognition tasks, leading to time savings in various applications.
  • Enhanced productivity: By automating speech recognition, the technology can increase productivity in tasks such as transcription or call center operations.


Original Abstract Submitted

Embodiments of the disclosure disclose a method and apparatus for speech recognition, an electronic device and a medium. The method includes: acquiring an audio data to be recognized, the audio data to be recognized including a speech segment; determining a start and end time corresponding to the speech segment which is comprised in the audio data; extracting at least one speech segment from the audio data to be recognized based on the determined start and end time; and performing speech recognition on the at least one extracted speech segment to generate recognition text corresponding to the audio data to be recognized.