VIA TECHNOLOGIES, INC. (20240347054). SPEAKING PRACTICE SYSTEM WITH RELIABLE PRONUNCIATION EVALUATION simplified abstract


SPEAKING PRACTICE SYSTEM WITH RELIABLE PRONUNCIATION EVALUATION

Organization Name

VIA TECHNOLOGIES, INC.

Inventor(s)

Jing-Jing Guo of Shanghai (CN)

Steve Shu Liu of Shanghai (CN)

SPEAKING PRACTICE SYSTEM WITH RELIABLE PRONUNCIATION EVALUATION - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240347054 titled 'SPEAKING PRACTICE SYSTEM WITH RELIABLE PRONUNCIATION EVALUATION'.

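Simplified Explanation: The patent application describes goodness of pronunciation (GOP) evaluation techniques with improved reliability. A data preprocessing system, made up of a phonetic symbol generation system and an audio recording preprocessing system, takes a practice text and an audio recording of the user reading that text and produces phonetic symbols and audio data. A GOP evaluation system then scores the recording based on those phonetic symbols and that audio data.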

Key Features and Innovation:

  • Data preprocessing system with phonetic symbol generation and audio recording preprocessing.
  • GOP evaluation system that scores audio recordings based on phonetic symbols and audio data.
  • Phonetic symbol generation system utilizes an artificial intelligence model to handle polyphonic words effectively.
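The preprocessing stage listed above can be sketched roughly as follows. This is only an illustrative sketch, not the method claimed in the application: the lexicons, the ARPAbet-style symbols, the function names, and the toy part-of-speech rule standing in for the artificial intelligence model are all assumptions made for the example, as is the choice of peak normalization for the audio step.

```python
# Hypothetical lexicons (assumptions for illustration only).
# Ordinary words map to a single pronunciation.
SIMPLE_LEXICON = {"the": "DH AH", "a": "AH"}
# Polyphonic words map part of speech -> pronunciation.
POLYPHONIC_LEXICON = {
    "record": {"NOUN": "R EH K ER D", "VERB": "R IH K AO R D"},
}


def toy_pos_model(words, index):
    """Stand-in for the AI model: guess the part of speech from the previous word."""
    previous = words[index - 1] if index > 0 else ""
    return "NOUN" if previous in {"the", "a"} else "VERB"


def generate_phonetic_symbols(practice_text):
    """Return one pronunciation string per word of the practice text."""
    words = practice_text.lower().split()
    symbols = []
    for i, word in enumerate(words):
        if word in POLYPHONIC_LEXICON:
            # Only polyphonic words are routed through the model.
            pos = toy_pos_model(words, i)
            symbols.append(POLYPHONIC_LEXICON[word][pos])
        else:
            symbols.append(SIMPLE_LEXICON.get(word, "<unk>"))
    return symbols


def preprocess_audio(samples):
    """Peak-normalize raw samples to the range [-1.0, 1.0] (assumed preprocessing step)."""
    peak = max((abs(s) for s in samples), default=0)
    if peak == 0:
        return [0.0 for _ in samples]
    return [s / peak for s in samples]
```

In this sketch, "the record" resolves "record" as a noun and "record a" resolves it as a verb, which is the kind of ambiguity the phonetic symbol generation system is said to handle with its model.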

Potential Applications: This technology can be used in language learning applications, speech therapy tools, and automated pronunciation assessment systems.

Problems Solved: The technology addresses the challenge of evaluating pronunciation reliably when the practice text contains polyphonic words, that is, words whose pronunciation depends on their part of speech, or special words such as numbers and place names.

Benefits:

  • Improved reliability in evaluating pronunciation.
  • Enhanced learning and feedback for language learners.
  • Efficient assessment of pronunciation in speech therapy.

Commercial Applications: Potential commercial applications include language learning platforms, educational software, and speech assessment tools for professionals.

Prior Art: Prior research in the field of speech recognition and pronunciation assessment can provide valuable insights into similar technologies and approaches.

Frequently Updated Research: Stay updated on advancements in artificial intelligence models for phonetic symbol generation and audio processing in speech recognition systems.

Questions about Pronunciation Evaluation Technology:

  1. How does the artificial intelligence model handle polyphonic words in generating phonetic symbols?
  2. What are the potential implications of this technology in improving language learning outcomes?


Original Abstract Submitted

Goodness of pronunciation (GOP) evaluation techniques with improved reliability are presented. A data preprocessing server operates a data pre-processing system and a GOP evaluation system. The data pre-processing system includes a phonetic symbol generation system and an audio recording preprocessing system. Based on a practice text as well as an audio recording of the user reading the practice text, the phonetic symbol generation system generates phonetic symbols, and the audio recording preprocessing system generates audio data. The GOP evaluation system scores the audio recording based on the phonetic symbols and the audio data. The phonetic symbol generation system operates an artificial intelligence model, which generates the phonetic symbols in response to the fact that the practice text includes polyphonic words. Polyphonic words are words with several pronunciations due to their parts of speech, or special words which are numbers or place names.
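The abstract does not state how the GOP evaluation system computes its score. As a rough illustration only, one widely used posterior-based formulation averages the log posterior probability of the expected phone over the audio frames aligned to it. The function name, the input layout (one dictionary of phone posteriors per frame), and the probability floor below are assumptions made for this sketch, not details from the application.

```python
import math


def gop_score(frame_posteriors, target_phone, floor=1e-10):
    """Mean log posterior of the target phone over its aligned frames.

    frame_posteriors: list of dicts mapping phone -> posterior probability,
    one dict per audio frame aligned to the target phone (assumed input format).
    Scores are <= 0; values closer to 0 indicate a closer match.
    """
    if not frame_posteriors:
        raise ValueError("at least one aligned frame is required")
    total = 0.0
    for frame in frame_posteriors:
        # Floor the probability so a missing or zero entry does not give log(0).
        probability = max(frame.get(target_phone, 0.0), floor)
        total += math.log(probability)
    return total / len(frame_posteriors)
```

Under this formulation, a phone whose frames all assign it probability 1.0 scores 0.0, and less confident frames pull the score further below zero.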