US Patent Application 18356738. TEXT DATA PROCESSING METHOD AND APPARATUS simplified abstract


TEXT DATA PROCESSING METHOD AND APPARATUS

Organization Name

HUAWEI TECHNOLOGIES CO., LTD.


Inventor(s)

Nianzu Zheng of Shenzhen (CN)

Disong Wang of Shenzhen (CN)

Liqun Deng of Shenzhen (CN)

Yang Zhang of Shenzhen (CN)

TEXT DATA PROCESSING METHOD AND APPARATUS - A simplified explanation of the abstract

This abstract first appeared for US patent application 18356738, titled 'TEXT DATA PROCESSING METHOD AND APPARATUS'.

Simplified Explanation

The patent application describes methods and apparatuses for processing text data, specifically for turning the phonemes of a text into audio.

  • The method involves obtaining target text whose phonemes include a first phoneme and a second phoneme that are adjacent to each other.
  • Feature extraction is performed on the first and second phonemes to obtain a separate audio feature for each.
  • A target recurrent neural network (RNN) is used to obtain speech data corresponding to each phoneme based on that phoneme's audio feature.
  • A vocoder is used to generate audio for each phoneme based on the obtained speech data.
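
The four steps above can be sketched as a small pipeline. This is an illustrative toy under stated assumptions, not the implementation claimed in the application. The feature extractor, the untrained `TinyRNN`, the `toy_vocoder`, all dimensions, and the choice to carry the hidden state from the first phoneme to the second are hypothetical; the abstract does not specify any of them.

```python
import math
import random

# All sizes below are assumptions made for illustration only.
FEATURE_DIM = 4           # length of each per-phoneme audio feature
HIDDEN_DIM = 8            # RNN hidden state size
SPEECH_DIM = 3            # length of each per-phoneme speech-data vector
SAMPLES_PER_PHONEME = 16  # toy waveform length per phoneme


def extract_features(phonemes):
    """Hypothetical feature extractor: one fixed-size vector per phoneme."""
    features = []
    for ph in phonemes:
        seed = sum(ord(c) for c in ph)  # deterministic, unlike built-in hash()
        features.append([math.sin(seed * (i + 1)) for i in range(FEATURE_DIM)])
    return features


class TinyRNN:
    """Minimal Elman-style RNN with fixed random weights (untrained)."""

    def __init__(self, seed=0):
        rng = random.Random(seed)

        def matrix(rows, cols):
            return [[rng.uniform(-0.5, 0.5) for _ in range(cols)]
                    for _ in range(rows)]

        self.w_in = matrix(HIDDEN_DIM, FEATURE_DIM)
        self.w_rec = matrix(HIDDEN_DIM, HIDDEN_DIM)
        self.w_out = matrix(SPEECH_DIM, HIDDEN_DIM)

    def step(self, feature, hidden):
        """Consume one phoneme feature; return (speech_data, new_hidden)."""
        new_hidden = [
            math.tanh(
                sum(w * x for w, x in zip(self.w_in[j], feature))
                + sum(w * h for w, h in zip(self.w_rec[j], hidden))
            )
            for j in range(HIDDEN_DIM)
        ]
        speech = [sum(w * h for w, h in zip(row, new_hidden))
                  for row in self.w_out]
        return speech, new_hidden


def toy_vocoder(speech):
    """Stand-in vocoder: maps a speech-data vector to a short sine waveform."""
    freq = 1.0 + abs(speech[0])
    amp = min(1.0, abs(speech[1]) + 0.1)
    return [amp * math.sin(2 * math.pi * freq * t / SAMPLES_PER_PHONEME)
            for t in range(SAMPLES_PER_PHONEME)]


def synthesize(phonemes):
    """Run feature extraction, the RNN, and the vocoder for each phoneme."""
    features = extract_features(phonemes)
    rnn = TinyRNN()
    hidden = [0.0] * HIDDEN_DIM
    audio = []
    for feature in features:
        # Assumption: the same RNN handles adjacent phonemes in order,
        # passing its hidden state from one phoneme to the next.
        speech, hidden = rnn.step(feature, hidden)
        audio.append(toy_vocoder(speech))
    return audio
```

Calling `synthesize(["HH", "AY"])` returns two waveforms, one per phoneme, in the same order as the input. A real system would use trained weights and a proper vocoder in place of these stand-ins.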


Original Abstract Submitted

The present disclosure relates to text data processing methods and apparatuses. One example method includes obtaining target text, where a phoneme of the target text includes a first phoneme and a second phoneme that are adjacent to each other. Feature extraction is performed on the first phoneme and the second phoneme to obtain a first audio feature of the first phoneme and a second audio feature of the second phoneme. By using a target recurrent neural network (RNN) and based on the first audio feature, first speech data corresponding to the first phoneme is obtained. By using the target RNN and based on the second audio feature, second speech data corresponding to the second phoneme is obtained. By using a vocoder and based on the first speech data and the second speech data, audio corresponding to the first phoneme and audio corresponding to the second phoneme are obtained.