US Patent Application 18026960. Speech Separation Method, Electronic Device, Chip, and Computer-Readable Storage Medium simplified abstract
Speech Separation Method, Electronic Device, Chip, and Computer-Readable Storage Medium - A simplified explanation of the abstract
- This abstract appeared for US patent application number 18026960, titled 'Speech Separation Method, Electronic Device, Chip, and Computer-Readable Storage Medium'
Simplified Explanation
This abstract describes a method for separating one user's speech from a mixed audio signal with the help of video of the user's face. While the user is speaking, the method obtains audio that contains the user's speech and video that contains the user's face. It codes the audio to produce a mixed acoustic feature, extracts a visual semantic feature of the user from the video, feeds both features into a preset visual speech separation network to obtain the user's acoustic feature, and decodes that feature to recover the user's speech signal. The application also provides an electronic device, a chip, and a computer-readable storage medium.
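The order of the steps in the abstract can be sketched as a small pipeline. This is only an illustrative toy, not the applicant's implementation: the abstract does not disclose the coder, the visual feature extractor, or the separation network, so every function name, the framing-based encoder, the per-frame "activity score", and the multiply-by-score separation rule below are assumptions chosen to make the data flow visible.

```python
from typing import List, Sequence

Frame = List[float]


def encode_audio(waveform: Sequence[float], frame_len: int) -> List[Frame]:
    """Toy coder (assumption): split the mixed waveform into fixed-length frames.

    Stands in for the step that yields the mixed acoustic feature.
    Trailing samples that do not fill a whole frame are dropped.
    """
    n_frames = len(waveform) // frame_len
    return [list(waveform[i * frame_len:(i + 1) * frame_len]) for i in range(n_frames)]


def extract_visual_feature(video_frames: Sequence[float]) -> List[float]:
    """Toy visual front end (assumption): one score in [0, 1] per video frame.

    Each video frame is modeled as a single hypothetical mouth-activity number;
    a real system would derive a feature from images of the user's face.
    """
    return [min(1.0, max(0.0, float(v))) for v in video_frames]


def separation_network(mixed: List[Frame], visual: List[float]) -> List[Frame]:
    """Toy stand-in (assumption) for the preset visual speech separation network.

    Scales each audio frame by the visual score with the same index, and
    requires the two feature sequences to be aligned frame for frame.
    """
    if len(mixed) != len(visual):
        raise ValueError("audio and visual features must have the same number of frames")
    return [[sample * score for sample in frame] for frame, score in zip(mixed, visual)]


def decode_audio(feature: List[Frame]) -> List[float]:
    """Toy decoder (assumption): concatenate frames back into a waveform."""
    return [sample for frame in feature for sample in frame]


def separate_speech(waveform: Sequence[float],
                    video_frames: Sequence[float],
                    frame_len: int = 2) -> List[float]:
    """Run the steps in the order the abstract lists them."""
    mixed_feature = encode_audio(waveform, frame_len)        # code the audio
    visual_feature = extract_visual_feature(video_frames)    # extract visual feature
    user_feature = separation_network(mixed_feature, visual_feature)  # separate
    return decode_audio(user_feature)                        # decode to speech signal


if __name__ == "__main__":
    # Four audio samples, two video frames: the user is "active" only in the first frame.
    print(separate_speech([1.0, 2.0, 3.0, 4.0], [1.0, 0.0]))
```

In this toy, audio frames that coincide with no visual activity are zeroed out, which only hints at the idea of using the face video to decide which part of the mixture belongs to the user.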
Original Abstract Submitted
A speech separation method is provided, and relates to the field of speech. The method includes: obtaining, in a speaking process of a user, audio information including a user speech and video information including a user face; coding the audio information to obtain a mixed acoustic feature; extracting a visual semantic feature of the user from the video information; inputting the mixed acoustic feature and the visual semantic feature into a preset visual speech separation network to obtain an acoustic feature of the user; and decoding the acoustic feature of the user to obtain a speech signal of the user. An electronic device, a chip, and a computer-readable storage medium are provided.