TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED (20240242722). AUDIO PROCESSING METHOD AND APPARATUS, DEVICE, READABLE STORAGE MEDIUM, AND PROGRAM PRODUCT simplified abstract

From WikiPatents

AUDIO PROCESSING METHOD AND APPARATUS, DEVICE, READABLE STORAGE MEDIUM, AND PROGRAM PRODUCT

Organization Name

TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor(s)

Hongning Zhu of Shenzhen (CN)

AUDIO PROCESSING METHOD AND APPARATUS, DEVICE, READABLE STORAGE MEDIUM, AND PROGRAM PRODUCT - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240242722, titled 'AUDIO PROCESSING METHOD AND APPARATUS, DEVICE, READABLE STORAGE MEDIUM, AND PROGRAM PRODUCT'.

The patent application describes an audio processing method and apparatus in the field of computer technologies. The method determines a voiceprint vector for each of a plurality of audio segments, builds an initial similarity matrix from the pairwise similarities between those vectors, adjusts that matrix using a dynamic threshold for each row to obtain a reference similarity matrix, and uses the reference similarity matrix to determine how many audio objects the segments correspond to.

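  • A voiceprint vector is determined for each of a plurality of audio segments.
  • An initial similarity matrix is built from these vectors; it holds the similarity between the voiceprint vectors of any two audio segments.
  • The initial matrix is adjusted using a dynamic threshold for each of its rows, yielding a reference similarity matrix.
  • The quantity of audio objects corresponding to the audio segments is determined from the reference similarity matrix.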
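
The four steps above can be sketched in a few lines of Python. This is a minimal illustration, not the method claimed in the application: the abstract does not say how similarity is measured, how each row's dynamic threshold is computed, or how the count is derived from the reference matrix. The sketch assumes cosine similarity, uses the row mean as the threshold, and counts connected components of the thresholded matrix. All function names and the toy vectors are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity_matrix(vectors):
    """Initial similarity matrix: entry [i][j] compares segments i and j."""
    return [[cosine(u, v) for v in vectors] for u in vectors]


def apply_row_thresholds(sim):
    """Zero out entries that fall below their row's threshold.

    ASSUMPTION: the "dynamic threshold" of a row is taken to be the row mean;
    the abstract does not specify how it is computed.
    """
    ref = []
    for row in sim:
        threshold = sum(row) / len(row)
        ref.append([s if s >= threshold else 0.0 for s in row])
    return ref


def count_audio_objects(ref):
    """Count groups of mutually linked segments in the reference matrix.

    ASSUMPTION: segments i and j are linked when both ref[i][j] and ref[j][i]
    survived thresholding; the number of connected components is the count.
    """
    n = len(ref)
    seen = [False] * n
    count = 0
    for start in range(n):
        if seen[start]:
            continue
        count += 1
        seen[start] = True
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if not seen[j] and ref[i][j] > 0 and ref[j][i] > 0:
                    seen[j] = True
                    stack.append(j)
    return count


# Made-up 2-D "voiceprint vectors": two segments per speaker.
segments = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.9]]
reference = apply_row_thresholds(similarity_matrix(segments))
print(count_audio_objects(reference))  # prints 2 for this toy data
```

A per-row threshold, unlike a single global cutoff, lets each segment keep only its own strongest matches, which is one plausible reason for computing the threshold row by row.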

Potential Applications:

  • Speech recognition systems
  • Voice authentication systems
  • Audio content analysis tools

Problems Solved:

  • Efficient audio processing and analysis
  • Accurate identification of audio objects

Benefits:

  • Improved accuracy in audio processing
  • Enhanced performance of speech recognition systems

Commercial Applications: Advanced Audio Processing Technology for Speech Recognition Systems

This technology could be used in industries such as telecommunications, security, and entertainment to develop speech recognition systems with improved accuracy and performance.

Questions about the technology:

  1. How does this technology improve the accuracy of speech recognition systems?
  2. What are the potential applications of this audio processing method in the security industry?

Frequently Updated Research: Researchers continue to explore new algorithms and techniques to improve audio processing methods for applications such as speech recognition and voice authentication.


Original Abstract Submitted

In the field of computer technologies, an audio processing method and apparatus, a device, a readable storage medium, and a program product are provided. The method includes: determining a voiceprint vector corresponding to each of a plurality of audio segments; determining an initial similarity matrix according to the voiceprint vector corresponding to each audio segment, the initial similarity matrix including a similarity between voiceprint vectors corresponding to any two audio segments; adjusting the initial similarity matrix according to a dynamic threshold corresponding to each row of the initial similarity matrix to obtain a reference similarity matrix; and determining, according to the reference similarity matrix, a quantity of audio objects corresponding to the plurality of audio segments.
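
One way to write the two matrix steps of the abstract in symbols is given here. The notation ($N$ segments, voiceprint vectors $v_i$, initial matrix $S$, per-row thresholds $\tau_i$, reference matrix $R$) and the hard-thresholding rule are assumptions made for illustration; the abstract states only that each row has its own dynamic threshold, not how the threshold is computed or applied.

```latex
% Initial similarity matrix over N audio segments (sim is an unspecified similarity measure)
S_{ij} = \operatorname{sim}(v_i, v_j), \qquad 1 \le i, j \le N

% Reference similarity matrix: row i is adjusted with its own threshold \tau_i (assumed rule)
R_{ij} =
\begin{cases}
  S_{ij}, & S_{ij} \ge \tau_i \\
  0,      & \text{otherwise}
\end{cases}
```

The quantity of audio objects is then determined from $R$; the abstract does not name the procedure used for that final step.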