Tencent Technology (Shenzhen) Company Limited (20240321289). METHOD AND APPARATUS FOR EXTRACTING FEATURE REPRESENTATION, DEVICE, MEDIUM, AND PROGRAM PRODUCT simplified abstract

From WikiPatents
Jump to navigation Jump to search

METHOD AND APPARATUS FOR EXTRACTING FEATURE REPRESENTATION, DEVICE, MEDIUM, AND PROGRAM PRODUCT

Organization Name

Tencent Technology (Shenzhen) Company Limited

Inventor(s)

Yi Luo of Shenzhen (CN)

Jianwei Yu of Shenzhen (CN)

METHOD AND APPARATUS FOR EXTRACTING FEATURE REPRESENTATION, DEVICE, MEDIUM, AND PROGRAM PRODUCT - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240321289 titled 'METHOD AND APPARATUS FOR EXTRACTING FEATURE REPRESENTATION, DEVICE, MEDIUM, AND PROGRAM PRODUCT

The patent application describes a method and apparatus for extracting a feature representation in the field of voice analysis technologies.

  • Obtaining sample audio
  • Extracting a sample time-frequency feature representation from the sample audio
  • Performing frequency band segmentation on the time-frequency feature representation to obtain time-frequency sub-feature representations for at least two frequency bands
  • Performing inter-frequency band relationship analysis on the sub-feature representations to obtain an application time-frequency feature representation based on the analysis result

Potential Applications: - Voice recognition systems - Speaker identification technology - Speech analysis for medical or forensic purposes

Problems Solved: - Efficient extraction of feature representations from audio samples - Improved accuracy in voice analysis and recognition

Benefits: - Enhanced performance of voice analysis systems - Better identification and classification of speakers - Potential for applications in various industries such as security, healthcare, and telecommunications

Commercial Applications: Title: Advanced Voice Analysis Technology for Enhanced Speaker Recognition This technology can be used in security systems for access control, in call centers for customer identification, and in healthcare for voice-based patient authentication.

Questions about Voice Analysis Technology: 1. How does this technology improve the accuracy of speaker recognition systems? 2. What are the potential limitations of this method in real-world applications?

Frequently Updated Research: Stay updated on advancements in voice analysis technologies, particularly in the areas of feature extraction and inter-frequency band relationship analysis.


Original Abstract Submitted

a method and an apparatus for extracting a feature representation, a device, a medium, and a program product are provided and relate to the field of voice analysis technologies. the method includes: obtaining sample audio; extracting a sample time-frequency feature representation corresponding to the sample audio; performing frequency band segmentation on the sample time-frequency feature representation from a frequency domain dimension, to obtain time-frequency sub-feature representations respectively corresponding to at least two frequency bands; and performing inter-frequency band relationship analysis on the time-frequency sub-feature representations respectively corresponding to the at least two frequency bands from the frequency domain dimension, and obtaining an application time-frequency feature representation based on an inter-frequency band relationship analysis result.