Snap inc. (20240185879). NEURAL NETWORKS FOR CHANGING CHARACTERISTICS OF VOCALS simplified abstract

From WikiPatents
Jump to navigation Jump to search

NEURAL NETWORKS FOR CHANGING CHARACTERISTICS OF VOCALS

Organization Name

snap inc.

Inventor(s)

Gurunandan Krishnan Gorumkonda of Kirkland WA (US)

NEURAL NETWORKS FOR CHANGING CHARACTERISTICS OF VOCALS - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240185879 titled 'NEURAL NETWORKS FOR CHANGING CHARACTERISTICS OF VOCALS

Simplified Explanation

Simplified Explanation: The patent application describes a messaging system for audio character type swapping, where input audio data with a certain characteristic is transformed into an image, processed using a convolutional neural network (CNN), and then converted back into audio data with a different characteristic.

  • The method involves transforming input audio data into an image representing frequencies and intensities.
  • A CNN is used to process the image and generate output audio data with a different characteristic.
  • The CNN is trained alongside another CNN that performs the reverse transformation.
  • Discriminator CNNs are used to determine the characteristics of the input and output audio data.

Key Features and Innovation:

  • Audio character type swapping through image transformation and CNN processing.
  • Training of CNNs to swap audio characteristics using discriminator CNNs.
  • Integration of vocals in the input and output audio data.

Potential Applications: This technology could be used in voice modulation software, audio editing tools, and entertainment applications.

Problems Solved: This technology addresses the need for efficient and accurate audio character type swapping.

Benefits:

  • Enables easy transformation of audio characteristics.
  • Provides flexibility in audio editing and manipulation.
  • Enhances creativity in audio production.

Commercial Applications: Title: Innovative Audio Character Type Swapping Technology for Enhanced Audio Editing This technology can be utilized in music production software, voice-changing applications, and online communication platforms, potentially impacting the entertainment and communication industries.

Prior Art: There is limited information available on prior art related to this specific technology.

Frequently Updated Research: There is ongoing research in the field of audio processing and neural networks that may impact the development and optimization of this technology.

Questions about Audio Character Type Swapping: Question 1: How does this technology compare to traditional methods of voice modulation? This technology offers a more advanced and automated approach to audio character type swapping compared to traditional manual methods.

Question 2: Can this technology be applied to real-time audio processing? While the current application focuses on offline processing, with further development, real-time audio character type swapping could be possible.


Original Abstract Submitted

a messaging system for audio character type swapping. methods of audio character type swapping include receiving input audio data having a first characteristic and transforming the input audio data to an input image where the input image represents the frequencies and intensities of the audio. the methods further include processing the input image using a convolutional neural network (cnn) to generate an output image and transforming the output image to output audio data, the output audio data having a second characteristic. the input audio and output audio may include vocals. the first characteristics may indicate a male voice and the second characteristics may indicate a female voice. the cnn is trained together with another cnn that changes input audio having the second characteristic to audio having the first characteristic. the cnns are trained using discriminator cnns that determine whether audio has a first characteristic or a second characteristic.