GOOGLE LLC (20240233713). GENERATING AUDIO USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS simplified abstract

From WikiPatents
Jump to navigation Jump to search

GENERATING AUDIO USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS

Organization Name

GOOGLE LLC

Inventor(s)

Andrea Agostinelli of Zurich (CH)

Timo Immanuel Denk of Zurich (CH)

Antoine Caillon of Paris (FR)

Neil Zeghidour of Paris (FR)

Jesse Engel of Orinda CA (US)

Mauro Verzetti of Dübendorf (CH)

Christian Frank of Zurich (CH)

Zalán Borsos of Zurich (CH)

Matthew Sharifi of Kilchberg (CH)

Adam Joseph Roberts of Durham NC (US)

Marco Tagliasacchi of Kilchberg (CH)

GENERATING AUDIO USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240233713 titled 'GENERATING AUDIO USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS

Simplified Explanation: The patent application describes methods, systems, and apparatus for generating a prediction of an audio signal using neural networks.

  • **Key Features and Innovation:**
 * Utilizes an embedding neural network to map input to embedding tokens.
 * Generates a semantic representation of the audio signal.
 * Uses generative neural networks to create an acoustic representation of the audio signal.
 * Processes the acoustic representation with a decoder neural network to predict the audio signal.
  • **Potential Applications:**
 * Speech recognition technology.
 * Music composition and generation.
 * Audio enhancement in video editing.
  • **Problems Solved:**
 * Improves accuracy in predicting audio signals.
 * Enhances the quality of generated audio.
  • **Benefits:**
 * Efficient prediction of audio signals.
 * Enhanced audio generation capabilities.
 * Improved user experience in various applications.
  • **Commercial Applications:**
 * Title: "Advanced Audio Prediction Technology for Various Industries"
 * Potential commercial uses in music production, speech-to-text applications, and audio editing software.
 * Market implications include improved productivity and quality in audio-related industries.
  • **Prior Art:**
 * Researchers in the field of neural networks and audio signal processing.
 * Studies on generative neural networks for audio generation.
  • **Frequently Updated Research:**
 * Ongoing advancements in neural network technology for audio processing.
 * Research on improving the accuracy and efficiency of audio signal prediction.

Questions about Audio Signal Prediction Technology: 1. How does the use of neural networks improve the accuracy of audio signal prediction? 2. What are the potential limitations of using generative neural networks in audio signal processing?

Ensure the content is informative, engaging, and optimized for SEO to attract relevant traffic and provide valuable information on the technology described in the patent application.


Original Abstract Submitted

methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a prediction of an audio signal. one of the methods includes receiving a request to generate an audio signal conditioned on an input; processing the input using an embedding neural network to map the input to one or more embedding tokens; generating a semantic representation of the audio signal; generating, using one or more generative neural networks and conditioned on at least the semantic representation and the embedding tokens, an acoustic representation of the audio signal; and processing at least the acoustic representation using a decoder neural network to generate the prediction of the audio signal.