Apple Inc. (20240314509). Extracting Ambience From A Stereo Input simplified abstract

From WikiPatents
Jump to navigation Jump to search

Extracting Ambience From A Stereo Input

Organization Name

Apple Inc.

Inventor(s)

Ismael H. Nawfal of Redondo Beach CA (US)

Mehrez Souden of Los Angeles CA (US)

Juha O. Merimaa of San Mateo CA (US)

Extracting Ambience From A Stereo Input - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240314509 titled 'Extracting Ambience From A Stereo Input

Simplified Explanation: The patent application describes a method for converting first order ambisonics (FOA) audio into a desired higher order ambisonics (HOA) format using machine learning.

  • **Key Features and Innovation:**
   * Processor formats FOA audio signals into audio frames.
   * Machine learning model reformat the audio into desired HOA format.
   * Output audio in HOA format can be rendered in a playback format of choice.
  • **Potential Applications:**
   * Virtual reality audio experiences.
   * Immersive gaming soundscapes.
   * 3D audio production in film and television.
  • **Problems Solved:**
   * Simplifies the conversion process from FOA to HOA.
   * Enhances the realism and immersion of audio content.
  • **Benefits:**
   * Improved audio quality.
   * Seamless integration of different ambisonics formats.
   * Enhanced user experience in audio applications.
  • **Commercial Applications:**
   * Audio software development for entertainment industry.
   * Virtual reality and augmented reality platforms.
   * Audio production tools for professionals.
  • **Prior Art:**
   Research existing patents related to ambisonics audio processing and machine learning in audio technology.
  • **Frequently Updated Research:**
   Stay informed on advancements in machine learning algorithms for audio processing and ambisonics technology.

Questions about Ambisonics Audio Processing: 1. How does machine learning improve the conversion process from FOA to HOA audio formats? 2. What are the potential limitations of using machine learning in audio processing technologies?


Original Abstract Submitted

a sound scene is represented as first order ambisonics (foa) audio. a processor formats each signal of the foa audio to a stream of audio frames, provides the formatted foa audio to a machine learning model that reformats the formatted foa audio in a target or desired higher order ambisonics (hoa) format, and obtains output audio of the sound scene in the desired hoa format from the machine learning model. the output audio in the desired hoa format may then be rendered according to a playback audio format of choice. other aspects are also described and claimed.