US Patent Application 18246024. SEAMLESS SCALABLE DECODING OF CHANNELS, OBJECTS, AND HOA AUDIO CONTENT simplified abstract

From WikiPatents

SEAMLESS SCALABLE DECODING OF CHANNELS, OBJECTS, AND HOA AUDIO CONTENT

Organization Name

Apple Inc.


Inventor(s)

Moo Young Kim of San Diego CA (US)

Dipanjan Sen of Dublin CA (US)

Eric Allamanche of Sunnyvale CA (US)

J. Kevin Calhoun of Santa Rosa CA (US)

Frank Baumgarte of Sunnyvale CA (US)

Sina Zamani of Cupertino CA (US)

Eric Day of San Jose CA (US)

SEAMLESS SCALABLE DECODING OF CHANNELS, OBJECTS, AND HOA AUDIO CONTENT - A simplified explanation of the abstract

This abstract first appeared for US patent application 18246024, titled 'SEAMLESS SCALABLE DECODING OF CHANNELS, OBJECTS, AND HOA AUDIO CONTENT'.

Simplified Explanation

- This patent application describes methods and systems for decoding immersive audio content that is encoded with different numbers of scene elements for channels, audio objects, higher-order ambisonics (HOA), and other sound field representations.
- The decoded audio is then rendered to the speaker configuration of a playback device.
- When the bit streams representing audio scenes have different mixtures of channels, objects, and/or HOA in consecutive frames, a fade-in of the new frame and fade-out of the old frame can be performed.
- Crossfading between consecutive frames can occur in the speaker layout after rendering, in the spatially decoded content type before rendering, or between the transport channels as the output of the baseline decoder but before spatial decoding and rendering.
- Crossfading can be achieved using an immediate fade-in and fade-out frame (IFFF) for the transition frame or by using an overlap-add synthesis technique such as time-domain aliasing cancellation (TDAC) of MDCT.
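
As an illustration of the fade-in/fade-out idea, the following is a minimal sketch of crossfading two frames that have already been rendered to the same speaker layout. It is not taken from the patent application: the function name `crossfade_frames`, the list-of-channels frame representation, and the raised-cosine fade curve are all assumptions chosen for the example.

```python
import math


def crossfade_frames(old_frame, new_frame):
    """Crossfade two rendered frames (hypothetical helper, not from the patent).

    Each frame is a list of speaker channels, and each channel is a list of
    samples. Both frames are assumed to use the same speaker layout and length.
    """
    if len(old_frame) != len(new_frame):
        raise ValueError("frames must have the same number of speaker channels")

    mixed_frame = []
    for old_ch, new_ch in zip(old_frame, new_frame):
        if len(old_ch) != len(new_ch):
            raise ValueError("channels must have the same number of samples")
        n = len(old_ch)
        mixed = []
        for i in range(n):
            t = (i + 0.5) / n                            # position in frame, 0..1
            fade_in = math.sin(0.5 * math.pi * t) ** 2   # assumed raised-cosine ramp
            fade_out = 1.0 - fade_in                     # gains always sum to 1
            mixed.append(fade_out * old_ch[i] + fade_in * new_ch[i])
        mixed_frame.append(mixed)
    return mixed_frame


# Example: a stereo frame of ones fading into a stereo frame of zeros.
old = [[1.0] * 8, [1.0] * 8]
new = [[0.0] * 8, [0.0] * 8]
print(crossfade_frames(old, new)[0])
```

Because the two gains sum to one at every sample, a signal that is identical in both frames passes through unchanged. The same mixing step could in principle be applied at either of the other two stages listed above (the spatially decoded content or the transport channels), provided the old and new frames share a format at that stage.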


Original Abstract Submitted

Disclosed are methods and systems for decoding immersive audio content encoded by an adaptive number of scene elements for channels, audio objects, higher-order ambisonics (HOA), and/or other sound field representations. The decoded audio is rendered to the speaker configuration of a playback device. For bit streams that represent audio scenes with a different mixture of channels, objects, and/or HOA in consecutive frames, fade-in of the new frame and fade-out of the old frame may be performed. Crossfading between consecutive frames may happen in the speaker layout after rendering, in the spatially decoded content type before rendering, or between the transport channels as the output of the baseline decoder but before spatial decoding and rendering. Crossfading may use an immediate fade-in and fade-out frame (IFFF) for the transition frame or may use an overlap-add synthesis technique such as time-domain aliasing cancellation (TDAC) of MDCT.
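
The overlap-add behaviour of TDAC mentioned in the abstract can be sketched with a textbook windowed MDCT. This is a generic illustration under stated assumptions, not the decoder described in the application: the sine window, the small hop size, and the helper names `sine_window`, `mdct`, `imdct`, and `overlap_add` are all chosen for the example.

```python
import math


def sine_window(hop):
    """Sine window of length 2*hop; satisfies w[n]^2 + w[n+hop]^2 = 1."""
    return [math.sin(math.pi * (n + 0.5) / (2 * hop)) for n in range(2 * hop)]


def mdct(block, window):
    """Windowed MDCT: 2*hop input samples -> hop coefficients."""
    hop = len(block) // 2
    n0 = hop / 2 + 0.5
    return [
        sum(window[n] * block[n] * math.cos(math.pi / hop * (n + n0) * (k + 0.5))
            for n in range(2 * hop))
        for k in range(hop)
    ]


def imdct(coeffs, window):
    """Windowed inverse MDCT: hop coefficients -> 2*hop aliased samples."""
    hop = len(coeffs)
    n0 = hop / 2 + 0.5
    return [
        (2.0 / hop) * window[n]
        * sum(coeffs[k] * math.cos(math.pi / hop * (n + n0) * (k + 0.5))
              for k in range(hop))
        for n in range(2 * hop)
    ]


def overlap_add(signal, hop):
    """Transform 50%-overlapping blocks and add the inverse transforms back."""
    window = sine_window(hop)
    out = [0.0] * len(signal)
    for start in range(0, len(signal) - 2 * hop + 1, hop):
        block = signal[start:start + 2 * hop]
        rec = imdct(mdct(block, window), window)
        for n, value in enumerate(rec):
            out[start + n] += value
    return out


hop = 8
signal = [math.sin(0.3 * i) + 0.1 * i for i in range(4 * hop)]
rebuilt = overlap_add(signal, hop)
# Samples covered by two blocks are recovered; the first and last hop are not.
max_err = max(abs(a - b) for a, b in zip(signal[hop:-hop], rebuilt[hop:-hop]))
print(max_err < 1e-9)
```

Each inverse-transformed block is shaped by the window, so it ramps up over its first half and down over its second half. Adding consecutive blocks therefore blends the outgoing frame into the incoming one while the time-domain aliasing terms cancel, which is why an overlap-add scheme of this kind can serve as a crossfade.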