Dolby Laboratories Licensing Corporation (20250006208). AUDIO CONTENT GENERATION AND CLASSIFICATION

From WikiPatents
Jump to navigation Jump to search

AUDIO CONTENT GENERATION AND CLASSIFICATION

Organization Name

Dolby Laboratories Licensing Corporation

Inventor(s)

Brenton James Potter of New South Wales (AU)

Hadis Nosrati of New South Wales (AU)

AUDIO CONTENT GENERATION AND CLASSIFICATION

This abstract first appeared for US patent application 20250006208 titled 'AUDIO CONTENT GENERATION AND CLASSIFICATION



Original Abstract Submitted

some disclosed methods involve receiving audio data of at least a first audio data type and a second audio data type, including audio signals and associated spatial data indicating intended perceived spatial positions for the audio signals, determining at least a first feature type from the audio data and applying a positional encoding process to the audio data, to produce encoded audio data. the encoded audio data may include representations of at least the spatial data and the first feature type in first embedding vectors of an embedding dimension. some methods may involve training a neural network, based on the encoded audio data, to transform audio data from an input audio data type having an input spatial data type to a transformed audio data type having a transformed spatial data type. some methods may involve training a neural network to identify an input audio data type.