Dolby Laboratories Licensing Corporation (20250006208). AUDIO CONTENT GENERATION AND CLASSIFICATION
Contents
AUDIO CONTENT GENERATION AND CLASSIFICATION
Organization Name
Dolby Laboratories Licensing Corporation
Inventor(s)
Brenton James Potter of New South Wales (AU)
Hadis Nosrati of New South Wales (AU)
AUDIO CONTENT GENERATION AND CLASSIFICATION
This abstract first appeared for US patent application 20250006208 titled 'AUDIO CONTENT GENERATION AND CLASSIFICATION
Original Abstract Submitted
some disclosed methods involve receiving audio data of at least a first audio data type and a second audio data type, including audio signals and associated spatial data indicating intended perceived spatial positions for the audio signals, determining at least a first feature type from the audio data and applying a positional encoding process to the audio data, to produce encoded audio data. the encoded audio data may include representations of at least the spatial data and the first feature type in first embedding vectors of an embedding dimension. some methods may involve training a neural network, based on the encoded audio data, to transform audio data from an input audio data type having an input spatial data type to a transformed audio data type having a transformed spatial data type. some methods may involve training a neural network to identify an input audio data type.