REINFORCEMENT LEARNING USING QUANTILE CREDIT ASSIGNMENT

Organization Name

DeepMind Technologies Limited

Inventor(s)

Thomas Mesnard of Paris FR

Remi Munos of London GB

Alaa Saade of Montreuil FR

Yunhao Tang of London GB

Mark Daniel Rowland of London GB

Theophane Guillaume Weber of London GB

Wenqi Chen of Cambridge MA US

This abstract first appeared for US patent application 20240256883 titled 'REINFORCEMENT LEARNING USING QUANTILE CREDIT ASSIGNMENT'.

Original Abstract Submitted

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network used to select actions to be performed by an agent interacting with an environment. Implementations of the system can take into account a level of luck in the environment and hence, while learning, can account for outcomes that were caused by external factors as well as those that depend on the actions of the agent.
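
The abstract gives no implementation details, so the following is only a toy sketch of the general idea of separating luck from the agent's contribution. It is not the claimed method. It assumes a hypothetical two-action problem in which each reward is the chosen action's mean plus external noise that is independent of the action, and it assumes the noise ("luck") can be ranked in hindsight. The names `simulate`, `quantile_bins`, and `advantages` are invented for this illustration. The sketch compares a single mean baseline with a baseline computed per luck-quantile bin.

```python
import random
import statistics


def simulate(n=2000, seed=0):
    """Toy data: reward = action mean + external luck (assumed setup)."""
    rng = random.Random(seed)
    means = {0: 0.0, 1: 1.0}  # hypothetical per-action mean rewards
    data = []
    for _ in range(n):
        action = rng.choice([0, 1])
        luck = rng.gauss(0.0, 2.0)  # external factor, independent of the action
        data.append((action, luck, means[action] + luck))
    return data


def quantile_bins(values, n_bins):
    """Map each value to a bin index based on its rank (empirical quantile level)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = rank * n_bins // len(values)
    return bins


def advantages(data, n_bins):
    """Reward minus the mean reward of samples with a similar luck level.

    With n_bins == 1 this reduces to an ordinary mean baseline.
    """
    rewards = [reward for _, _, reward in data]
    bins = quantile_bins([luck for _, luck, _ in data], n_bins)
    by_bin = {}
    for b, reward in zip(bins, rewards):
        by_bin.setdefault(b, []).append(reward)
    baseline = {b: statistics.fmean(v) for b, v in by_bin.items()}
    return [reward - baseline[b] for b, reward in zip(bins, rewards)]


if __name__ == "__main__":
    data = simulate()
    plain = advantages(data, n_bins=1)
    luck_aware = advantages(data, n_bins=20)
    print("variance, mean baseline:      ", statistics.pvariance(plain))
    print("variance, luck-aware baseline:", statistics.pvariance(luck_aware))
```

In this toy setting, the luck-aware baseline removes most of the reward variation that comes from the external noise, so the remaining signal mainly reflects which action was taken. Because the noise is independent of the action here, conditioning the baseline on it does not favor either action.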