Deepmind technologies limited (20240256883). REINFORCEMENT LEARNING USING QUANTILE CREDIT ASSIGNMENT
REINFORCEMENT LEARNING USING QUANTILE CREDIT ASSIGNMENT
Organization Name
Inventor(s)
Mark Daniel Rowland of London GB
Theophane Guillaume Weber of London GB
REINFORCEMENT LEARNING USING QUANTILE CREDIT ASSIGNMENT
This abstract first appeared for US patent application 20240256883 titled 'REINFORCEMENT LEARNING USING QUANTILE CREDIT ASSIGNMENT
Original Abstract Submitted
methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network used to select actions to be performed by an agent interacting with an environment. implementations of the system can take into account a level of luck in the environment, and hence whilst learning can account for outcomes that were caused by external factors as well as those dependent on the actions of the agent.