ALGORITHM SYSTEM OF DEEP REINFORCEMENT LEARNING AND ALGORITHM METHOD THEREOF

Abstract: an algorithm method for deep reinforcement learning includes initializing an environment and a model; executing an experience collection process and a network update process in parallel, and determining whether the experience collection process and the network update process have reached a termination condition; and continuing executing the experience collection process and the network update process in parallel in response to neither of the experience collection process and the network update processes has met the termination conditions; and stopping executing the experience collection process and the network update process in response to one of the experience collection processes and the network update process having met the termination conditions. the experience collection process includes obtaining a current state of the environment; calculating to determine the current action based on the current observation values according to a current policy of the model; and returning the current action to the environment.

Inventor(s): Chia-Hsiang Yang, Shih-Hao Chen, Chih-Wei Liu

CPC Classification: G06N3/092 (Reinforcement learning)

Search for rejections for patent application number 20250165793