US Patent Application 18023347. METHOD AND APPARATUS FOR OPTIMIZING OTN RESOURCES, COMPUTER DEVICE AND STORAGE MEDIUM simplified abstract
METHOD AND APPARATUS FOR OPTIMIZING OTN RESOURCES, COMPUTER DEVICE AND STORAGE MEDIUM
Organization Name
Inventor(s)
Dajiang Wang of Shenzhen, Guangdong (CN)
Youdao Ye of Shenzhen, Guangdong (CN)
Zhenyu Wang of Shenzhen, Guangdong (CN)
METHOD AND APPARATUS FOR OPTIMIZING OTN RESOURCES, COMPUTER DEVICE AND STORAGE MEDIUM - A simplified explanation of the abstract
This abstract first appeared for US patent application 18023347 titled 'METHOD AND APPARATUS FOR OPTIMIZING OTN RESOURCES, COMPUTER DEVICE AND STORAGE MEDIUM'.
Simplified Explanation
The present disclosure describes a method for optimizing OTN (Optical Transport Network) resources.
- In the current service creation state, the method uses an action policy to determine which service to create, and creates it.
- A timely reward is calculated for the current service creation state.
- The method then enters the next service creation state, repeating these steps until the episode ends.
- Based on the timely reward in each service creation state, an optimized objective policy parameter is calculated and updated for each state.
- This process is iterated for a preset number of episodes, calculating and updating the optimized objective policy parameter in each service creation state every time.
- A resultant optimized objective policy parameter is determined for each service creation state from the values calculated over the preset number of episodes.
- The action policy is updated according to the resultant optimized objective policy parameter in each service creation state.
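The steps above resemble an episodic reinforcement-learning loop, and a minimal Python sketch may help make them concrete. It is an illustration under stated assumptions, not the method claimed in the application. The assumptions are: the state is simply the index of the service being created, an action is an index into a hypothetical set of candidate options (for example, candidate routes), the "optimized objective policy parameter" is taken to be the average return observed for a state-action pair, and exploration is epsilon-greedy. The names run_episode, optimize, demo_reward, epsilon and gamma are all hypothetical and do not appear in the abstract.

```python
import random
from collections import defaultdict


def run_episode(policy, n_services, n_actions, reward_fn, epsilon, rng):
    """Create services one by one; return a list of (state, action, reward)."""
    trajectory = []
    for state in range(n_services):
        # Assumption: explore with probability epsilon, else follow the policy.
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)
        else:
            action = policy[state]
        # "Timely reward" for the service created in this state.
        trajectory.append((state, action, reward_fn(state, action)))
    return trajectory


def optimize(n_services, n_actions, reward_fn,
             episodes=500, epsilon=0.2, gamma=1.0, seed=0):
    """Run a preset number of episodes, then update the action policy once."""
    rng = random.Random(seed)
    policy = [0] * n_services      # initial action policy (arbitrary choice)
    returns = defaultdict(list)    # (state, action) -> returns seen so far

    for _ in range(episodes):
        trajectory = run_episode(policy, n_services, n_actions,
                                 reward_fn, epsilon, rng)
        g = 0.0
        # Walk the episode backwards to accumulate the discounted return.
        for state, action, reward in reversed(trajectory):
            g = reward + gamma * g
            returns[(state, action)].append(g)

    # Assumed "resultant" parameter: the mean return over all episodes.
    q = {key: sum(vals) / len(vals) for key, vals in returns.items()}

    # Update the action policy greedily with respect to the resultant values.
    for state in range(n_services):
        policy[state] = max(
            range(n_actions),
            key=lambda a: q.get((state, a), float("-inf")),
        )
    return policy, q


def demo_reward(state, action):
    """Toy reward, purely illustrative: option k is best for service k."""
    return 1.0 if action == state else 0.0
```

Calling optimize(3, 3, demo_reward) returns the updated policy together with the averaged values. A real OTN reward would depend on quantities such as bandwidth or wavelength usage, which the abstract does not specify.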
Original Abstract Submitted
The present disclosure provides a method for optimizing OTN resources, including: determining and creating, according to an action policy, a service to be created in a current service creation state, calculating a timely reward in the current service creation state, entering a next service creation state until an episode is ended, and calculating and updating, according to the timely reward in each service creation state, an optimized objective policy parameter in each service creation state; iterating a preset number of episodes to calculate and update the optimized objective policy parameter in each service creation state; determining, according to the optimized objective policy parameter in each service creation state in the preset number of episodes, a resultant optimized objective policy parameter in each service creation state; and updating the action policy according to the resultant optimized objective policy parameter in each service creation state.