...多步Q学习 [gap=1137]hastic dynamic programming;Markov chain;parallel simulation of heuristic policy;multi-step Q learning ...
基于10个网页-相关网页
multi-step q learning
多步q学习
以上为机器翻译结果,长、整句建议使用 人工翻译 。
To solve the problem of slow update speed in Q learning, a multi-step Q learning scheduling algorithm is proposed, in which the value function is updated based on the information in multiple steps.
针对任务调度的Q学习算法更新速度慢的问题,提出一种基于多步信息更新值函数的多步q学习调度算法。
youdao
应用推荐
模块上移
模块下移
不移动