Traditional reinforcement learning models use a constant learning rate throughout the learning process, so the robot converges slowly and adapts poorly in unknown environments.