• To shorten the learning time, the reward values in the Q-learning method are not constant: MFQLA tunes them according to the current state.

    为了改善学习时间Q学习方法中的奖励不是固定的,而是根据状态而变化。

    youdao

  • For reinforcement learning control in continuous spaces, a Q-learning method based on a self-organizing fuzzy RBF (radial basis function) network is proposed.

    针对连续空间下强化学习控制问题,提出了一种基于自组织模糊rbf网络Q学习方法

    youdao

  • To reduce the delay of vehicles passing through intersections, control strategies are built with a cloud model, and some parameters of the control model are tuned by the Q-learning method.

    为了减少车辆通过路口延误采用模型建立控制策略,运用Q -学习改进控制模型的参数

    youdao

  • The Q-learning method is used for intelligent path planning with magnetic markers, achieving shortest-path search while also handling obstacle avoidance, task scheduling, and so on.

    采用Q学习方法进行磁钉路径智能规划实现最短路径寻找,同时解决了任务调度及障等问题。

    youdao

  • Simulation results show that the signal control method based on Q-learning outperforms fixed-time control, actuated control, and signal control based on genetic algorithms.

    仿真实验结果表明基于Q -学习信号控制方法优于定时控制、感应式控制基于遗传算法的信号控制方法。

    youdao

  • Then the four main algorithms, namely dynamic programming, the Monte Carlo method, temporal-difference learning, and Q-learning, are presented in turn, and their differences and relationships are pointed out.

    动态规划蒙特卡罗算法时序差分算法Q-学习指出它们之间的区别联系

    youdao

  • Q-learning is a typical reinforcement learning (RL) method whose convergence is slow, especially as the state space and action space grow.

    学习一种典型强化学习,其学习效率低,尤其是状态空间决策空间较大时。

    youdao
