• The method is illustrated and compared with other reinforcement learning algorithms.

    The simulation study compares this method with other reinforcement learning methods.

    youdao

  • The thesis focuses mainly on dynamic scheduling methods based on average-reward reinforcement learning algorithms.

    The thesis mainly studies dynamic scheduling methods based on average-reward reinforcement learning algorithms.

  • It is reasonable to adopt average-reward reinforcement learning algorithms for cyclical tasks with absorbing goal states.

    For cyclical tasks with absorbing goal states, a more reasonable approach is to use reinforcement learning based on the average-reward model.

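  • Because traditional reinforcement learning algorithms assume a Markovian environment, they cannot be applied directly to multi-agent systems.

    Because of a theoretical limitation of reinforcement learning, namely that the Markov process model no longer holds in multi-agent systems, reinforcement learning cannot be applied directly to cooperative learning problems among multiple agents.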
