• Learning is of great importance in reinforcement learning.

    学习一种重要强化学习算法。

    youdao

  • Without reinforcement, learning is only short-term and easily lost.

    没有巩固学习只能短期的,很快遗忘的。

    youdao

  • Can you explain the AI technique called reinforcement learning?

    解释一下什么是“强化学习技术吗?

    youdao

  • What makes a task more appropriate for incorporating reinforcement learning?

    什么样任务适合应用强化学习技术

    youdao

  • What are the differences between supervised learning and reinforcement learning?

    监督学习强化学习区别什么

    youdao

  • This sample graph is from a simple reinforcement learning application that uses Q-learning.

    这个示例是从使用Q学习一个简单增强式学习应用程序中得到的。

    youdao

  • Contemporary theories of reinforcement learning are rooted in the dopaminergic reward system.

    当代强化学习理论基于多巴奖赏系统

    youdao

  • The former is a new approach that combines reinforcement learning with feedback control.

    基于强化学习的多指手控制方法,方法反馈控制强化学习相结合。

    youdao

  • Reinforcement learning (RL) was applied to motion planning for dynamic manipulation tasks.

    提出增强学习RL)解决机器人动态操作任务运动规划方法

    youdao

  • An average-reward reinforcement learning algorithm for controlled Markov chains is presented.

    讨论平均准则控制马氏强化学习算法

    youdao

  • This paper adopts a reinforcement learning method to implement a cooperation strategy for robot soccer.

    利用强化学习方法实现足球机器人协作策略。

    youdao

  • MAXQ, a hierarchical reinforcement learning method for multi-agent systems, has been proposed in recent years.

    MAXQ分层智能学习方法近年来被提出种新方法。

    youdao

  • A simulated robot car learns optimal navigation strategies through a reinforcement learning algorithm.

    说明:模拟智能机器小车通过强化学习算法,学习最优导航策略

    youdao

  • Research on local path planning of mobile robot based on Q reinforcement learning and CMAC neural networks.

    基于Q强化学习CMAC神经网络移动机器人局部路径规划研究

    youdao

  • Several approaches applying reinforcement learning techniques to game playing have been described in the literature.

    强化学习技术运用于游戏集中方法文献都有记载

    youdao

  • This paper discusses the reinforcement learning (RL) algorithm and its application to learning technical actions for soccer robots.

    主要研究了强化学习算法及其机器人足球比赛技术动作学习问题中的应用

    youdao

  • The thesis mainly focuses on dynamic scheduling methods based on average-reward reinforcement learning algorithms.

    论文主要研究基于平均强化学习算法的动态调度方法

    youdao

  • Reinforcement learning has the ability to learn from experience, as opposed to supervised learning, which learns from examples.

    监督学习范例中学习的方式不同,强化学习不需要先验知识,而是具有经验学习能力

    youdao

  • For vector-controlled AC drive systems, the thesis presents a fuzzy neural network speed controller based on reinforcement learning.

    针对矢量控制交流调速系统该文提出并设计了一种基于再励学习的模糊神经网络速度控制器

    youdao

  • It is reasonable to adopt average-reward reinforcement learning algorithms for solving cyclical tasks with absorbing goal states.

    对于有吸收目标状态循环任务,比较合理方法采用基于平均报酬模型的强化学习

    youdao

  • A reinforcement learning algorithm based on process reward and prioritized sweeping is presented as an interference-resolution strategy.

    本文提出了基于过程奖赏优先扫除强化学习算法作为多机器人系统的冲突消解策略

    youdao

  • On the basis of theoretical analysis, the cooperative game reinforcement learning method is proposed and its convergence is proved.

    理论分析基础上提出协同博弈强化学习算法证明了算法的收敛性。

    youdao

  • Reinforcement learning is an important machine learning method. However, slow convergence has been one of its main problems in practice.

    强化学习一种重要机器学习方法然而实际应用中收敛速度缓慢是其主要不足之一

    youdao

  • Reinforcement learning based on Markov decision processes is a form of online learning that can be applied to single-agent environments.

    基于马尔科夫过程强化学习作为在线学习方式,能够好地应用智能体环境中。

    youdao

  • This characteristic of reinforcement learning inevitably increases the learning difficulty for an intelligent system, and the learning time also grows.

    强化学习这种特性必然增加智能系统的困难性,学习时间增长

    youdao

  • In this paper, the approximation theorem of average-reward reinforcement learning is proven by means of the theory of performance potentials.

    文中基于性能理论证明平均奖赏强化学习逼近定理

    youdao

  • Reinforcement learning is a common technique for this scenario as well as the more traditional scenario of actually learning the utility function.

    强化学习这种情况的常用技术更多传统情形下需要使用效用函数

    youdao

  • Here the computational principle is reinforcement learning and active exploration, which may also be behind learning motor movements in an infant.

    在这里计算原理加强学习过程主动探索过程这些也许也是婴儿学习动机背后原因。

    youdao

  • Reinforcement learning does not need a priori knowledge and improves its behavior policy with knowledge obtained by interaction with the environment.

    强化学习需要先验知识,而是通过环境的不断交互获得知识,改进行为策略具有自学习的能力。

    youdao
