• 学习一种重要强化学习算法

    Learning is of great importance in reinforcement learning.

    youdao

  • 讨论平均准则控制马氏强化学习算法

    An average reward reinforcement learning algorithm for control Markov chains is presented.

    youdao

  • 论文主要研究基于平均强化学习算法动态调度方法

    The thesis mainly focuses on the dynamic scheduling method based on the averaged rewards reinforcement learning algorithms.

    youdao

  • 传统强化学习算法只能解决离散状态空间动作空间的学习问题。

    Conventional reinforcement algorithms only deal with discrete state Spaces and discrete action Spaces.

    youdao

  • 说明:模拟智能机器小车通过强化学习算法学习最优导航策略

    Simulation machine car through reinforcement learning algorithm, learning optimal navigation strategies.

    youdao

  • 目前主流的强化学习算法Q学习算法Q学习本身存在一些问题

    Q learning algorithm is the most popular reinforcement learning algorithm, but the algorithm exist some problems.

    youdao

  • 多代理体技术实现教学个性化强化学习算法使得教学策略具有智能化

    Multi-Agent technology achieves the personalized in ITS, and reinforcement learning algorithm makes teaching strategies with the intelligent.

    youdao

  • 主要研究强化学习算法及其机器人足球比赛技术动作学习问题中的应用

    This paper discusses reinforcement learning(RL)algorithm and its application to technical action learning of soccer robot.

    youdao

  • 理论分析基础上提出协同博弈强化学习算法证明了算法收敛性。

    On the basis of theoretical analysis, the cooperative game reinforcement learning method is proposed and its convergence is proved.

    youdao

  • 本文提出了基于过程奖赏优先扫除强化学习算法作为多机器人系统的冲突消解策略

    A reinforcement learning algorithm based on process reward and prioritized sweeping is presented as interference solving strategy.

    youdao

  • Q强化学习算法应用移动机器人局部路径规划,解决移动机器人在复杂环境中的局部路径规划问题

    In this paper Q reinforcement learning algorithm is adopted for mobile robot local path planning. It makes mobile robot resolve the problem of local path planning in a complex environment.

    youdao

  • 论文提出模糊强化学习算法,通过模糊推理系统连续状态空间映射连续的动作空间,然后通过学习得到一个完整的规则

    In this paper, we propose a fuzzy reinforcement algorithm, which map continuous state Spaces to continuous action Spaces by fuzzy inference system and then learn a rule base.

    youdao

  • 这种方法可以削减学习哈尔滨工程大学博士学位论文单元状态信息降低学习空间组合强度加快群体强化学习算法学习速度。

    The new algorithm can cut down the redundant state information, so that the composition intensity of learning space is decreased and the convergence of the learning course is accelerated.

    youdao

  • 为了提高智能体系统中的典型强化学习——Q -学习学习速度收敛速度,使学习过程充分利用环境信息,本文提出种基于经验知识的Q -学习算法

    In order to enhance the study speed and the convergence rate of Q-learning algorithm, an algorithm that based on the experience knowledge about environment is proposed.

    youdao

  • 算法采用强化学习迭代策略运行能够环境获取相应知识提高搜索能力

    By adopting the value iterative strategies of reinforcement learning, the algorithm can absorb the corresponding knowledge from its environment during its running and improve its search ability.

    youdao

  • 讨论了学习社会行为可行性必要性采用强化学习方法,给出多机器人传接合作搬运详细算法实现。

    The possibility and necessity of learning social behavior were discussed, and applying reinforcement learning and the above idea to multi-agent's learning relay cooperation in convey.

    youdao

  • 提出基于强化学习网络爬虫算法应用于餐饮类站点发现中。

    A network spider algorithm based on the reinforcement learning is proposed and deployed to discovery the web site of dinning.

    youdao

  • 主要采用强化学习的方法AUV进行控制决策综合Q学习算法BP神经网络人工场法对AUV进行避碰规划

    The reinforcement learning is adopted to control and decision for AUV, and Q-learning, BP neural net, artificial potential is integrated to avoidance planning for AUV.

    youdao

  • 单独行为对象中包含基于强化学习中的Q学习人工神经网络优化学习算法

    A single behavior object contains the algorithm for optimizing the demonstrated group of ACTS. The algorithm is using the Q-learning based on artificial nerve network.

    youdao

  • 本文提出了一种基于算法强化学习模型蚁群算法Q学习结合的思想。

    The paper proposes a model of reinforcement learning based on ant colony algorithm, namely the combination of ant colony algorithm and Q learning.

    youdao

  • 本文提出了一种基于算法强化学习模型蚁群算法Q学习结合的思想。

    The paper proposes a model of reinforcement learning based on ant colony algorithm, namely the combination of ant colony algorithm and Q learning.

    youdao

$firstVoiceSent
- 来自原声例句
小调查
请问您想要如何调整此模块?

感谢您的反馈,我们会尽快进行适当修改!
进来说说原因吧 确定
小调查
请问您想要如何调整此模块?

感谢您的反馈,我们会尽快进行适当修改!
进来说说原因吧 确定