A fuzzy Q learning algorithm is proposed in this dissertation, which map continuous state Spaces to continuous action Spaces by fuzzy inference system and then learn a rule base.
首先,提出一种模糊Q学习算法,通过模糊推理系统将连续的状态空间映射到连续的动作空间,然后通过学习得到一个完整的规则库。
For reinforcement learning control in continuous Spaces, a Q-learning method based on a self-organizing fuzzy RBF (radial basis function) network is proposed.
针对连续空间下的强化学习控制问题,提出了一种基于自组织模糊rbf网络的Q学习方法。
For reinforcement learning control in continuous Spaces, a Q-learning method based on a self-organizing fuzzy RBF (radial basis function) network is proposed.
针对连续空间下的强化学习控制问题,提出了一种基于自组织模糊rbf网络的Q学习方法。
应用推荐