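Q-learning: a reinforcement learning algorithm that works by learning an action-value function, which gives the expected utility of taking a given action in a given state and following a fixed policy thereafter.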
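As a rough illustration of the definition above, here is a minimal sketch of tabular Q-learning. The corridor environment, the function name `q_learning`, and all parameter values are assumptions made for this example only; they do not come from the entry itself.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.5, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor.

    States are 0..n_states-1. Action 0 moves left, action 1 moves right.
    Entering the rightmost state yields reward 1 and ends the episode.
    Returns the learned table q[state][action].
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # action-value table
    goal = n_states - 1

    for _ in range(episodes):
        s = 0
        while s != goal:
            # Epsilon-greedy choice; ties are broken at random.
            if rng.random() < epsilon or q[s][0] == q[s][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1

            # Deterministic transition (assumed toy dynamics).
            s_next = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s_next == goal else 0.0

            # Q-learning update: move Q(s, a) toward
            # r + gamma * max over a' of Q(s_next, a').
            best_next = 0.0 if s_next == goal else max(q[s_next])
            q[s][a] += alpha * (r + gamma * best_next - q[s][a])
            s = s_next
    return q


if __name__ == "__main__":
    table = q_learning()
    for state, (left, right) in enumerate(table):
        print(state, round(left, 3), round(right, 3))
```

Once the table has been learned, a greedy policy can be read off by picking, in each state, the action with the larger entry; in this toy corridor that should be "right" in every non-terminal state.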
Clearly defining who may act as plaintiff and setting reasonable conditions for plaintiffs are an important foundation for realizing the value and function of the shareholder derivative action system.
明确界定代表诉讼原告的范围,设置合理的股东代表诉讼原告条件是充分发挥股东代表诉讼制度价值功能的重要基础。
Starting from the social action value system, the author makes a preliminary study of the socially dominant action value system and examines its function in architectural programming.
作者从社会行为价值系统着手,初步探索对社会主导行为价值系统的研究并把握其在建筑策划中所起的作用。
From the perspectives of tortious intent, action, breach of duty, value, and function, holding all gang members jointly liable can be explained as both just and reasonable.
团伙侵权行为由全体团伙成员承担连带责任,从侵权故意角度、行为角度、义务违反角度、价值角度及功能角度都具有解释上的正当性及合理性。