The models are analyzed and an optimal policy is obtained.
并对模型进行了分析,得出了最优决策。
Certain important properties of an optimal policy in m (c) for a continuous time discounted Markov decision model are studied.
本文研究了连续时间马氏决策规划折扣模型在(c)上最优策略的若干重要性质和它的结构。
It is shown that the randomized stationary policy is an optimal policy in m (c) if and only if it is convex combination of some deterministic stationary optimal policies.
特别是证明了:一随机平稳策略,它在(c)上是最优的充要条件是它可表为若干个决定性平稳最优策略的凸组合。
应用推荐