Certain important properties of an optimal policy in m (c) for a continuous time discounted Markov decision model are studied.
本文研究了连续时间马氏决策规划折扣模型在(c)上最优策略的若干重要性质和它的结构。
The solution to the cross-layer design is modeled as a Markov decision process and utilizes the linear programming method to obtain the optimal adaptive transmission policy.
该跨层设计将问题的求解建模为马尔科夫决策过程,利用线性规划推导出最优的自适应传输策略。
应用推荐