其实这是 一个典型的马尔科夫决策过程(Markov decision process,MDP)。 马尔科夫决策过程(Markov decision process,MDP):Agent 可感知到其环 境的不同状态集合,并且有它可执行的动作集合。
基于64个网页-相关网页
...只能选择其中几条进行检测、接入,第三章中提出了基于部分可观测马尔科夫决策过程(Partially Observable Markov Decision Process, POMDP)的接入算法,包括了一次决策和多次决策两种方案。
基于2个网页-相关网页
一下马尔科夫决策过程 Markov decision processes ; MDP
基于马尔科夫决策过程 Markov decision process ; MDP
将马尔科夫决策过程 Markov Decision Process
准马尔科夫决策过程 semi markov decision process
运筹学和马尔科夫决策过程处理以一个序列发生的行为。
Operations research and Markov decision processes tackled actions taking place in a sequence.
为了计划后面多个步骤,当可能的世界状态数目不算太多时,通常用到一种被称作马尔科夫决策过程的算法。
In order to plan ahead for multiple moves, an algorithm known as a markov decision process is commonly used when there are only a reasonably small group of possible world states.
该跨层设计将问题的求解建模为马尔科夫决策过程,利用线性规划推导出最优的自适应传输策略。
The solution to the cross-layer design is modeled as a Markov decision process and utilizes the linear programming method to obtain the optimal adaptive transmission policy.
应用推荐