For the infinite-horizon case with continuous discounting, an optimal repair and replacement policy is proved to exist, one that maximizes the equipment's expected discounted net income.