在一定条件下,我们证明随机对策有值函数,两个局中人相对于折扣报酬都有最优策略。
It is proved that under certain condition the stochastic game has a value and both players have optimal strategies for discounted rewards.
给出了“跳一停”策略存在的条件、最优费用函数、最佳控制的解析式及控制方法。
Also, the conditions of the "fail-stop" strategy, optimal cost function and optimal control function are given.
结论表明,捕食者的最优交易策略为时间的二次曲线形式,并且捕食交易利润是瞬时冲击系数的单调减函数。
The conclusion shows that the optimal strategy of predator is the quadratic equation of time and the profit is the descending function of temporary impact coefficient.
应用推荐