One is OSVM-Q, online SVM is set for each exploration state. The other is OSVM-Q-1, only one online SVM is set for all state-action of CSPS system.
另一种是只设置一个在线支持向量机,用来逼近CSPS系统的所有状态-行动对的Q值函数的OSVM - Q - 1算法。
One is OSVM-Q, online SVM is set for each exploration state. The other is OSVM-Q-1, only one online SVM is set for all state-action of CSPS system.
另一种是只设置一个在线支持向量机,用来逼近CSPS系统的所有状态-行动对的Q值函数的OSVM - Q - 1算法。
应用推荐