This paper is concerned with the problem of a novel Q-learning algorithm for solving optimal cost function.
该文利用求解最优费用函数的方法给出了一种新的Q学习算法。
The algorithm uses the thought of delamination, makes local planning and global planning combination, and improves cost function, finally with ensuring optimal trajectory increases search efficiency.
该算法采用分层思想,将局部规划与全局规划相结合,并对代价函数进行了改进,在保证航迹优化的基础上,提高了搜索效率。
Constructing a simple network and converting the allocation problem into the min-cost max-flow in the network, we have developed an optimal algorithm for the allocation problem.
我们构造了一个简单网络,将布点问题转化为该网络中的最小费用最大流问题,从而给出了求解布点问题的最优性算法。
Taking minimizing annual operating cost as an object function, the method employed hereditary algorithm to get optimal compensatory places (non-node) and capacity.
该算法以年运行费用最小为目标函数,运用遗传算法求出最佳的补偿地点(非节点)和最优补偿容量。
The paper discusses a nonregular cost function and its optimum criterion, presenting an algorithm of constructing and optimal alphabetic binary tree under this criterion.
本文讨论一种非正则的评价函数和它的最优化的一种准则,给出在这种准则下构造按字典次序的最优二元树的一种算法。
The authors put forward a multiple time period EOQ inventory policy with cost changes, deal with its optimal structure and provide an optimal algorithm for its solution.
提出了一种基于两层遗传算法的多时段无功优化方法,将复杂的无功优化问题转化为多个时段静态无功优化的并行处理问题。
The convex optimization algorithm was used to get the minima upper bound of performance cost and parameter of optimal minimax controller.
引入凸优化算法,求解使闭环系统渐近稳定且性能指标上界最小的最优控制器参数。
Secondly, the model of the cost objective optimization was established, and the general optimal control law and control algorithm were proposed.
其次,建立了成本目标优化模型,提出控制法则并给出控制算法;
Under the constraint of the optimal resource cost, an algorithm based on resource competition chain was also promoted to update the float information of the uncritical activities.
在最优耗费的约束下,还给出了一个基于资源竞争链的浮动信息更新算法,以便更新各活动的浮动信息。
Under the constraint of the optimal resource cost, an algorithm based on resource competition chain was also promoted to update the float information of the uncritical activities.
在最优耗费的约束下,还给出了一个基于资源竞争链的浮动信息更新算法,以便更新各活动的浮动信息。
应用推荐