平均报酬模型_双语例句

双语例句

原声例句

权威例句

go top 返回词典

对于有吸收目标状态的循环任务，比较合理的方法是采用基于平均报酬模型的强化学习。

It is rational to adopt the average reward reinforcement learning algorithms for solving the absorbing goal states cyclical tasks.

youdao
对于有吸收目标状态的循环任务，比较合理的方法是采用基于平均报酬模型的强化学习。

It is rational to adopt the average reward reinforcement learning algorithms for solving the absorbing goal states cyclical tasks.

youdao

应用推荐

$firstVoiceSent

- 来自原声例句

关于有道 Investors 有道智选官方博客技术博客诚聘英才联系我们站点地图网络举报 © 2025 网易公司隐私政策服务条款京ICP证080268号京ICP备10005211号

小调查

请问您想要如何调整此模块？

模块上移

模块下移

不移动

感谢您的反馈，我们会尽快进行适当修改！

进来说说原因吧确定

小调查

请问您想要如何调整此模块？

模块上移

模块下移

不移动

感谢您的反馈，我们会尽快进行适当修改！

进来说说原因吧确定