In this type of learning, the goal is not to maximize a utility function but to find similarities in the training data.
Once these different functions (such as the probability distributions) are learned, choosing the correct action is simply a matter of deciding which action maximizes the agent's "expected utility".
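As a rough illustration of the sentence above, the following minimal sketch picks the action with the highest expected utility. The action names, outcome probabilities, and utility values are hypothetical and chosen only for the example; they stand in for whatever distributions an agent has actually learned.

```python
# Hypothetical learned distributions: P(outcome | action) for each action.
OUTCOME_PROBS = {
    "take_umbrella": {"dry": 0.9, "wet": 0.1},
    "no_umbrella": {"dry": 0.6, "wet": 0.4},
}

# Hypothetical utility assigned to each outcome.
UTILITY = {"dry": 1.0, "wet": -2.0}


def expected_utility(probs, utility):
    """Sum of utility(outcome) weighted by the probability of that outcome."""
    return sum(p * utility[outcome] for outcome, p in probs.items())


def best_action(outcome_probs, utility):
    """Return the action whose outcome distribution has the highest expected utility."""
    return max(
        outcome_probs,
        key=lambda action: expected_utility(outcome_probs[action], utility),
    )


if __name__ == "__main__":
    for action, probs in OUTCOME_PROBS.items():
        print(action, expected_utility(probs, UTILITY))
    print("chosen:", best_action(OUTCOME_PROBS, UTILITY))
```

With these assumed numbers, taking the umbrella has the higher expected utility, so it is the action chosen.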
The best operating point can be defined as the one that minimizes or maximizes the utility function.
Of course, we will also be talking about behavioral finance in this course, and at times we'll be saying that the utility function concept isn't always right: the idea that people are actually maximizing expected utility might not be entirely accurate.