It's a long sequence of instructions, it starts at the beginning, walks through, may jump around a little bit, but eventually comes down at the end. It's okay for the things you're doing for the early problem sets.
这是一长串指令,从开端开始,向前步行,偶尔来回跳一会儿,但是最终会走到结尾,这些足以解决最初级的问题。
So, imagine the pig has a heavy collar and to reward the pig for walking forward you might remove the heavy collar.
所以,想象这头猪带着很重的项圈,作为对这头猪向前走的奖励,你可以把项圈拿掉。
He began walking very fast.
他开始箭步向前走。
You don't just want your pig to walk forward.
你不只想让这头猪向前走。
And if you do that over the fullness of time, your reinforcement and punishment will give rise to a pig who walks forward.
如果你长时间的这么做,你的强化和惩罚,就会使这头猪形成向前走的行为。
So, you reinforce the pig for walking forward and you punish the pig for walking backward.
那你就对这头猪向前走的行为给予强化,如果它向后退,你就惩罚它。
Suppose you want the pig to walk forward.
假设你想让这头猪向前走。
应用推荐