So what we see is, again perfectly reasonable statistical techniques, but not looking at things in the right way.
所以我们这里可以看到,还是很完美很有逻辑的统计学工具,但是他们没有用正确的方式去看待数据。
We could just collect a bunch of data. For a material .What's the volume it occupies at some pressure and temperature?
对一种物质我们可以得到一系列测量数据,在给定的温度和气压下,它的体积是什么?
You can compel the compiler to treat some value as a different type of value, at least if it makes intuitive sense that that should be possible.
你可以控制编译器处理,不同类型的数据,至少我们凭直觉,它应该是可能的。
We saw, associated with that primitive data, we have ways of taking data in and creating new kinds of data out, or new versions of data out, so we have operations.
我们可以取得输入的数据然后,创建新的类型的输出数据,或者新的版本的输出数据,这就是我们说的操作。
And one way we can get to that is by looking at two pieces of data.
一种可以得出答案的方法,就是观察2个数据。
And you can join on to the Red Sox nation on top of that, and part of being a good Red Sox fan is knowing the statistics of your team.
除此之外你们可以,加入红袜队的国度,作为一个好的红袜队球迷,也是了解你们支持球队的,数据的一部分。
They can buy into it if they buy into a $15,000 point-of-sales system.
他们可以买入这些数据0,比如价值15000美元的销售网点系统。
Because you don't know how much more time you've got-- You can make a guess based on statistics, but as we saw, there's wild unpredictability.
因为你不知道你剩下多少时间-,你可以根据数据进行猜测,但是我们都知道,不可预见性太难把握。
But you can imagine the inability to do that in a big population study with thousands and thousands of subjects, so we have to rely on data like this.
但你们可以想象在一个大样本研究中,不可能对数以千计的研究对象做这种培训,所以我们需要依靠像这样的数据
You can see the different points-- I've calculated this using data from 1983 until 2006-- and I computed all of the inputs to those equations that we just saw.
你可以观察不同的点-,我用从1983年到2006年的数据-,代入我们刚教授的等式,进行了计算。
Fine. But there are reasons to study development even if you are not interested in children because sometimes developmental studies and developmental data and developmental science can inform questions about adults.
没关系,但即使你对儿童不感兴趣,却依然有很多原因促使你去研究发展,因为发展的研究,数据以及科学,有时可以解释成人的问题
In other words there was not enough data on these things, in other words, with other conventional mortgages, you have data covering peace and war, prosperity and depression and so forth and you can follow these data back for decades.
换句话说,关于这些新增贷款没有足够数据,换种说法,和其他传统贷款相比,你有和平和战争时期的数据,有市场繁荣和萧条时的数据等等,你可以追述这些数据。
- If someone-- you're-- one of you're predecessor didn't already submit it, do follow the directions at top left which says, add calendar so that we can augment the data set even further.
如果-,除非前任代表没有提交相应信息,一定要遵循左上方的用法说明,添加日历,这样就可以进一步增大数据集。
And if you think about it, associated with each - one of those data types is a set of functions it's intended to apply to.
你会发现每个类型都有,与之对应的一个集合的操作,有的时候这些操作,有的时候某个操作可以,应用于多种数据类型。
So from measured equation of state data, or from a model like the ideal gas or the van der Waal's gas or another equation of state you know this.
所以,从测量的到的状态方程的数据,或者从状态方程模型比如理想气体方程,范德瓦尔斯方程或者其他状态方程,我们就可以知道。
I've added the ability to have more complex data structures here. But I dropped a hint in the first lecture about what you could computer with things. In fact if you think for a second about that list, you could ask what can I compute with just that set of constructs?
我添加了使用更复杂的数据结构的能力,但是我在第一节课给大家了了一个,关于你可以用什么来做计算的暗示,实际上如果你思考,这个列表一会儿的话,你会问我可以用?
So you have these at your disposal.
所以说你在处理数据的时候可以用这些常数。
Well, then, we could just use that for our equation of state.
然后我们就可以把这些数据,作为我们的状态方程。
You can take a problem that might be relatively intuitive to solve but when you scale this thing up as is increasingly the case in the web, in large data systems, and so forth, you actually have to now think smart, you actually have to think efficiently and you have to solve this problem effectively.
你可以把一个问题用比较直观的方法解决,但如果你把此类问题的数量增大,正如越来越多的互联网,和大规模数据系统中出现的问题等等,你应该考虑怎样才能更简便,怎样才能更高效,你应该用行之有效的方法处理问题。
You can see there are errors for both of these things.
你们可以看见,两项数据都有误差
They had a little bit of a problem around World War II and you can try to bridge the gap, but--anyway, there are people who have tried to sort this out.
他们在二战期间的数据可能有所缺失,不过可以试着弥补这个缺失,但是,无论如何,仍然有人想方设法整理这些数据
So historically, the course is a lot of sophomores who, like me perhaps, are realizing that they finally have time to explore beyond their own interests.
从往年数据上可以看出,上这个课程的大都是大学二年级的学生,可能跟当时的我一样,开始认识到我们终于有时间,来探究我们兴趣以外的事情。
Cause when you can actually manipulate a computer's memory at this low level, you can steal people's passwords, you can steal their data if you know how that memory is laid out.
因为即使是在这么低的权限下,你也能熟练控制电脑的存储器,你就可以窃取别人的秘密,如果知道相应数据位于哪里的话,你也能窃取别人的数据。
They might say my sample period was off, ... but that's what the theory-- ... using my data for the sample period that I computed-- the expected returns and co-variances says one should do.
他们可能说我的采样周期是有问题的,不过我的结果都是靠理论-,我采用自己收集的数据计算出-,预期收益和协方差可以用来指导我们的投资行为。
In the case of Burt Malkiel's data, more than 11% per year and in the case of Roger Ibbotson's data between 7% and 8% per year of those returns can be explained either by backfill bias or survivorship bias.
在伯特·麦基尔的数据中,超过11%的年平均收益,在罗杰·伊博森的数据中,7%到8%的年平均收益,可以用生存偏差或回填偏差来解释
And with this data, where students were able ; to implement last year their own E-trade-like website; whereby you have accounts and you log in your hand age of your users 10,000 virtual dollars and with them can they get stock quotes, by stocks, sell stocks and the like, all of this accomplished just after a few weeks time.
通过这些数据,学生们就可以做出类似电子商务的网站;,只要你有账号就能进入你的账户,里面有1万的虚拟美元,这样你就能去查询股票报价,进行买卖股票之类的事,这一切仅仅用了几个星期的时间就完成。
The green line is the building costs and you can see that building costs, since 1890, in real terms-- everything is corrected for inflation -have gone up a little bit since 1890, but not a whole lot.
绿线表示了建筑成本,你可以看到建筑成本,自1890年来,以实际值计算,所有的数据均进行了通胀修正,从1890年开始有一点点上涨,并非是大幅度上涨
It imports it into our database and does some fancy indexing as we'll call it later in the term to make searches more efficient.
将它导入到我们的数据库中,并进行一些变址,这将在后期会提到,它可以使查找更高效。
And I haven't said yet, how do I get that collection, but you could certainly conceptualize that, if I had that collection, that would be nice thing to do. That is a more common pattern.
也可以这么来操作,这是个更常用的模式,这基本上也就意味着,对于某些数据的集合,我想要一个循环机制。
And over the next couple of days, you'll see what we mean by this in detail.
就是抽象数据类型,你可以在网上看到这两个术语。
应用推荐