Compute the distance from each data sample to the cluster center (our randomly selected data row), using the least-squares method of distance calculation, that is, the sum of squared differences across attributes.
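The distance step described above can be sketched in a few lines of Python. This is a minimal illustration, not the tool's own implementation: the function names `squared_distance` and `distances_to_center` and the sample rows are hypothetical, and it assumes every attribute is numeric and already on a comparable scale.

```python
import random


def squared_distance(row, center):
    """Sum of squared attribute differences between one row and the center."""
    return sum((a - b) ** 2 for a, b in zip(row, center))


def distances_to_center(rows, center):
    """Squared distance from every data row to a single cluster center."""
    return [squared_distance(row, center) for row in rows]


if __name__ == "__main__":
    # Hypothetical data rows; each row is one data sample.
    rows = [[1.0, 2.0], [4.0, 6.0], [0.0, 0.0]]
    # The cluster center is one randomly selected data row.
    center = random.choice(rows)
    print("center:", center)
    print("distances:", distances_to_center(rows, center))
```

Taking the square root of each value would give the ordinary Euclidean distance. Skipping it does not change which center is closest to a given row.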
Future articles in this series will touch upon other methods of mining data, including clustering, Nearest Neighbor, and classification trees.
Click Choose and select SimpleKMeans from the choices that appear (this will be our preferred method of clustering for this article).