The data cleansing pattern is defined as standardizing, cleaning up, and ultimately matching (or de-duplicating) records based on the content of free-form text fields.
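
As a rough illustration of that definition, the sketch below standardizes a free-form text field into a comparison key and then drops records whose keys collide. The function names, the `company` field, and the sample records are all hypothetical, and exact matching on a normalized key is only one simple way to match records; it is not taken from the source.

```python
import re
import unicodedata


def normalize(text: str) -> str:
    """Standardize a free-form text value into a comparison key."""
    # Fold Unicode compatibility forms (e.g. full-width letters) and case.
    text = unicodedata.normalize("NFKC", text).casefold()
    # Replace punctuation with spaces, then collapse runs of whitespace.
    text = re.sub(r"[^\w\s]", " ", text)
    return " ".join(text.split())


def deduplicate(records, field):
    """Keep the first record seen for each normalized value of `field`."""
    seen = set()
    unique = []
    for record in records:
        key = normalize(record[field])
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique


# Hypothetical sample data: records 1 and 2 differ only in formatting.
records = [
    {"id": 1, "company": "Acme, Inc."},
    {"id": 2, "company": "  ACME  inc "},
    {"id": 3, "company": "Globex Corp"},
]
unique_records = deduplicate(records, "company")
```

Real matching often needs fuzzier comparison (for example, tolerating misspellings), which this sketch deliberately leaves out.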
For single-source data, these default values can be standardized by assigning mapped values from a lookup table, or by applying a defined pattern or algorithm to the data.
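
The two approaches named here, a lookup table and a defined pattern, might be sketched as follows. The table contents, the placeholder set, the 10-digit phone format, and the function names are assumptions made for illustration only.

```python
import re
from typing import Optional

# Hypothetical lookup table: known variants -> canonical value.
COUNTRY_LOOKUP = {
    "usa": "US",
    "u.s.": "US",
    "united states": "US",
    "uk": "GB",
    "united kingdom": "GB",
}

# Hypothetical placeholder strings that stand in for "no value".
DEFAULT_PLACEHOLDERS = {"", "n/a", "none", "unknown"}


def standardize_country(raw: str) -> Optional[str]:
    """Lookup-table approach: map known variants to one canonical value."""
    cleaned = raw.strip()
    key = cleaned.lower()
    if key in DEFAULT_PLACEHOLDERS:
        return None
    # Values missing from the table pass through unchanged (trimmed).
    return COUNTRY_LOOKUP.get(key, cleaned)


def standardize_phone(raw: str) -> Optional[str]:
    """Pattern approach: reformat any 10-digit number as NNN-NNN-NNNN."""
    digits = re.sub(r"\D", "", raw)
    if len(digits) != 10:
        return None  # Does not fit the assumed pattern.
    return f"{digits[:3]}-{digits[3:6]}-{digits[6:]}"
```

For example, `standardize_country(" USA ")` yields `"US"`, and `standardize_phone("(555) 123-4567")` yields `"555-123-4567"`.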
Part of defining the model was standardizing on a set of metadata properties (such as title, description, and authors) that are common to all content types.