数据清理模式定义为标准化、清理并最终基于自由格式文本字段的内容进行记录匹配(或消除重复项)。
The data cleansing pattern is defined as the standardization, clean-up, and ultimately, matching (or de-duplicating) records based on the content of freeform text fields.
为了使处理更简单,我添加了一个类型字段,可以用来明确定义的记录类型。
To make processing even simpler, I have added a type field, which can be used to explicitly define the type of the record.
记录可以为此字段定义多个值。
应用推荐