模糊数学方法的引用,使该模型不但能处理矛盾样本,而且有信息优化处理的功能。
By introducing fuzzy mathematics methods, the model can not only handle contradictory samples but also optimize the processing of information.
本文依据冗余网页的特点引入模糊匹配的思想,利用网页文本的内容、结构信息,提出了基于特征串的中文网页的快速去重算法,同时对算法进行了优化处理。
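Based on the characteristics of redundant web pages, this paper introduces the idea of fuzzy matching and, using the content and structural information of web page text, proposes a fast feature-string-based deduplication algorithm for Chinese web pages; the algorithm is also optimized.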