在全文索引中建立文档的索引之前,管道将对文档进行处理,并且提供了自动化的摘要、文档格式转换和分类服务(请参见图 1)。
The pipeline processes documents before they are indexed in the full-text index, and it provides automated summarization, document format conversion, and categorization services (See Figure 1).
自动文档分类是信息处理技术的一个重要部分。
Document automatic classifying is an important application of the information processing technology.
其他自动处理可能采用其他结构化形式;例如,Technorati使用rel - tag微格式对其大量的博客文章进行分类。
Other automatic processing is possible with the other structured forms; for example, Technorati makes use of the rel-tag microformat to categorize its vast aggregation of blog posts.
应用推荐