之后对每条标题语料进行加工标注,最后建成可供检索和统计的小型网络体育新闻标题语料库。
After each title on the corpus of words with Mark sentence carried out, final and complete statistics are available for retrieval of small sports news network corpus title.
在TDT5语料库上的实验表明,该算法提高了话题检测的正确率,降低了新闻报道数据处理过程中的计算开销。
The experiments on TDT5 corpora indicated that: the new algorithm improved the accuracy of topic detection and decreased the computational overhead in the process of news data processing.
应用推荐