在筛选出的文本中,经过分词、去除停用词等处理后,选取二元词串作为特征;
In those texts, we select bigram as feature after Chinese word segmentation, deleting stop word and other process.
所构造的有向图可以作为机械分词、消除歧义以及进一步分析句子的基础。
The digraph generated can be used in automatic segmentation, ambiguity diminishing and sentence analyzing.
而主题提取是以中文分词作为第一步,分词质量直接影响到文献主题提取的质量。
Chinese word segmentation is always the first step of subject extraction. The quality of word segmentation is effective to the quality of text subject extraction.
而主题提取是以中文分词作为第一步,分词质量直接影响到文献主题提取的质量。
Chinese word segmentation is always the first step of subject extraction. The quality of word segmentation is effective to the quality of text subject extraction.
应用推荐