经过分词处理的大型汉语语料库是进行语言学和计算语言学研究的重要资源。
A large-scale, word-segmented Chinese corpus is an important resource for research in both linguistics and computational linguistics.
在筛选出的文本中,经过分词、去除停用词等处理后,选取二元词串作为特征;
From the selected texts, after Chinese word segmentation, stop-word removal, and other preprocessing, we select word bigrams as features.
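The preprocessing described in this sentence can be illustrated with a minimal sketch. It assumes the text has already been segmented into words by some segmenter; the token list, the stop-word set, and the function name `bigram_features` are hypothetical and chosen only for illustration, not taken from the source.

```python
from collections import Counter

def bigram_features(tokens, stop_words):
    """Drop stop words, then count adjacent word pairs (word bigrams)."""
    kept = [t for t in tokens if t not in stop_words]
    # zip pairs each remaining word with the word that follows it
    return Counter(zip(kept, kept[1:]))

# Hypothetical pre-segmented sentence and stop-word list, for illustration only.
tokens = ["我们", "在", "文本", "中", "选取", "二元", "词串", "作为", "特征"]
stop_words = {"在", "中", "的"}

features = bigram_features(tokens, stop_words)
for pair, count in features.items():
    print(pair, count)
```

Note that removing stop words before pairing makes words adjacent that were not adjacent in the original text; whether that is desirable depends on the task.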