I am sure you are using an NLP method that helps you retrieve data by proximity; you can then remove the noise based on your own experience.
The speed of Chinese word segmentation is very important for many Chinese NLP systems, such as word-based web search engines.
Word alignment is a basic problem in cross-lingual natural language processing. Many NLP tasks based on bilingual corpora, such as SBMT, EBMT, WSD, and automated dictionary extraction, require word-level alignment.