The technology of lexicography based on corpus has been the main method for contemporary lexicography.
基于语料库的词典编纂技术已经成为现代词典编纂的主流方法。
A method based on the bayes and machine readable dictionary was proposed, which could disambiguate by the training of a small-scale corpus and the definition of semantic in machine dictionary.
提出了一种基于贝叶斯分类与机读词典的多义词排歧方法,通过小规模语料库的训练和歧义词在机读词典中的语义定义来完成歧义的消除。
This paper uses a corpus with break indices based on C-TOBI. Applying supervised learning method, some useful attempts are made in the field of automatic break indices intonation.
本文采用了一个基于C TOBI的停顿指数标注的语料库,利用有指导的学习方法对自动停顿指数标注方面做了一些有益的探索。
应用推荐