构建中医汉英双语语料库平台并介绍其主要功能。
This paper develops a platform of Chinese-English bilingual corpus for Chinese medicine.
双语语料对齐是自然语言处理的一个重要研究课题。
Align bilingual corpus is a natural language processing important research subject.
大规模双语语料库的建设是进行基于语料库研究的基础。
Building a large scale of bilingual corpus is the foundation of research on corpus.
对利用汉英双语语料进行翻译等价词对抽取进行了探索。
This paper examines how the translation equivalent pairs could be extracted from parallel corpus.
第三,在双语语料库的基础上进行了双语组块获取的研究。
Third, we have conducted the research to the acquisition of bilingual chunk on bilingual corpus.
介绍了使用混合策略从汉英双语语料库中抽取搭配的方法。
This paper introduces a new approach to extract collocation from Chinese-English parallel corpora using hybrid strategy.
双语语料库自动对齐是自然语言处理的一个重要研究课题。
Bilingual Corpus alignment of natural language processing is an important research topic.
主要思路为:(1)半自动构造了一个中英平行双语语料库;
This is done by the following steps: 1). semi-automatically constructing a parallel corpora;
网络上存在的大量双语资源,给构建大规模双语语料库提供了可能。
Large amount of bilingual resource on the Internet bring the probability of building a large scale of bilingual corpus.
统计已对齐和标注的双语语料中的名词和名词短语生成候选术语集。
The term candidate set is produced by statistical the nouns and noun phrases of both corpora.
该算法能基于双语语料,通过机器学习来自动进行语义聚类,生成词间相似度矩阵。
Based on bilingual corpora, the algorithm can produce words-similarity-matrix through machine learning.
为了支持一项正在进行的汉英机器翻译系统的开发,我们建立了一个汉英双语语料库。
To support an ongoing Chinese-English machine translation project, a Chinese English bilingual corpus is being set up.
采用《汉语同义词词林》和英汉双语语料库,通过“双语对齐”扩充了英汉词典的单词译文;
An alignment method which makes use of thesaurus and bilingual corpus is adopted and thus the number of Chinese translation of word in English Chinese dictionaries is enlarged.
如何通过现有的互译文本来建立大规模的双语语料库,对双语互译文本的加工成为至关重要的问题。
How to use the existing bilingual text to build the large scale of bilingual corpus made it important to process the bilingual text.
针对双语语料是开发统计机器翻译系统的重要资源,提出一种从网络中自动挖掘双语平行网页的方法。
Aiming at bilingual corpora is critical resources for developing statistical machine translation system, this paper presents a method which automatically mines bilingual parallel Web page form Web.
在当前的基于统计的翻译方法中,双语语料库的规模、词对齐的准确率对于翻译系统的性能有很大的影响。
Based on the current statistical machine translation, the size of corpus and the accuracy of word alignment mainly affect the performance of SMT systems.
双语语料库在基于实例的机器翻译、翻译知识的获取、双语词典的建立、词义消歧等领域有着重要的应用价值。
Bilingual corpus plays an important role in Example-base Machine translation (EBMT), acquirement of translation knowledge, construction of bilingual dictionary etc.
该系统中需要一个大规模、高质量的双语语料库,而要建设这样一个双语库,其中的关键技术就是双语句子对齐。
The system needs a large and high-quality Bilingual Corpus. To build such a Corpus, one of the key technologies is Bilingual Sentence Alignment.
基于双语语料库可自动抽取翻译等价对:利用4种常见的数学模型来计算任意两个词的共现频率,以共现频率的高低来获取翻译等价对。
The word translation probability was calculated by 4 common co-occurrence models in bilingual corpus, and the translation equivalence was extracted according to translation probability.
词语对齐是跨语言自然语言处理领域的一个基本问题,许多基于双语语料库的应用(如sbmt、EBMT、WSD、词典编纂)都需要词汇级别的对齐。
Word alignment is a basic problem of Cross-lingual Natural Language Processing. Many NLP tasks based on bilingual corpus such as SBMT, EBMT, WSD, Automated Dictionary Extraction need to align words.
基于平行语料抽取双语词典是一个很重要的研究方向。
The research on the bilingual dictionary extraction based on parallel corpora is an important direction.
本文就是希望通过建立基于双语句对语料库的词对齐模型将句对之间的信息加以改进。
This paper hope to set up a align model based on bilingual sentence pair base for making use of the hidden info among the sentence pairs, and improving it.
该方法首先利用翻译引擎将双语平行语料库自动转换为单语平行语料库,以此构建复述语料库并用于候选复述的抽取。
The method first constructs a paraphrase corpus by automatically translating a bilingual parallel corpus into a monolingual parallel corpus, from which candidate paraphrases for words are extracted.
除机器翻译方面的应用之外,平行语料库的建设对于双语词典编纂、词义消岐和跨语言信息检索也具有重要价值。
Moreover , the parallel corpus is valuable in machine translation , bilingual dictionary compilation , word sense disambiguation and cross - lingual information retrieval.
除机器翻译方面的应用之外,平行语料库的建设对于双语词典编纂、词义消岐和跨语言信息检索也具有重要价值。
Parallel corpus has valuable application in machine translation, bilingual dictionary compilation, word sense disambiguation and Cross-Lingual Information Retrieval.
本文提出的从大规模网页中抽取双语翻译对的方法是基于可比较语料的。
This dissertation presents a method on extracting bilingual translation tuples from a comparable corpus.
作为一项重要的基础资源,双语平行语料库在人工智能领域的研究起着举足轻重的作用。
As important elementary resources, bilingual parallel corpora play a crucial role in research of artificial intelligence.
本文阐述了汉文-维吾尔文双语对齐语料库的构建的一些技巧,通过实践提出了建设汉文-维吾尔文双语对齐语料库的方法。
This paper introduces certain skills of constructing Chinese and Uighur parallel corpus and put forward the methods of construction through practice.
双语对齐作为语料库加工过程中的关键技术,已经引起研究者的高度重视。
As the key technology during the course of building the corpus, Bilingual alignment technology is growing High recognition.
双语对齐作为语料库加工过程中的关键技术,已经引起研究者的高度重视。
As the key technology during the course of building the corpus, Bilingual alignment technology is growing High recognition.
应用推荐