对于基于词的搜索引擎等中文处理系统,分词速度要求较高。
The speed of Chinese word segmentation is very important for many Chinese NLP systems, such as web search engines based on words.
它具有化学结构检索和文字检索两大检索功能,并便于数据库维护、中文处理、图形存贮和使用方便等特点。
It is satisfied in maintenance of database, treatment of Chinese text and storing of graph. It is easy to use.
此知识库使用英语作为其自然语言处理语言,对于此场景,您需要将默认语言更改为中文。
The knowledge base uses English as its natural language processing language, so for this scenario you need to change the default language to Chinese.
换句话说,POSIXlocale有可能可以处理2字节或者3字节的日文或中文字符。
In other words, POSIX locales are potentially able to deal with 2 - and 3-byte Japanese and Chinese characters.
对当地网民的口味缺乏了解,在中文软件处理的弱点,反应缓慢。
Lack of understanding of local netizens' tastes, weakness in Chinese language software treatment, slow reactions.
Jython中文本文件的处理只有简单的三行代码:open()、readlines/writelines (taking alist)和close()。
Text File processing in Jython is as simple as 3 liners of open(),readlines/writelines (taking a list) and close().
汉字的计算机输入是中文信息处理的关键问题之一,而汉字计算机输入的数学模型对汉字的计算机输入的研究有重要的意义。
Chinese character inputting play a key role in the Chinese information processing, and the input model is very important for the study on the Chinese character input.
Percolator的一个关键设计理念是:提供对库中文档的随机访问,以实现对单个文档的处理,从而避免了像MapReduce那样对文档全集进行处理。
One of the key design concepts behind Percolator was to provide random access to the repository to process documents individually, and thus avoiding the global processing required by MapReduce.
专有名词的识别是中文信息处理领域的基础研究课题之一,也是尚未很好解决的一个问题。
The recognition of proper names (PNs) is one of the tasks on Chinese natural language processing research, and is also the question that still have not been resolved properly.
新术语的提取是中文信息处理领域的一个重要研究课题。
Extraction of new domain-specific terms is one of the important topics in Chinese natural language processing.
git - cola:用于处理git存储库中文件和目录的gui。
Git-cola. A GUI for working with files and directories in a git repository.
本文详细介绍在处理LED显示屏中文信息时使用红外遥控输入汉字的方法:区位码输入法和拼音输入法。
This article introduces the Chinese spelling input in LED display screen by using infrared remote controller, including area code input and spelling input.
根据中文信息处理的需要,把传统的有限自动机改造成了反向有限自动机。
To meet the need of Chinese information processing, the traditional finite automata is transformed to the reversed finite automata.
现代汉语文本自动分词是中文信息处理的重要基石,为此提供一个通用的分词接口是非常重要的。
Automatic word segmentation of modern Chinese text is the base of Chinese information processing. So a general purpose application interface for word segmentation is important.
希望对字母词语的规范和中文信息处理有所裨益。
Wishes to bestead the Chinese information processing and the lettered-words standardization.
汉字编码是中文信息处理的关键之一,也是最棘手的问题。
Chinese Character Coding is one of the keys to Chinese information processing, and is also the most difficult.
因此对复句语义进行研究为中文信息处理从字、词处理级提升到句处理级及句处理级以上的研究提供了基础。
So researches on Chinese complex sentence semantics facilitate the NLP from the characters and words level to the sentence or even the paragraph level.
实验结果表明,尽管本文提出的只是一个初步的系统,但是也说明了英文问答系统处理的方法也同样适用于中文问答系统。
The results of the experiments show that although the system presented here is primitive, the methods used in English QA system can be applied in Chinese QA system likewise.
汉语的自动分词,是计算机中文信息处理领域中一个基础而困难的课题。
Automatic word segmentation for the Chinese language is a fundamental and difficult problem in the field of computer Chinese language information processing.
系统利用中文文档信息处理中的文档特征向量提取技术,作为邮件过滤的依据。
The system completes email filtering function, on the basis of document eigenvector pick-up technology of Chinese document information management.
因此,分词在机器翻译、信息检索、文本分类等中文信息处理的各项任务中都发挥着基础性的重要作用。
Therefore word segmentation is a key sub-problem of Chinese information processing, such as machine translation, information retrieval and text classification.
希望对字母词语的规范和中文信息处理有所裨益。
The paper attempts to bestead the Chinese information processing and the lettered-words standardization.
到目前为止,中文信息处理主要依赖于对大规模语料的统计,根据概率,对词与词的关系作出界定。
The main stream in Chinese information processing community depends heavily on corpus-based methods, by making full use of the statistical relationship among words, in recent years.
汉语自动分词是计算机中文信息处理中的难题,也是文献内容分析中必须解决的关键问题之一。
Chinese automatic segmentation is one of the most difficult problems in computer Chinese information disposal and the key problem that document content analysis must resolve.
汉语自动分词是计算机中文信息处理中的难题,也是文献内容分析中必须解决的关键问题之一。
Chinese automatic seg mentation is one of the most difficult problems in computer Chinese information disposal and the key problem that document content analysis must resolve.
汉语分词是信息检索、机器翻译、文本校对等中文信息处理重要领域的基础。
Chinese word segmentation is a basic research issue on Chinese NLP areas such as information retrieval, machine translation, text correction, and so on.
中文相似句子检索的方法在基于实例的机器翻译等中文信息处理领域,具有非常广泛的应用背景。
The approach of similar Chinese sentence retrieval has been used widely in the field of Chinese information processing, such as Example Based Machine Translation (EBMT) and so on.
中文相似句子检索的方法在基于实例的机器翻译等中文信息处理领域,具有非常广泛的应用背景。
The approach of similar Chinese sentence retrieval has been used widely in the field of Chinese information processing, such as Example Based Machine Translation (EBMT) and so on.
应用推荐