• 信息抽取是从自由文本语料构建数据库实现情报自动收集有效途径之一。

    Information extraction is a main approach for constructing databases from free-text corpora and for automatically collecting intelligence.

    youdao

  • 一些文本语料进行了分类,例如通过类型或者主题有时候语料类别相互重叠

    Some text corpora are categorized, e.g., by genre or topic; sometimes the categories of a corpus overlap each other.

    youdao

  • 为了训练方法可以使用第一家庭作业得到讲稿转录文本语料(GZ)6.001课本中文件(GZ)。

    To train your method, you will use the lecture transcript corpus (GZ) from the first homework, and a 6.001 textbook source file (GZ).

    youdao

  • 任何文本分析一步是从文本内容生成一个语料(corpus),后续的分析将应用于此语料库。

    In any text analysis, the first step is to generate a corpus from the textual content, with the subsequent analysis being applied to the corpus.

    youdao

  • 相关文档执行文本分析可以导致更高质量分类因为可以交叉引用大的语料分析出文档之间更深层的关系。

    Performing textual analysis across a set of related documents can result in higher-quality categorization, as you can cross-reference from a larger corpus and glean deeper relations between documents.

    youdao

  • 生成语料原因之一规范化文本删除任何相关的内容。

    One of the reasons for generating a corpus is to normalize text and remove anything that isn't relevant.

    youdao

  • 通过大型语料(海量文本)来检查是个好方法。

    Large corpora (masses of text) are a good place to start.

    youdao

  • 本文基于大量真实WTO语料考察WTO文本语言现象,分析特有的句法特征探讨其汉译一些策略

    This paper, based on a large corpus of authentic WTO texts, examines their linguistic features, particularly their syntactic features, and discusses strategies for translating such texts into Chinese.

    youdao

  • 自动机设计充分考虑各种类别的实体文本结构特点大规模人民日报语料测试取得了很好的识别效果

    The design of the automaton fully considers the characteristics of each kind of entity, and it achieved good recognition results when tested on a large-scale People's Daily corpus.

    youdao

  • 同时大量真实文本语料详细探讨”、“”指代词情景语境中的手势非手势指示、上下文语境回指、预指等现象的规律或倾向性规律。

    Meanwhile, using real data from our corpus, we attempt to investigate the gestural and symbolic usage of "this" and "that" in situational context, and their anaphoric and cataphoric usage in linguistic context.

    youdao

  • 语料语言学作为新兴学科可以应用于文学批评领域分析文学文本

    As a new and rising discipline, Corpus Linguistics can be applied in the field of literary criticism to analyze literary text.

    youdao

  • 统计机器翻译是利用基于语料训练得到统计参数模型,将源语言的文本翻译目标语言,机器翻译主流方向

    Statistical machine translation (SMT) translates text from a source language into a target language using statistical parameter models trained on a corpus, and it has become the mainstream of machine translation research.

    youdao

  • 翻译英语语料(TEC)世界上首个当代翻译英语语料库,包含许多书面文本英语译文

    TEC (translational English corpus) is the first and largest translational English corpus in the world, which consists of written English translations from a range of source languages.

    youdao

  • 大规模语料基础上,利用语言模型中稀疏事件概率估计方法汉语的熵进行计算讨论语料规模等因素对熵的影响。

    In this paper, different methods of estimating the probabilities of sparse events are applied to compute the entropy of large-scale modern Chinese text.

    youdao

  • 本文利用三种特征选择方法、两种权重计算方法、用词表以及支持向量机分类器对汽车语料文本情感类别进行了研究

    The experiment results indicate that the greater text sentiment classification impact depends on other corpus, excluded adjective, verb, adverb as stop words and none stop words.

    youdao

  • 基于概率算法考虑训练语料概率模型,对于不同领域文本处理不尽如人意。

    Probabilistic methods that consider only the probabilistic model of the training set also perform poorly on texts from a specific domain.

    youdao

  • 首先作者建立一个语料,收录了19922008年17研究生入学英语考试试题文本

    Firstly, the author has built a corpus based on PGEE exam texts of 17 years from 1992 to 2008.

    youdao

  • 文章描述了一种自动获取文本切分知识的机器学习方法

    This paper presents a learning method to automatically acquire segmentation knowledge from a Chinese corpus.

    youdao

  • 基于语料语义接受度(SAS)研究在线衡量文本理解程度的可行性方法

    The corpus-based study of the Semantic Accessibility Scale (SAS) is a useful method for evaluating the acceptance of electronic texts.

    youdao

  • 如何通过现有的互译文本建立大规模双语语料双语互译文本加工成为至关重要问题。

    How to use existing bilingual texts to build a large-scale bilingual corpus has made the processing of bilingual text a crucial problem.

    youdao

  • 互联网用作语料一种把互联网上的文本用作语料资源的新兴方法

    While the web is not an archetypal corpus, the "web as a corpus" method is irrefutably functional and has found widespread application in linguistic data retrieval and linguistic hypothesis testing.

    youdao

  • 抽取电子邮件手机短信多种文本特征分别TREC07P电子邮件语料真实中文手机短信语料上进行垃圾信息过滤实验

    After multiple text features are extracted from email and short message service (SMS) documents, spam filtering experiments are run separately on the TREC07P email corpus and a real Chinese SMS corpus.

    youdao

  • 但是大规模双语平行语料获取并不容易现有的平行语料规模时效性领域平衡性等方面还不能满足处理真实文本实际需要

    However, access to a large-scale bilingual parallel corpus is not easy, and the existing parallel corpora cannot meet actual needs in terms of scale, timeliness, and domain balance.

    youdao

  • 方法通过对机器分人工校对语料学习,自动获取中文文本的分校对规则,并应用规则机器分词结果进行自动校对。

    It discusses and analyzes the current state of Chinese word segmentation, and describes a rule-based approach to automatically correcting Chinese word segmentation.

    youdao
