滤词器在分词之后对标号做进一步处理(最典型的比如去掉标点符号和一些常见词,像"the", "an", "a")。
The filters do any post-tokenizing work on the tokens (typically dropping punctuation and commonly occurring words like "the", "an", "a", etc.).
扫描设备经常遇到难词(两种不同的符号识别程序会将其显示成不同的字母)。
The scanners regularly encounter difficult words (those for which two different character-recognition algorithms produce different transliterations).
语言允许在符号,比如说出的一个词,与我们想要使用的任何想法之间,存在着这种映射关系。
It allows for this map between a symbol, say a spoken word, and any sort of thought we want to use.
起初他试图为每一个词创造一个符号,但是后来证明这是不可行的:切罗基语言里有太多的词汇。
At first he tried to make a sign, or symbol, for each word. But that proved impossible - there were just too many words.
金融、华尔街,“招聘”一词已经成了这种博弈的符号,代表着比仅仅选择一条职业道路更广更深的一系列问题。
Finance, Wall Street, "recruiting" have become the symbol of this dilemma, representing a set of issues that is much broader and deeper than just one career path.
报告中提到,一些家长也很喜欢英文字母词,一对夫妇想给他们的孩子起名为@,他们说,这个用在电子邮件地址中的符号能体现他们对孩子的爱。
The report said some parents are so keen on English letters that a couple tried to name their baby "@", claiming the character used in email addresses reflects their love for the child.
通过定义同义词,可以使用更短的符号表示表数量。
By defining a synonym, you can refer to table quantities by using a shorter notation.
莫斯说,游戏玩家通常用数字和符号来代替字母。他们创造出了一种“l33tspeak”“精英语言”(网络黑话),“l33t”读作“leet”,是elite(精英)一词的缩写。
Gamers commonly substitute numbers and symbols for the letters, Morse says, creating what they call "l33t speak" — that's "leet" when spoken, short for "elite" to the rest of the world.
Lucene支持基于编辑距离算法的模糊搜索,你可以使用波浪符号“~”放在查询词的后面,比如搜索一个与“roam”拼写相近的词可以使用 roam~。
Lucene supports fuzzy searches based on the Levenshtein Distance, or Edit Distance, algorithm. To do a fuzzy search, put the tilde symbol, "~", at the end of a single-word term. For example, to search for a term similar in spelling to "roam", use roam~.
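The edit distance mentioned above can be illustrated with a short Python sketch. This is a textbook dynamic-programming version written for illustration only, not Lucene's own implementation; the function name `levenshtein` is an assumption made for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character insertions,
    deletions, and substitutions needed to turn a into b."""
    # prev[j] is the distance between the previous prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute ca with cb (or keep it)
            ))
        prev = curr
    return prev[-1]
```

Under this measure "roam" and "foam" are one edit apart, which is why a fuzzy query for one can plausibly match the other.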
attr_reader和attr_writer都不是关键词,而是Ruby中的实际方法(在Module类中),它们以符号作为参数。
Both attr_reader and attr_writer are not keywords but are actual methods in Ruby (found in the Module class) that take symbols as arguments.
写一个读取文件的程序,把每一行拆分成一个个词,去掉空白和标点符号,然后把所有单词都转换成小写字母的。
Write a program that reads a file, breaks each line into words, strips whitespace and punctuation from the words, and converts them to lowercase.
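One possible solution to the exercise above, as a minimal Python sketch. The helper names `clean_words` and `words_from_file` are hypothetical, and the sketch assumes that "punctuation" means the ASCII characters in `string.punctuation` and that words are separated by whitespace.

```python
import string


def clean_words(lines):
    """Split each line into words, strip surrounding whitespace and
    punctuation from every word, and convert it to lowercase."""
    strip_chars = string.punctuation + string.whitespace
    words = []
    for line in lines:
        for raw in line.split():
            word = raw.strip(strip_chars).lower()
            if word:  # skip tokens that were punctuation only
                words.append(word)
    return words


def words_from_file(path):
    """Read the file at path and return its cleaned words."""
    with open(path, encoding="utf-8") as handle:
        return clean_words(handle)
```

Keeping the cleaning step separate from the file handling means it can be tried on any list of strings, for example `clean_words(["Hello, World!"])`.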
下划线划在符号,词或词组下的线,用以表示强调或斜体打印。
A line under something, such as a symbol, word, or phrase, used to indicate emphasis or italic type.
书写单位;书写符号代表一个元音、辅音、音节、词或其它表达方式,并且不能进一步分解的书写符号。
A written character that represents a vowel, consonant, syllable, word, or other expression and that cannot be further analyzed.
不要把研究特殊性的一词与表意符号的一词混淆,后者是衍生于表意符号的形容词。
The word idiographic is not to be confused with ideographic, which is the adjective formed from ideogram.
词是语义最基本的载体,它不仅仅是一种语言符号,还具有更为复杂的多重文化意义。
The word is the most basic semantic unit. It is not only a linguistic symbol but also carries complex, multilayered cultural meanings.
第二种单词是同音异形异意词——它是由字母与符号或者数字组合构成的,而听起来像另外的词。
The second kind of word is a "homophone": it's created by combining letters and symbols, or numbers, to sound like other words.
符号学的翻译标准“意义相符,功能相似”对商标词的翻译具有重要的指导意义。
It points out that trademark translation should follow the criteria of "correspondence in meaning and similarity in function".
结构主义所认为的语言符号的任意性是指词形和词义之间的关系,多义词的基本词义与其他延伸意义间存在一定联系。
The arbitrariness claimed by structuralism exists between word form and word meaning, while the relationship between the original and extended meanings of a polysemous word is motivated.
下面两段话用同样的词但不同的标点符号,意思却南辕北辙了。
The following two paragraphs use the same words but different punctuation. See what a difference punctuation can make!
本文从符号学的角度对网语进行了初步探讨,认为谐音策略是网语符号生成的主要方式之一,并归纳分析了谐音对译、谐音别解、谐音仿词、谐音代字、谐音假借和谐音节略六种形式。
This paper presents a brief discussion on Chinese cybertalk from the semiotic perspective, and the homophony strategies are considered to be the main and effective way to produce the cybertalk signs.
过分依赖文字的搜索引擎降低了搜索可用性,他们不能应付那些打字稿(扫描稿),错别字、以及复数的、加了连字符号的词,还有变异的单词。
Overly literal search engines reduce usability in that they're unable to handle typos, plurals, hyphens, and other variants of the query terms.
它是通过字母,符号或数字的组合创造出来的,而听起来却像其他的词。
It's created by combining letters and symbols, or numbers, to sound like other words.
第二种是“同音词”。它是由字母符号、数字组成的,听起来像别的单词。
The second kind of word is a "homophone": it's created by combining letters and symbols, or numbers, to sound like other words.
字是词的书写符号,造字就在书写形体与词之间确立指称关系。
Characters are the graphic symbols of Chinese lexical items, and character formation establishes the referential relationship between graphic forms and lexical items.