多字节的另一个值得关注的问题是对齐问题。
One other issue of concern for multibyte words is alignment, Modern.
多字节的另一个值得关注的问题是对齐问题。
One other issue of concern for multibyte words is alignment.
由于第一个字符是多字节的,因此会导致字符分解和错误输出。
Since the first character is multi-byte, it results in splitting the character and leads to dirty output.
在当时这种情况下,就需要推出可以处理更多字节的微处理器。
记住,传输16字节或更多字节的dma在spe和PPE上必须是16字节的整数倍,并按照16字节边界进行对齐。
Remember that DMA transfers of 16 bytes or more must be in 16-byte multiples and aligned to 16-byte boundaries on both the SPE and the PPE.
默认条件下,多字节的支持也包含在内,允许您在Rhapsody模型的描述以及标签区域中输入日语字符。
Multi-byte support is enabled, by default, to allow you enter Japanese characters into the description and label fields of your Rhapsody model.
CHAR存储固定长度的单字节或多字节的文本字符串(最高可达32,767个字节);在整理文本数据时支持使用代码集顺序。
CHAR Stores single-byte or multibyte text strings of fixed length (up to 32,767 bytes); supports code-set order in collation of text data.
UTF - 8在不支持多字节数据的工具中也能正常工作。
UTF-8 also works better with tools that don't expect to receive multibyte data.
另一个问题是损害了与MRI的兼容性,特别是在多字节支持上。
Another problem is that compatibility with MRI suffers, especially in the multi byte support.
因此,在多字节格式中所需要的字节数取决于长度值。
Therefore, the number of bytes required in multi-byte format depends upon the length value.
但是对于多字节编码,字符的字节长度随使用编码模式的不同而不同,每个字符的长度可能是一个字节或多个字节。
However, in the case of a multi-byte encoding, the length of the character in bytes varies according to the encoding used, and each character can be one or more bytes in length.
UCS是ISO 10646标准定义的所有已知字符集编码的多字节(31位)超集。
UCS is the multibyte (31-bit) superset of all known set encodings defined by the ISO 10646 standard.
这时,我就知道找到了在多字节整数格式中需要的字节数。
When this happens, I know that I have found the required number of bytes in the multi-byte integer format.
VARCHAR存储可变长度的单字节或多字节文本字符串(最高可达255个字节);在整理文本数据时支持使用代码集顺序。
VARCHAR Stores single-byte or multibyte text strings of varying length (up to 255 bytes); supports code-set order collation of text data.
LVARCHAR (IDS)存储可变长度的单字节或多字节文本字符串(最高可达32,739个字节)。
LVARCHAR (IDS) Stores single-byte or multibyte text strings of varying length (up to 32,739 bytes).
每一种POSIXlocale,只要它实际上使用了post- 2.2版本的glibc,不管它是使用ASCII单字节,还是使用多字节utf - 8字符编码,都要依赖于这种字符类型。
Every POSIX locale, regardless of whether it USES an ASCII single-byte or multibyte UTF-8 character encoding, relies on this character type, provided it is actually using a post-2.2 version of glibc.
UTF - 8的目的是使7位和8位ascii应用程序可以在将来的多字节unicode编辑环境中生存。
The purpose of UTF-8 is to make it possible for 7-bit and 8-bit ASCII applications to live in a potentially multibyte Unicode-encoded environment.
Endianness是指多字节字在内存中的存储顺序。
Endianness refers to the order in which the bytes of a multi-byte word are stored in memory.
回想在本系列第一篇文章中的讨论,在希望使用多字节整数格式的时候,必须将第一个长度字节的最左边一位设置为1。
Recall from the discussion in the first article of this series that whenever I want to use multi-byte integer format, I must set the leftmost bit of the first length byte to 1.
但是由于第一个字符是多字节字符,因此得到结果3,它是搜索字符串的第一次出现的位置。
But since you have a multi-byte character as the first character, you get the result as 3, which is the first occurrence of the search string.
这是能够代表被使用代码的最短的可能的多字节序列。
The shortest possible multi-byte sequence that can represent the code is used.
这甚至不需要程序被明确的编码成当前的多字节编码。
The program does not even have to be explicitly coded to the current multi-byte encoding.
回想一下在本系列的第一篇文章中对表2的讨论,有两种表示字节长度的方法:单字节表示法和多字节表示法。
Recall from the discussion of Table 2 in the first article of this series that there are two ways of representing length bytes: single-byte length notation and multi-byte length notation.
将字符看作一个单元而不是一个字节序列,这是进行多字节字符的字符串操作的必要条件。
Recognizing the character as a single, unit as opposed to a sequence of bytes, is a requirement in the case of string manipulations involving multi-byte characters.
正如上面介绍的一样,有一些广为认可的标准为多字节存储和可移植性提供了一些便利;然而,现在还没有为输入和显示制定标准。
As mentioned above, ratified standards provide for multi-byte character storage and portability; as yet, though, there are no standards for input or rendering.
行1有一个包含3个逻辑字符的多字节ut f字符,其中每个字符包含3个字节(上标表示一个字母的存储)。
Row 1 has a multi-byte UTF character containing 3 logical characters containing 3 bytes each (superscript denotes the storage of a single letter).
首先,它可以从面向字节的代码集编码单字节字符,其次,当在数组中使用时,它可以从多字节字符集(如unicode),编码任何多字节字符。
First, it can encode a single byte character from any byte-oriented code set, and second, when used in an array, it can encode any multi-byte character from a multi-byte character set such as Unicode.
图4展示了对“a”的搜索,“a”的实际字符位置是2,而输出的位置是3,原因在于字符串中有多字节字符。
Figure 4 shows a search for "a", the actual character position of "a" is 2 but the output is 3 because there is a multi-byte character in the string.
使用复杂的多字节字符时,比如日语中的那些字符,可能需要构建libiconv库来支持特定代码页,或者是需要应用补丁。
It may also be necessary to build the libiconv library to support a specific code page or apply a patch when complex multi-byte characters are used, such as those in Japanese.
高的inline length值告诉DB 2空间值实际上可以在单独一行中占用很多字节。
A high inline length tells DB2 that the spatial values can actually take that many bytes in a single row.
应用推荐