代码转换成多字节编码。
这甚至不需要程序被明确的编码成当前的多字节编码。
The program does not even have to be explicitly coded to the current multi-byte encoding.
此外,与其他单字节编码系统的兼容性也是一个巨大的挑战。
Moreover, compatibility with other single-byte encodings is another significant challenge.
UTF - 8字符集比任何其他多字节编码格式更易于分析和操作。
The UTF-8 character set is easier to parse and to manipulate than any other multi-byte encoding format.
Base64将三个字节编码为四个字符(前面的例子就是6lgp)。
Base64 encodes three bytes as four characters (6lgp for the preceding example).
而Unicode需要至少16位字节编码,以添加足够多的信息来区别最少65536个字符。
Unicode needs at least 16 bits to incorporate enough information to distinguish a minimum of 65,536 characters.
本标准具体规定了图形字符的单字节编码和双字节编码,并对四字节编码体系结构做出了规定。
This standard specifies the graphic characters in the single-byte and double byte codes, as well as four-byte coding structure make provisions.
但是对于多字节编码,字符的字节长度随使用编码模式的不同而不同,每个字符的长度可能是一个字节或多个字节。
However, in the case of a multi-byte encoding, the length of the character in bytes varies according to the encoding used, and each character can be one or more bytes in length.
在DB 2UDBV8.2之前,Unicode数据库只能定义为排序次序identity,这意味着按照字节编码对字符进行比较。
Prior to DB2 UDB V8.2, a Unicode database could only be defined with the collating sequence IDENTITY, which means that the characters are compared by their byte encoding.
语言环境设置会导致printf中的%l s格式说明符调用wcsrtombs函数以便于将宽字符的参数字符串转换成依赖语言环境的多字节编码。
The locale setting will cause the % ls format specifier in printf to call the wcsrtombs function in order to convert the wide character argument string into the locale-dependent multi-byte encoding.
Unicode转换格式(Unicode Transformation Formats,UTFs)是一种通过映射多字节编码中的值来支持Unicode的字符编码方案。
Unicode Transformation Formats (UTFs) are CESs that support the use of Unicode by mapping a value in a multi-byte code.
现在,如果必须在UNIXlocale中添加一个或者几个Unicode编码的脚本,那么您会面临一个问题:POSIX locale和扩展的Linux字符串一直是使用8位字节编码的。
Now, if you have to include one or several Unicode-encoded scripts in UNIX locales, you face a problem: POSIX locales and, by extension, Linux strings, are always encoded using 8-bit bytes.
UnicodeCCS 3.1的官方称谓是ISO 10646-1通用多八字节编码字符集(Universal MultipleOctet CodedCharacter Set,UCS)。
The Unicode CCS 3.1 is officially known as the ISO 10646-1 Universal Multiple Octet Coded Character Set (UCS).
但是如果使用字节语义并使用UTF - 8对字符进行编码,则该字符串的长度为6字节。
But if byte semantics is used, and characters are encoded using UTF-8, then the length of the string is 6 bytes.
它是低效的;通过使用每字节仅6位的编码,它添加了大约消息长度的三分之一。
It is inefficient; by using only 6 bits of each byte, it adds about a third to the length of a message.
读取器/写入器有性能开销,因为它们旨在用于字符流,且会在后台将数据编码为字节。
There is a performance overhead involved with readers/writers because they are intended for character streams, and they encode data to bytes under the covers.
结果造成您拥有许多按照XML 1.0规则完全等价的文档,但在对编码的源文件进行逐字节比较时,它们可能完全不同。
The result is that you can have numerous documents that are exactly equivalent in XML 1.0 rules, while being very different under byte-by-byte comparison of the encoded source.
如清单1中所示的任何查询都有到指定数据的精确路径,DBMS可以迅速地在均衡树和字节偏移编码中确定它。
Any query such as the one in Listing 1 has a precise path to the specified datum, which a DBMS can quickly determine in balanced trees and byte-offset codings.
将哈希后的字节数组编码成十六进制字符。
UTF - 8:使用1到4字节来表示每个字符的编码。
UTF-8:1 to 4 bytes are used for the encoding of each character.
因为在utf - 8编码中字符需要不同数量的字节(1到4字节),所以很难处理这些函数。
It is difficult to work with those functions because characters might require various Numbers of bytes in UTF-8 encoding (1 to 4 bytes).
假设您拥有一个UTF - 8编码的字符,其长度为3字节,而字符串只拥有编码的前两个字节。
Consider that you have a character in UTF-8 encoding has length of 3 bytes, and the string has only the first two byte of the encoding.
字节数组限制为MAX_VALUE_LENGTH长度75%,因为字节数组是通过编码为字符串存储的。
Byte arrays are limited in length to 75 percent of MAX_VALUE_LENGTH because byte arrays are stored by encoding them as strings.
对于单字节字符编码模式,一个字节组成一个字符,单字节字符串的长度与字符串的字节长度相同。
In the case of a single-byte character encoding scheme, a single byte constitutes a character and the length of a single byte string is the same as the byte length of the string.
使用Unicode编码后,每用户名占用64字节。
UCS - 2:每个Unicode字符都编码为2字节。
UCS-2: Each Unicode character is encoded with exactly 2 bytes.
UCS - 2:每个Unicode字符都编码为2字节。
UCS-2: Each Unicode character is encoded with exactly 2 bytes.
应用推荐