character encoding

(redirected from Character sets)
Also found in: Dictionary.
Related to Character sets: ASCII, Character encoding

character encoding

(character)
(Or "character encoding scheme") A mapping of binary values to code positions and back; generally a 1:1 (bijective) mapping.

In the case of ASCII, this is generally a f(x)=x mapping: code point 65 maps to the byte value 65, and vice versa. This is possible because ASCII uses only code positions representable as single bytes, i.e., values between 0 and 255, at most. (US-ASCII only uses values 0 to 127, in fact.)

Unicode and many CJK coded character sets use many more than 255 positions, requiring more complex mappings: sometimes the characters are mapped onto pairs of bytes (see DBCS). In many cases, this breaks programs that assume a one-to-one mapping of bytes to characters, and so, for example, treat any occurrance of the byte value 13 as a carriage return. To avoid this problem, character encodings such as UTF-8 were devised.
References in periodicals archive ?
5" full-color display and built-in/customizable video tutorials; a tool-free print head and platens replacement; an aluminum diecast construction; and over 30 display languages and 40 print character sets.
The Unicode character set has become the standard operating system, allowing all products that are Unicode enabled to handle all characters for every country around the world.
Single Byte Character Sets (SBCS) contain 256 character codes and are adequate for handling data from Western Europe, the United States, Canada and Latin America.
The client comes with a translator, which converts copied or imported records in standard Windows character sets to Unicode.
Here we use the multiple character sets, both in combination (a total-evidence approach) and independently (a taxonomic-congruence approach) to statistically examine the level of discriminatory power attained for the two stone crab species, and we assess the efficacy of using the combined versus the independent character sets to classify stone crabs from the hybrid zone.
Incorporating the new character set presented a number of challenges besides the obvious one--the right to left flow of sentences.
Other systems currently use either DOS code pages, which can handle only 255 characters simultaneously, or have hard coded character sets, none of which have the clean fixed length structure of UNICODE or are truly international standards.
Microsoft Dynamics partnered with Basis Technology to audit, implement and Unicode enable Microsoft Dynamics AX, to handle multiple character sets in a single environment.
Special features of the system include enhancements to the way character sets are handled and the extended use of the BRS/Thesaurus module.
Supports international double-byte character sets enabling creation of PDF documents using nearly any Latin, Cyrillic or Asian character set.
SecureSphere also provides the industry's most comprehensive multi-language and multi-character set support, including support for double-byte character sets, a key requirement in the Korean market.