character encoding

(redirected from Text encoding)
Also found in: Dictionary.

character encoding

(character)
(Or "character encoding scheme") A mapping of binary values to code positions and back; generally a 1:1 (bijective) mapping.

In the case of ASCII, this is generally a f(x)=x mapping: code point 65 maps to the byte value 65, and vice versa. This is possible because ASCII uses only code positions representable as single bytes, i.e., values between 0 and 255, at most. (US-ASCII only uses values 0 to 127, in fact.)

Unicode and many CJK coded character sets use many more than 255 positions, requiring more complex mappings: sometimes the characters are mapped onto pairs of bytes (see DBCS). In many cases, this breaks programs that assume a one-to-one mapping of bytes to characters, and so, for example, treat any occurrance of the byte value 13 as a carriage return. To avoid this problem, character encodings such as UTF-8 were devised.
References in periodicals archive ?
Finally, the company seems to be positioning itself to process and save unstructured data albeit it's text encoding manually today.
Guidelines for Electronic Text Encoding Interchange.
The documentation of electronic texts using Text Encoding Initiative headers: an introduction.
The technical problems are thus considerable, but are being solved by refreshment and critical fail-safe mechanisms, standardisation of formats, such as the Text Encoding Initiative (TEI), Computer Aided Design (CAD) and Geographic Information Systems (GIS), which are all initiatives under consideration by the International Standards Organization.
Esta tesis presenta una investigacion sobre el almacenamiento y la publicacion de los libros en formato digital, para ello se esta haciendo uso de la indizacion y catalogacion de paginas web, a fin de poder facilitar la publicacion e intercambio de los documentos; asimismo el uso de los estandares de publicacion de documentos digitales tales como the Inicitive Text Encoding (TEI), Dublin Core (DC), Resource Description Framework (RDF) XML.
She has been actively involved in language re source development and representation since 1987, founded the Text Encoding initiative, and is currently a project leader in the International Organization for Standardization subcommittee for language resources (ISO TC37 SC4).
It will of significant interest to language teachers and humanities scholars if tools can be developed to enable interchange between the Open eBook standard and the new XML-based format for the long-running Text Encoding Initiative (TEI).
Thus, NVivo can be used in analysis with documents formatted by the standards of the Text Encoding Initiative.
In order to achieve the appropriate balance between "getting the whole text" and "nothing but the text" (28), one must first understand what a text is, to which Renear gives the answer: an "Ordered Hierarchy of Content Objects" or OCHO (27)--a textual ontology that has been used to support strategies such as the Text Encoding Initiative (TEI).
An enhanced version of iSKETCH, which includes encryption, more shapes, colors, animation, and text encoding, is already available for license to software developers and system integrators.
Early invitees include representatives from the Digital Audio-based Information System (DAISY) initiative, the Electronic Book Exchange (EBX) Working Group, the Text Encoding Initiative (TEI) Consortium, NISO, W3C, DocBook, the International Publisher's Association, MPEG, the U.
the Text Encoding Initiative's [TEI] Header, the VRA core, the metadata standards recommended by the Federal Geographic Data Committee--FGDC) but to the fact that there are few common implementations of any single format or scheme.