character encoding

(redirected from Text encoding)
Also found in: Dictionary.

character encoding

(character)
(Or "character encoding scheme") A mapping of binary values to code positions and back; generally a 1:1 (bijective) mapping.

In the case of ASCII, this is generally a f(x)=x mapping: code point 65 maps to the byte value 65, and vice versa. This is possible because ASCII uses only code positions representable as single bytes, i.e., values between 0 and 255, at most. (US-ASCII only uses values 0 to 127, in fact.)

Unicode and many CJK coded character sets use many more than 255 positions, requiring more complex mappings: sometimes the characters are mapped onto pairs of bytes (see DBCS). In many cases, this breaks programs that assume a one-to-one mapping of bytes to characters, and so, for example, treat any occurrance of the byte value 13 as a carriage return. To avoid this problem, character encodings such as UTF-8 were devised.
References in periodicals archive ?
Despite the undisputable interpretative nature of text encoding, traditional digital literary scholars have prioritized structural over overtly interpretive and critical markup in an attempt to produce the most objective and reliable scholarly editions as possible.
A sampling of topics includes: the impact of computers on scholarly editing, digitizing George Herbert's The Temple, technical competence in digital scholarly editing, the text encoding initiative (TEI), and user focused interfaces.
On the other hand, the field also contains many examples of idealistic approaches to content such as can be seen in the guidelines of the Text Encoding Initiative (TEI).
Academics can synthesise, expose and repurpose information in dynamic new ways, while mass digitisation and text encoding are making historic material more accessible and online databases are extending the boundaries of research.
This is possible as the debates' text is analyzed to extract relevant knowledge using tools such as text encoding and clustering.
Today such risks are fewer due to common standards such as the Text Encoding Initiative (TEI), which specify guidelines for producing digital versions of texts.
(2.) For example, Metadata Encoding and Transmission Standard (METS), Model Editions Project (MEP), and Text Encoding Initiative (TEI).
The significance of this project can be equated with that of the Text Encoding Initiative (TEI).
Most of the textual resources in the Perseus Digital Library are transcriptions that have been marked up, or encoded, according to the standards specified by the Text Encoding Initiative, or TEI (Sperberg-McQueen 2002).
The TEI 2008 Members' Meeting conference will give the Text Encoding Initiative (TEI) community a place to gather, meet new colleagues, learn about new projects, and share research, among other things.
With Encore, Innovative's discovery services platform, patrons may search against a library's entire portfolio of collections, whether built on MARC or on XML-based schema (Dublin CoreSM, Encoded Archival Description [EAD], Text Encoding Initiative [TEI], etc.) using modern web-discovery features like faceted searching.
The program will begin on May 1st and 2nd with hands-on workshops, including one sponsored by the TEI Consortium and funded by the NEH, which will provide a practical introduction to text encoding and another that will focus on navigating online resources in African American and African Diaspora Studies.