character recognition


Also found in: Dictionary, Medical, Wikipedia.
Related to character recognition: Optical character recognition software

character recognition

[′kar·ik·tər ‚rek·ig′nish·ən]
(computer science)
The technology of using a machine to sense and encode into a machine language the characters which are originally written or printed by human beings.

Character recognition

The process of converting scanned images of machine-printed or handwritten text (numerals, letters, and symbols) into a computer-processable format; also known as optical character recognition (OCR). A typical OCR system contains three logical components: an image scanner, OCR software and hardware, and an output interface. The image scanner optically captures text images to be recognized. Text images are processed with OCR software and hardware. The process involves three operations: document analysis (extracting individual character images), recognizing these images (based on shape), and contextual processing (either to correct misclassifications made by the recognition algorithm or to limit recognition choices). The output interface is responsible for communication of OCR system results to the outside world.

Commercial OCR systems can largely be grouped into two categories: task-specific readers and general-purpose page readers. A task-specific reader handles only specific document types. Some of the most common task-specific readers read bank checks, letter mail, or credit-card slips. These readers usually utilize custom-made image-lift hardware that captures only a few predefined document regions. For example, a bank-check reader may scan just the courtesy-amount field (where the amount of the check is written numerically) and a postal OCR system may scan just the address block on a mail piece. Such systems emphasize high throughput rates and low error rates. Applications such as letter-mail reading have throughput rates of 12 letters per second with error rates less than 2%. The character recognizer in many task-specific readers is able to recognize both handwritten and machine-printed text.

General-purpose page readers are designed to handle a broader range of documents such as business letters, technical writings, and newspapers. These systems capture an image of a document page and separate the page into text regions and nontext regions. Nontext regions such as graphics and line drawings are often saved separately from the text and associated recognition results. Text regions are segmented into lines, words, and characters, and the characters are passed to the recognizer. Recognition results are output in a format that can be postprocessed by application software. Most of these page readers can read machine-written text, but only a few can read hand-printed alphanumerics. See Computer

character recognition

The ability of a machine to recognize printed text. See OCR and MICR.
References in periodicals archive ?
Significant in their commercialization efforts will be a focus on handwriting and character recognition.
The latest addition to the XDR product family, this entry-level system combines a unique combination of hardware, software and development tools to obtain the highest quality character recognition efficiently converting scanned images containing handprint and degraded machine print into ASCII data.
OWR(TM) provides up to 40% better search accuracy compared to document review programs that only have Optical Character Recognition (OCR) output.
The iDRS contains the most recent advancements in optical character recognition technologies developed by I.
The Book Reader comes with optical character recognition (OCR), and text-to-speech (TTS) software, both of which allow the scanner to copy and "read" the documents scanned.
It also improves the accuracy of recognition software, thereby reducing the need for manual correction of problematic intelligent character recognition and/or OCR results.
Readiris Pro 10 for PC and Readiris Pro 10 Corporate Edition Use Optical Character Recognition to Create Editable Documents, PDF Archives and Contact Databases
Source Technologies, a leading provider of integrated solutions for managing financial transactions and other secure business processes, announced the availability of its ST9420 magnetic ink character recognition (MICR) laser printer for retail environments.
This revolutionary approach dramatically reduces the number of complex electro-mechanical assembly boards and the amount of other electronic-based hardware typically included in high speed image and optical character recognition scanners.
The IL6000 delivers high-speed image capture for applications that demand the ultimate in image fidelity, such as optical character recognition (OCR).
Source Technologies, a leading provider of integrated solutions for managing financial transactions and other secure business processes, announced the availability of its ST9445 magnetic ink character recognition (MICR) laser printer for secure image replacement document (IRD) printing at the Bank Administration Institute's Annual Retail Delivery Conference.