speech recognition

(redirected from Spoken word recognition)
Also found in: Dictionary, Medical.

speech recognition

[′spēch ‚rek·ig′nish·ən]
(engineering acoustics)
The process of analyzing an acoustic speech signal to identify the linguistic message that was intended, so that a machine can correctly respond to spoken commands.
McGraw-Hill Dictionary of Scientific & Technical Terms, 6E, Copyright © 2003 by The McGraw-Hill Companies, Inc.

speech recognition

(application)
(Or voice recognition) The identification of spoken words by a machine. The spoken words are digitised (turned into sequence of numbers) and matched against coded dictionaries in order to identify the words.

Most systems must be "trained," requiring samples of all the actual words that will be spoken by the user of the system. The sample words are digitised, stored in the computer and used to match against future words. More sophisticated systems require voice samples, but not of every word. The system uses the voice samples in conjunction with dictionaries of larger vocabularies to match the incoming words. Yet other systems aim to be "speaker-independent", i.e. they will recognise words in their vocabulary from any speaker without training.

Another variation is the degree with which systems can cope with connected speech. People tend to run words together, e.g. "next week" becomes "neksweek" (the "t" is dropped). For a voice recognition system to identify words in connected speech it must take into account the way words are modified by the preceding and following words.

It has been said (in 1994) that computers will need to be something like 1000 times faster before large vocabulary (a few thousand words), speaker-independent, connected speech voice recognition will be feasible.
This article is provided by FOLDOC - Free Online Dictionary of Computing (foldoc.org)

voice recognition

(1) Using a person's voice as a form of identification. See two-factor authentication.

(2) The conversion of spoken words into computer text. Speech is first digitized and then matched against a dictionary of coded waveforms. Also called "speech recognition," the matches are converted into text as if the words were typed on the keyboard. "Speaker-dependent" systems require users to enunciate samples to train and fine tune the system. "Speaker-independent" recognition such as telephone voice response systems do not require training but generally handle only a limited vocabulary.

Three Categories
The least taxing on the electronics, "command" systems recognize several dozen words and eliminate using the mouse or keyboard. "Discrete voice" recognition systems used for dictation require a pause between each word. "Continuous voice" recognition understands natural speech without pauses and is the most process intensive. The Holy Grail of voice recognition, speaker-independent, continuous systems that handle extensive vocabularies are slowly but surely becoming mainstream. Contrast with speaker recognition.


First Handheld Speech Recognition
The first continuous dictation in a handheld device was in 2000 when Lernout & Hauspie showed off this Linux PDA prototype. It provided keyboard-free email composition. (Image courtesy of Lernout & Hauspie.)
Copyright © 1981-2019 by The Computer Language Company Inc. All Rights reserved. THIS DEFINITION IS FOR PERSONAL USE ONLY. All other reproduction is strictly prohibited without permission from the publisher.
References in periodicals archive ?
Most studies used monosyllabic stimuli both with visual word recognition tasks (Coltheart et al., 2001) and spoken word recognition tasks (Gaskell & Marslen-Wilson, 1997; 1999; Goldinger, et al., 1989; Kessler & Treiman, 2002; Marslen-Wilson & Warren, 1994; Vitevitch & Luce, 1999).
Ambiguity, competition, and blending in Spoken word recognition. Cognitive Science, 23, 439-462.
(1999).Probabilistic phonotactics and neighborhood activation in spoken word recognition. Journal of Memory and Language, 40, 374-408.
Neighborhood density effects in spoken word recognition in Spanish.
Similarity mapping in spoken word recognition. Journal of Memory and Language 37, 463-480.
Competition in spoken word recognition: spotting words in other words.