soundex


Also found in: Dictionary, Wikipedia.

soundex

(algorithm, text)
An algorithm for encoding a word so that similar sounding words encode the same. The first letter is copied unchanged then subsequent letters are encoded as follows:

bfpv -> "1" cgjkqsxz? -> "2" dt -> "3" l -> "4" mn? -> "5" r -> "6"

Other characters are ignored and repeated characters are encoded as though they were a single character. Encoding stops when the resulting string is four characters long, adding trailing "0"s if it is shorter. For example, "SMITH" or "SMYTHE" would both be encoded as "S530".

Soundex

A system for coding English words based on their sound rather than their spelling. Developed in the early 20th century, a Soundex is used to index and match words. The vowels are dropped, and all remaining consonants are assigned three digits except for the first, which is the first letter of the word. There have been several variants of Soundex, but there are far more sophisticated methods for coding speech in today's voice recognition systems. See voice recognition.
Mentioned in ?
References in periodicals archive ?
We think SoundEx provides the best noise reduction for the money and we like that it is not glued to the aluminum skin.
Phonetic algorithms for matching strings have been discussed in context to English language by various researchers from SoundEx [8,9], Metaphone [10,11] and Phonix [12].
Soundex representations of city name and street name were calculated.
Soundex es un algoritmo de codificacion fonetica, que convierte una palabra en un codigo [24].
MatchUp's sophisticated algorithms for fuzzy matching include exact, phonetic, Soundex, containment, frequency, numerics only, and others.
More complex algorithms based on Levenshtein distance (Levenshtein 1966) or soundex (NARA 2010) were not used, as Airport Limousine's clientele has similar demographic profiles and similar address samples were readily available in the historical data.
33) Soundex Index to Petitions for Naturalization filed in Federal, Slate, and Local Courts located in New York City, 1792-1989 (New York, NY: National Archives and Records Administration, Northeast Region: "Classified Ad 8," The New York Amsterdam News, September 18, 1937, 23; Roi Ottley, New World A-Coming: Inside Black America (Cleveland, OH: The World Publishing Company, 1945), 55.
In anticipation of possible new misspellings and transcription errors, a Soundex phonetic algorithm was applied to the cause of death field during classification.
A well-known example especially for the English language is the Soundex algorithm--patented by Russell--for indexing names by sound.
To cite Wiktionary, a Soundex is a "phonetic algorithm for indexing names by their English pronunciation, based on the most probably significant consonants, so that a search for a misspelled name may find the desired one.
The purpose of the Soundex code [is to cluster together names that have similar sounds MVA, 02].