Vocabulary mismatch

From Wikipedia, the free encyclopedia

Vocabulary mismatch is a common phenomenon in the usage of natural languages, occurring when different people name the same thing or concept differently. It is also known as the vocabulary problem, vocabulary gap, term mismatch, or semantic gap.[1]

Furnas et al. (1987) were perhaps the first to quantitatively study the vocabulary mismatch problem.[2] Their results show that on average 80% of the times different people (experts in the same field) will name the same thing differently. There are usually tens of possible names that can be attributed to the same thing. This research motivated the work on latent semantic indexing.

One source of vocabulary mismatch is inflectional form differences, such as using a female word instead of a male word, or a plural form instead of a singular form.[3] Stemming and lemmatization are two different methods of addressing this source by converting all variations of a word to one form.[3]

Vocabulary mismatch also occurs when language changes over time. For example, a doctor may search for papers about "type 1 diabetes mellitus" and not find papers about "juvenile diabetes" due to a change in terminology.[1]

In information retrieval

Other contexts

References

Related Articles

Wikiwand AI