Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

On the universal structure of human lexical semantics

Journal Article · · Proceedings of the National Academy of Sciences of the United States of America
 [1];  [2];  [3];  [4];  [5];  [6];  [7];  [8]
  1. Institute for New Economic Thinking at the Oxford Martin School, Oxford (United Kingdom); Univ. of Oxford, Oxford (United Kingdom); Santa Fe Institute, Santa Fe, NM (United States)
  2. Indiana Univ., Bloomington, IN (United States)
  3. Santa Fe Institute, Santa Fe, NM (United States); Tokyo Institute of Technology, Tokyo (Japan)
  4. Santa Fe Institute, Santa Fe, NM (United States)
  5. Santa Fe Institute, Santa Fe, NM (United States); Ronin Institute, Montclair, NJ (United States)
  6. Univ. of New Mexico, Albuquerque, NM (United States); Univ. of California, Berkeley, CA (United States)
  7. Univ. of New Mexico, Albuquerque, NM (United States)
  8. Santa Fe Institute, Santa Fe, NM (United States); Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
How universal is human conceptual structure? The way concepts are organized in the human brain may reflect distinct features of cultural, historical, and environmental background in addition to properties universal to human cognition. Semantics, or meaning expressed through language, provides indirect access to the underlying conceptual structure, but meaning is notoriously difficult to measure, let alone parameterize. Here, we provide an empirical measure of semantic proximity between concepts using cross-linguistic dictionaries to translate words to and from languages carefully selected to be representative of worldwide diversity. These translations reveal cases where a particular language uses a single “polysemous” word to express multiple concepts that another language represents using distinct words. We use the frequency of such polysemies linking two concepts as a measure of their semantic proximity and represent the pattern of these linkages by a weighted network. This network is highly structured: Certain concepts are far more prone to polysemy than others, and naturally interpretable clusters of closely related concepts emerge. Statistical analysis of the polysemies observed in a subset of the basic vocabulary shows that these structural properties are consistent across different language groups, and largely independent of geography, environment, and the presence or absence of a literary tradition. As a result, the methods developed here can be applied to any semantic domain to reveal the extent to which its conceptual structure is, similarly, a universal attribute of human cognition and language use.
Research Organization:
Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
Santa Fe Institute; USDOE
Grant/Contract Number:
AC52-06NA25396
OSTI ID:
1329683
Report Number(s):
LA-UR--15-23327
Journal Information:
Proceedings of the National Academy of Sciences of the United States of America, Journal Name: Proceedings of the National Academy of Sciences of the United States of America Journal Issue: 7 Vol. 113; ISSN 0027-8424
Publisher:
National Academy of Sciences, Washington, DC (United States)Copyright Statement
Country of Publication:
United States
Language:
English

References (30)

Learning to express motion events in English and Korean: The influence of language-specific lexicalization patterns journal December 1991
Review of Décsy (1988): A Select Catalog of Language Universals journal January 1989
Large Linguistic Areas and Language Sampling journal January 1989
The Triples Distance for Rooted Bifurcating Phylogenetic Trees journal September 1996
Lexical Universals journal October 1978
figurative language In a universalist perspective journal August 1981
Elements of Information Theory book January 1991
Modern Applied Statistics with S book August 2002
The electrical resistance of a graph captures its commute and cover times journal December 1996
Comparing the shapes of trees book January 1975
Learning to express motion events in English and Korean: The influence of language-specific lexicalization patterns journal December 1991
Comparison of phylogenetic trees journal February 1981
The cross-linguistic categorization of everyday events: A study of cutting and breaking journal November 2008
Space in Language and Cognition journal January 2009
Language Typology and Syntactic Description book October 2007
Grammatical Categories and Cognition journal January 2010
Typology and Universals book January 2002
The myth of language universals: Language diversity and its importance for cognitive science journal October 2009
The weirdest people in the world? journal June 2010
Evolved structure of language shows lineage-specific trends in word-order universals journal April 2011
A Method of Language Sampling journal January 1993
The Logical Analysis of Kinship journal January 1949
Mapping the Origins and Expansion of the Indo-European Language Family journal August 2012
On Information and Sufficiency journal March 1951
Principles of Historical Linguistics book December 1986
New directions in lexical typology journal January 2012
The verbs of perception: a typological study journal January 1983
Language sampling journal January 1998
general principles of human anatomical partonomy and speculations on the growth of partonomic nomenclature 1 journal August 1976
The ade4 Package: Implementing the Duality Diagram for Ecologists journal January 2007

Cited By (15)

Knowledge gaps in the early growth of semantic feature networks journal September 2018
Languages Support Efficient Communication about the Environment: Words for Snow Revisited. text January 2016
Q&A: What is human language, when did it evolve and why should we care? journal July 2017
How Does Grammatical Gender Affect Noun Representations in Gender-Marking Languages? conference January 2019
Semantic typology: meaning in a cross–linguistic perspective journal July 2017
Knowledge gaps in the early growth of semantic networks preprint January 2017
Building the Mongolian WordNet text January 2020
EEG-based classification of natural sounds reveals specialized responses to speech and music journal April 2020
Cross-Linguistic Data Formats, advancing data sharing and re-use in comparative linguistics journal October 2018
Lexical semantics in language shift: Comparing emotion lexica in Dalabon and Barunga Kriol (northern Australia) journal May 2018
The semantic map model: State of the art and future avenues for linguistic research journal February 2018
Emotion semantics show both cultural variation and universal structure journal December 2019
Languages Support Efficient Communication about the Environment: Words for Snow Revisited journal April 2016
Towards a Universal Semantic Dictionary preprint July 2019
Towards a Universal Semantic Dictionary journal September 2019

Similar Records

SLC primer for Russian translation users. [For IBM 360/195]
Technical Report · Mon Aug 01 00:00:00 EDT 1977 · OSTI ID:7299874

Computerized language translation at ORNL. [Uses SLC language]
Conference · Wed Dec 31 23:00:00 EST 1975 · OSTI ID:7140781