On the universal structure of human lexical semantics
Journal Article
·
· Proceedings of the National Academy of Sciences of the United States of America
- Institute for New Economic Thinking at the Oxford Martin School, Oxford (United Kingdom); Univ. of Oxford, Oxford (United Kingdom); Santa Fe Institute, Santa Fe, NM (United States)
- Indiana Univ., Bloomington, IN (United States)
- Santa Fe Institute, Santa Fe, NM (United States); Tokyo Institute of Technology, Tokyo (Japan)
- Santa Fe Institute, Santa Fe, NM (United States)
- Santa Fe Institute, Santa Fe, NM (United States); Ronin Institute, Montclair, NJ (United States)
- Univ. of New Mexico, Albuquerque, NM (United States); Univ. of California, Berkeley, CA (United States)
- Univ. of New Mexico, Albuquerque, NM (United States)
- Santa Fe Institute, Santa Fe, NM (United States); Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
How universal is human conceptual structure? The way concepts are organized in the human brain may reflect distinct features of cultural, historical, and environmental background in addition to properties universal to human cognition. Semantics, or meaning expressed through language, provides indirect access to the underlying conceptual structure, but meaning is notoriously difficult to measure, let alone parameterize. Here, we provide an empirical measure of semantic proximity between concepts using cross-linguistic dictionaries to translate words to and from languages carefully selected to be representative of worldwide diversity. These translations reveal cases where a particular language uses a single “polysemous” word to express multiple concepts that another language represents using distinct words. We use the frequency of such polysemies linking two concepts as a measure of their semantic proximity and represent the pattern of these linkages by a weighted network. This network is highly structured: Certain concepts are far more prone to polysemy than others, and naturally interpretable clusters of closely related concepts emerge. Statistical analysis of the polysemies observed in a subset of the basic vocabulary shows that these structural properties are consistent across different language groups, and largely independent of geography, environment, and the presence or absence of a literary tradition. As a result, the methods developed here can be applied to any semantic domain to reveal the extent to which its conceptual structure is, similarly, a universal attribute of human cognition and language use.
- Research Organization:
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- Sponsoring Organization:
- Santa Fe Institute; USDOE
- Grant/Contract Number:
- AC52-06NA25396
- OSTI ID:
- 1329683
- Report Number(s):
- LA-UR--15-23327
- Journal Information:
- Proceedings of the National Academy of Sciences of the United States of America, Journal Name: Proceedings of the National Academy of Sciences of the United States of America Journal Issue: 7 Vol. 113; ISSN 0027-8424
- Publisher:
- National Academy of Sciences, Washington, DC (United States)Copyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
SLC primer for Russian translation users. [For IBM 360/195]
Computerized language translation at ORNL. [Uses SLC language]
Technical Report
·
Mon Aug 01 00:00:00 EDT 1977
·
OSTI ID:7299874
Computerized language translation at ORNL. [Uses SLC language]
Conference
·
Wed Dec 31 23:00:00 EST 1975
·
OSTI ID:7140781