Unknown

Dataset Information

0

Greater Early Disambiguating Information for Less-Probable Words: The Lexicon Is Shaped by Incremental Processing.


ABSTRACT: There has been much work over the last century on optimization of the lexicon for efficient communication, with a particular focus on the form of words as an evolving balance between production ease and communicative accuracy. Zipf's law of abbreviation, the cross-linguistic trend for less-probable words to be longer, represents some of the strongest evidence the lexicon is shaped by a pressure for communicative efficiency. However, the various sounds that make up words do not all contribute the same amount of disambiguating information to a listener. Rather, the information a sound contributes depends in part on what specific lexical competitors exist in the lexicon. In addition, because the speech stream is perceived incrementally, early sounds in a word contribute on average more information than later sounds. Using a dataset of diverse languages, we demonstrate that, above and beyond containing more sounds, less-probable words contain sounds that convey more disambiguating information overall. We show further that this pattern tends to be strongest at word-beginnings, where sounds can contribute the most information.

SUBMITTER: King A 

PROVIDER: S-EPMC7323847 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC10205071 | biostudies-literature
| S-EPMC7148022 | biostudies-literature
| S-EPMC6506442 | biostudies-literature
| S-EPMC8793294 | biostudies-literature
| S-EPMC3278621 | biostudies-literature
| S-EPMC10523564 | biostudies-literature
| S-EPMC6638850 | biostudies-literature
| S-EPMC3116835 | biostudies-literature
| S-EPMC6084553 | biostudies-literature
| S-EPMC9580628 | biostudies-literature