Dataset Information

Unzipping Zipf's law.


ABSTRACT: In spite of decades of theorizing, the origins of Zipf's law remain elusive. I propose that a Zipfian distribution straightforwardly follows from the interaction of syntax (word classes differing in class size) and semantics (words having to be sufficiently specific to be distinctive and sufficiently general to be reusable). These factors are independently motivated and well-established ingredients of a natural-language system. Using a computational model, it is shown that neither of these ingredients suffices to produce a Zipfian distribution on its own and that the results deviate from the Zipfian ideal only in the same way as natural language itself does.
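
As a rough illustration of what "Zipfian" means in the abstract above (word frequency falling off roughly as the inverse of frequency rank), the sketch below estimates the slope of log frequency against log rank for a list of tokens. It is not the computational model the abstract describes; the function name `zipf_slope` and the toy corpus are assumptions introduced only for this example. A slope near -1 corresponds to the idealized Zipfian case.

```python
import math
from collections import Counter


def zipf_slope(tokens):
    """Least-squares slope of log(frequency) against log(rank).

    Hypothetical helper for illustration only. Needs at least two
    distinct word types, otherwise the slope is undefined.
    """
    # Frequencies sorted from most to least frequent; rank 1 is the top word.
    freqs = sorted(Counter(tokens).values(), reverse=True)
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Toy corpus (an assumption, not data from the paper): the word of
# rank r occurs about 1000 / r times, so the slope should be close to -1.
toy = [f"w{r}" for r in range(1, 51) for _ in range(round(1000 / r))]
slope = zipf_slope(toy)
print(f"estimated log-log slope: {slope:.3f}")
```

A plain least-squares fit on log-log axes is the simplest way to read off the exponent; it is known to be a crude estimator for real corpora, so treat the result as a descriptive summary rather than a test of Zipf's law.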

SUBMITTER: Lestrade S 

PROVIDER: S-EPMC5549924 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

Publications

Unzipping Zipf's law.

Lestrade Sander S  

PLoS ONE, 2017-08-09, 8


Similar Datasets

S-EPMC8397718 | biostudies-literature
S-EPMC4531284 | biostudies-literature
S-EPMC555536 | biostudies-literature
S-EPMC6303796 | biostudies-literature
S-EPMC2996287 | biostudies-literature
S-EPMC4723055 | biostudies-literature
S-EPMC5033250 | biostudies-literature
S-EPMC5172588 | biostudies-literature
S-EPMC9326285 | biostudies-literature
S-EPMC3596411 | biostudies-literature