Dataset Information

A Markov model of the Indus script.

ABSTRACT: Although no historical information exists about the Indus civilization (flourished ca. 2600-1900 B.C.), archaeologists have uncovered about 3,800 short samples of a script that was used throughout the civilization. The script remains undeciphered, despite a large number of attempts and claimed decipherments over the past 80 years. Here, we propose the use of probabilistic models to analyze the structure of the Indus script. The goal is to reveal, through probabilistic analysis, syntactic patterns that could point the way to eventual decipherment. We illustrate the approach using a simple Markov chain model to capture sequential dependencies between signs in the Indus script. The trained model allows new sample texts to be generated, revealing recurring patterns of signs that could potentially form functional subunits of a possible underlying language. The model also provides a quantitative way of testing whether a particular string belongs to the putative language as captured by the Markov model. Application of this test to Indus seals found in Mesopotamia and other sites in West Asia reveals that the script may have been used to express different content in these regions. Finally, we show how missing, ambiguous, or unreadable signs on damaged objects can be filled in with most likely predictions from the model. Taken together, our results indicate that the Indus script exhibits rich synactic structure and the ability to represent diverse content. both of which are suggestive of a linguistic writing system rather than a nonlinguistic symbol system.

SUBMITTER: Rao RP

PROVIDER: S-EPMC2721819 | biostudies-literature |

REPOSITORIES: biostudies-literature

ACCESS DATA

Dataset Information

A Markov model of the Indus script.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

VIPR HMM: A hidden Markov model for detecting recombination with microbial detection arrays
2012-10-18 | GSE34490 | GEO

Script Templates: A Practical Approach to Script Training in Aphasia.
| S-EPMC5006751 | biostudies-literature

MetabR: an R script for linear model analysis of quantitative metabolomic data.
| S-EPMC3532230 | biostudies-literature

Disease surveillance using a hidden Markov model.
| S-EPMC2735038 | biostudies-literature

Gene hunting with hidden Markov model knockoffs.
| S-EPMC6373422 | biostudies-literature

Group association test using a hidden Markov model.
| S-EPMC4834946 | biostudies-literature

Estimation of phylogeny using a general Markov model.
| S-EPMC2658871 | biostudies-literature