Unknown

Dataset Information

0

Estimating the reproduction number and transmission heterogeneity from the size distribution of clusters of identical pathogen sequences.


ABSTRACT: Quantifying transmission intensity and heterogeneity is crucial to ascertain the threat posed by infectious diseases and inform the design of interventions. Methods that jointly estimate the reproduction number R and the dispersion parameter k have however mainly remained limited to the analysis of epidemiological clusters or contact tracing data, whose collection often proves difficult. Here, we show that clusters of identical sequences are imprinted by the pathogen offspring distribution, and we derive an analytical formula for the distribution of the size of these clusters. We develop and evaluate an inference framework to jointly estimate the reproduction number and the dispersion parameter from the size distribution of clusters of identical sequences. We then illustrate its application across a range of epidemiological situations. Finally, we develop a hypothesis testing framework relying on clusters of identical sequences to determine whether a given pathogen genetic subpopulation is associated with increased or reduced transmissibility. Our work provides tools to estimate the reproduction number and transmission heterogeneity from pathogen sequences without building a phylogenetic tree, thus making it easily scalable to large pathogen genome datasets.

SUBMITTER: Tran-Kiem C 

PROVIDER: S-EPMC11009662 | biostudies-literature | 2024 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Estimating the reproduction number and transmission heterogeneity from the size distribution of clusters of identical pathogen sequences.

Tran-Kiem Cécile C   Bedford Trevor T  

Proceedings of the National Academy of Sciences of the United States of America 20240403 15


Quantifying transmission intensity and heterogeneity is crucial to ascertain the threat posed by infectious diseases and inform the design of interventions. Methods that jointly estimate the reproduction number <i>R</i> and the dispersion parameter <i>k</i> have however mainly remained limited to the analysis of epidemiological clusters or contact tracing data, whose collection often proves difficult. Here, we show that clusters of identical sequences are imprinted by the pathogen offspring dist  ...[more]

Similar Datasets

| S-EPMC6953795 | biostudies-literature
| S-EPMC10859083 | biostudies-literature
| S-EPMC9449464 | biostudies-literature
| S-EPMC4021574 | biostudies-literature
| S-EPMC9205483 | biostudies-literature
| S-EPMC7968714 | biostudies-literature
| S-EPMC3084214 | biostudies-literature
| S-EPMC7580988 | biostudies-literature
| S-EPMC5821398 | biostudies-literature
| S-EPMC4512769 | biostudies-literature