Genetic diversity of SARS-CoV-2 in South America: demographic history and structuration signals.
Ontology highlight
ABSTRACT: In 2020, the emergence of SARS-CoV-2 caused a global public health crisis with significant mortality rates and a large socioeconomic burden. The rapid spread of this new virus has led to the appearance of new variants, making the characterization and monitoring of genetic diversity necessary to understand the population dynamics and evolution of the virus. Here, a population-genetics-based study was performed starting with South American genome sequences available in the GISAID database to investigate the genetic diversity of SARS-CoV-2 on this continent and the evolutionary mechanisms that modulate it.
Project description:Vicuñas and guanacos are two species of wild South American camelids that are key ruminants in the ecosystems where they occur. Although closely related, these species feature differing ecologies and life history characters, which are expected to influence both their genetic diversity and population differentiation at different spatial scales. Here, using mitochondrial and microsatellite genetic markers, we show that vicuña display lower genetic diversity within populations than guanaco but exhibit more structure across their Peruvian range, which may reflect a combination of natural genetic differentiation linked to geographic isolation and recent anthropogenic population declines. Coalescent-based demographic analyses indicate that both species have passed through a strong bottleneck, reducing their effective population sizes from over 20,000 to less than 1000 individuals. For vicuña, this bottleneck is inferred to have taken place ~3300 years ago, but to have occurred more recently for guanaco at ~2000 years ago. These inferred dates are considerably later than the onset of domestication (when the alpaca was domesticated from the vicuña while the llama was domesticated from the guanaco), coinciding instead with a major human population expansion following the mid-Holocene cold period. As importantly, they imply earlier declines than the well-documented Spanish conquest, where major mass mortality events were recorded for Andean human and camelid populations. We argue that underlying species' differences and recent demographic perturbations have influenced genetic diversity in modern vicuña and guanaco populations, and these processes should be carefully evaluated in the development and implementation of management strategies for these important genetic resources.
Project description:South America has a complex demographic history shaped by multiple migration and admixture events in pre- and post-colonial times. Settled over 14,000 years ago by Native Americans, South America has experienced migrations of European and African individuals, similar to other regions in the Americas. However, the timing and magnitude of these events resulted in markedly different patterns of admixture throughout Latin America. We use genome-wide SNP data for 437 admixed individuals from 5 countries (Colombia, Ecuador, Peru, Chile, and Argentina) to explore the population structure and demographic history of South American Latinos. We combined these data with population reference panels from Africa, Asia, Europe and the Americas to perform global ancestry analysis and infer the subcontinental origin of the European and Native American ancestry components of the admixed individuals. By applying ancestry-specific PCA analyses we find that most of the European ancestry in South American Latinos is from the Iberian Peninsula; however, many individuals trace their ancestry back to Italy, especially within Argentina. We find a strong gradient in the Native American ancestry component of South American Latinos associated with country of origin and the geography of local indigenous populations. For example, Native American genomic segments in Peruvians show greater affinities with Andean indigenous peoples like Quechua and Aymara, whereas Native American haplotypes from Colombians tend to cluster with Amazonian and coastal tribes from northern South America. Using ancestry tract length analysis we modeled post-colonial South American migration history as the youngest in Latin America during European colonization (9-14 generations ago), with an additional strong pulse of European migration occurring between 3 and 9 generations ago. These genetic footprints can impact our understanding of population-level differences in biomedical traits and, thus, inform future medical genetic studies in the region.
Project description:After more than 4 months of the COVID-19 pandemics with genomic information of SARS-CoV-2 around the globe, there are more than 1000 complete genomes of this virus. We used 691 genomes from the GISAID database. Several studies have been reporting mutations and hotspots according to viral evolution. Our work intends to show and compare positions that have variants in 30 complete viral genomes from South American countries. We classified strains according to point alterations and portray the source where strains came into this region. Most viruses entered South America from Europe, followed by Oceania. Only Chilean isolates demonstrated a relationship with Asian isolates. Some changes in South American genomes are near to specific domains related to viral replication or the S protein. Our work contributes to the global understanding of which sort of strains are spreading throughout South America, and the differences among them according to the first isolates introduced to this region.
Project description:Since its emergence in Wuhan (China) on December 2019, the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has rapidly spread worldwide. After its arrival in South America in February 2020, the virus has expanded throughout the region, infecting over 900,000 individuals with approximately 41,000 reported deaths to date. In response to the rapidly growing number of cases, a number of different primer-probe sets have been developed. However, despite being highly specific, most of these primer-probe sets are known to exhibit variable sensitivity. Currently, there are more than 300 SARS-CoV2 whole genome sequences deposited in databases from Brazil, Chile, Ecuador, Colombia, Uruguay, Peru, and Argentina. To test how regional viral diversity may impact oligo binding sites and affect test performance, we reviewed all available primer-probe sets targeting the E, N, and RdRp genes against available South American SARS-CoV-2 genomes checking for nucleotide variations in annealing sites. Results from this in silico analysis showed no nucleotide variations on the E-gene target region, in contrast to the N and RdRp genes which showed massive nucleotide variations within oligo binding sites. In lines with previous data, our results suggest that the E-gene stands as the most conserved and reliable target when considering single-gene target testing for molecular diagnosis of SARS-CoV-2 in South America.
Project description:South Eastern Bantu-speaking (SEB) groups constitute more than 80% of the population in South Africa. Despite clear linguistic and geographic diversity, the genetic differences between these groups have not been systematically investigated. Based on genome-wide data of over 5000 individuals, representing eight major SEB groups, we provide strong evidence for fine-scale population structure that broadly aligns with geographic distribution and is also congruent with linguistic phylogeny (separation of Nguni, Sotho-Tswana and Tsonga speakers). Although differential Khoe-San admixture plays a key role, the structure persists after Khoe-San ancestry-masking. The timing of admixture, levels of sex-biased gene flow and population size dynamics also highlight differences in the demographic histories of individual groups. The comparisons with five Iron Age farmer genomes further support genetic continuity over ~400 years in certain regions of the country. Simulated trait genome-wide association studies further show that the observed population structure could have major implications for biomedical genomics research in South Africa.
Project description:A low level of genetic variation in mammalian populations where the census population size is relatively large has been attributed to various factors, such as a naturally small effective population size, historical bottlenecks and social behaviour. The killer whale (Orcinus orca) is an abundant, highly social species with reduced genetic variation. We find no consistent geographical pattern of global diversity and no mtDNA variation within some regional populations. The regional lack of variation is likely to be due to the strict matrilineal expansion of local populations. The worldwide pattern and paucity of diversity may indicate a historical bottleneck as an additional factor.
Project description:Since the zoonotic event from which SARS-CoV-2 started infecting humans late in 2019, the virus has caused more than 5 million deaths and has infected over 500 million people around the world. The pandemic has had a severe impact on social and economic activities, with greater repercussions in low-income countries. South America, with almost 5% of the world's population, has reckoned with almost a fifth of the total people infected and more than 26% (>1/4) of the deceased. Fortunately, the full genome structure and sequence of SARS-CoV-2 have been rapidly obtained and studied thanks to all the scientific efforts and data sharing around the world. Such molecular analysis of SARS-CoV-2 dynamics showed that rates of mutation, similar to other members of the Coronaviridae family, along with natural selection forces, could result in the emergence of new variants; few of them might be of high consequence. However, this is a serious threat to controlling the pandemic and, of course, enduring the process of returning to normalization with the implicit monetary cost of such a contingency. The lack of updated knowledge in South America justifies the need to develop a structured genomic surveillance program of current and emerging SARS-CoV-2 variants. The modeling of the molecular events and microevolution of the virus will contribute to making better decisions on public health management of the pandemic and developing accurate treatments and more efficient vaccines.
Project description:Genetic effects of habitat fragmentation may be undetectable because they are generally a recent event in evolutionary time or because of confounding effects such as historical bottlenecks and historical changes in species' distribution. To assess the effects of demographic history on the genetic diversity and population structure in the Neotropical tree Dipteryx alata (Fabaceae), we used coalescence analyses coupled with ecological niche modeling to hindcast its distribution over the last 21?000 years. Twenty-five populations (644 individuals) were sampled and all individuals were genotyped using eight microsatellite loci. All populations presented low allelic richness and genetic diversity. The estimated effective population size was small in all populations and gene flow was negligible among most. We also found a significant signal of demographic reduction in most cases. Genetic differentiation among populations was significantly correlated with geographical distance. Allelic richness showed a spatial cline pattern in relation to the species' paleodistribution 21?kyr BP (thousand years before present), as expected under a range expansion model. Our results show strong evidences that genetic diversity in D. alata is the outcome of the historical changes in species distribution during the late Pleistocene. Because of this historically low effective population size and the low genetic diversity, recent fragmentation of the Cerrado biome may increase population differentiation, causing population decline and compromising long-term persistence.