Project description:There are very few studies exploring the genetic diversity of tick-borne encephalitis complex viruses. Most of the viruses have been sequenced using capillary electrophoresis, however, very few viruses have been analyzed using deep sequencing to look at the genotypes in each virus population. In this study, different viruses and strains belonging to the tick-borne encephalitis complex were sequenced and genetic diversity was analyzed. Shannon entropy and single nucleotide variants were used to compare the viruses. Then genetic diversity was compared to the phylogenetic relationship of the viruses.
Project description:Eucalyptus urophylla is a commercially important wood crop plantation species due to its rapid growth, biomass yield, and use as bioenergy feedstock. We characterized the genetic diversity and population structure of 332 E. urophylla individuals from 19 geographically defined E. urophylla populations with a reliability of 14,468 single nucleotide polymorphisms (SNPs). We compared the patterns of genetic variation among these 19 populations. High levels of genetic diversity were observed throughout the 19 E. urophylla populations based on genome-wide SNP data (HE=0.2677 to 0.3487). Analysis with STRUCTURE software, Principal component analysis (PCA) and a neighbor-joining (NJ) tree indicated that E. urophylla populations could be divided into three groups, and moderate and weak population structure was observed with pairwise genetic differentiation (FST) values ranging from −0.09 to 0.074. The low genetic diversity and shallow genetic differentiation found within the 19 populations may be a consequence of their pollination system and seed dispersal mechanism. In addition, 55 core germplasms of E. urophylla were constructed according to the genetic marker data. The genome-wide SNPs we identified will provide a valuable resource for further genetic improvement and effective use of the germplasm resources.
Project description:Natural epigenetic variation provides a source for the generation of phenotypic diversity, but to understand its contribution to phenotypic diversity, its interaction with genetic variation requires further investigation. Here, we report population-wide DNA sequencing of genomes, transcriptomes, and methylomes of wild Arabidopsis thaliana accessions. Single cytosine methylation polymorphisms are unlinked to genotype. However, the rate of linkage disequilibrium decay amongst differentially methylated regions targeted by RNA-directed DNA methylation is similar to the rate for single nucleotide polymorphisms. Association analyses of these RNA-directed DNA methylation regions with genetic variants identified 2,372 methylQTL, which revealed the first population estimate of genetically dependent methylation variation. Analysis of invariably methylated transposons and genes across this population indicates that loci targeted by RNA-directed DNA methylation are epigenetically reactivated during male gametogenesis, which facilitates their silencing across generations. RNA-seq from naturally-occurring Arabidopsis accessions
Project description:Natural epigenetic variation provides a source for the generation of phenotypic diversity, but to understand its contribution to phenotypic diversity, its interaction with genetic variation requires further investigation. MethylC-seq from naturally-occurring Arabidopsis accessions
Project description:Natural epigenetic variation provides a source for the generation of phenotypic diversity, but to understand its contribution to phenotypic diversity, its interaction with genetic variation requires further investigation. Here, we report population-wide DNA sequencing of genomes, transcriptomes, and methylomes of wild Arabidopsis thaliana accessions. Single cytosine methylation polymorphisms are unlinked to genotype. However, the rate of linkage disequilibrium decay amongst differentially methylated regions targeted by RNA-directed DNA methylation is similar to the rate for single nucleotide polymorphisms. Association analyses of these RNA-directed DNA methylation regions with genetic variants identified 2,372 methylQTL, which revealed the first population estimate of genetically dependent methylation variation. Analysis of invariably methylated transposons and genes across this population indicates that loci targeted by RNA-directed DNA methylation are epigenetically reactivated during male gametogenesis, which facilitates their silencing across generations.
Project description:With the advent of advanced sequencing technology, studies of RNA viruses have shown that genetic diversity contribute to both attenuation and virulence. The differences in genetic diversity of wild-type Asibi virus and 17D-204 vaccine provides an unique opportunity to investigate RNA population theory in the context of a well described live attenuated vaccine. Utilizing infectious clone-derived viruses containing some of the amino acid substitutions that differentiate yellow fever wild-type Asibi strain from 17D vaccine and recovered in a controlled experiment, establishes that the genetic diversity differences that exist between wild-type Asibi and 17D-204 vaccine viruses are not influenced by either different passage history or source of samples, but rather resulted from the attenuation of wild-type Asibi virus to yield the 17D vaccine sub-strains.
Project description:The Ashkenazi Jewish population has long been considered a genetic isolate and presumed to have the genetic signatures of founder effects and isolation. We genotyped a large cohort of Ashkenazi Jews and analyzed their genetic structure compared to other worldwide populations.
Project description:Structural and functional impacts of copy number variations (CNVs) on livestock genomes are not yet well understood. In this study, we have identified 1853 CNV regions (CNVRs) using population-scale sequencing data generated from 75 cattle of 8 breeds (Holstein, Angus, Jersey, Limousin, Romagnola, Brahman, Gir and Nelore). Individual genome sequence coverage ranged from 4 to 30 fold, with a mean of 11.8 fold. A total of 3.1% (87.5 Mb) of the cattle genome is predicted to be copy number variable, representing a substantial increase over the previous estimates (~2%). This dataset was highly correlated with array CGH data (r2 = 0.761) and was validated to be accurate with an estimated 12% false positive rate and a 19% false negative rate based on qPCR and array CGH, respectively. Hundreds of CNVs were found to be either breed specific or differentially variable across breeds, including the RICTOR gene in dairy breeds and the PNPLA3 gene in the beef breeds. In contrast, clusters of the PRP and PAG genes are duplicated in all sequenced animals, implicating that subfunctionalization, neofunctionalization or overdominance play a role in diversifying these fertility related genes. Further population-genetic analyses based on CNVs revealed the population structures of these taurine and indicine breeds and uncovered hundreds of positively selected CNV candidates near important functional genes. These CNV results provide a new glimpse of diverse selections during cattle speciation, domestication, breed formation, and recent genetic improvement.