ABSTRACT: Phylogenetic implications of the whole mitochondrial genome provide insight into geographic distribution and morphological relationships of Bos spp.
Project description:To effectively monitor microbial populations in acidic environments and bioleaching systems, a comprehensive 50-mer-based oligonucleotide microarray was developed based on most of the known genes associated with the acidophiles. This array contained 1,072 probes in which there were 571 related to 16S rRNA and 501 related to functional genes. Acid mine drainage (AMD) presents numerous problems to the aquatic life and surrounding ecosystems. However, little is known about the geographic distribution, diversity, composition, structure and function of AMD microbial communities. In this study, we analyzed the geographic distribution of AMD microbial communities from twenty sites using restriction fragment length polymorphism (RFLP) analysis of 16S rRNA genes, and the results showed that AMD microbial communities were geographically distributed and had high variations among different sites. Then an AMD-specific microarray was used to further analyze nine AMD microbial communities, and showed that those nine AMD microbial communities had high variations measured by the number of detected genes, overlapping genes between samples, unique genes, and diversity indices. Statistical analyses indicated that the concentrations of Fe, S, Ca, Mg, Zn, Cu and pH had strong impacts on both phylogenetic and functional diversity, composition, and structure of AMD microbial communities. This study provides insights into our understanding of the geographic distribution, diversity, composition, structure and functional potential of AMD microbial communities and key environmental factors shaping them.
Project description:To effectively monitor microbial populations in acidic environments and bioleaching systems, a comprehensive 50-mer-based oligonucleotide microarray was developed based on most of the known genes associated with the acidophiles. This array contained 1,072 probes in which there were 571 related to 16S rRNA and 501 related to functional genes. Acid mine drainage (AMD) presents numerous problems to the aquatic life and surrounding ecosystems. However, little is known about the geographic distribution, diversity, composition, structure and function of AMD microbial communities. In this study, we analyzed the geographic distribution of AMD microbial communities from twenty sites using restriction fragment length polymorphism (RFLP) analysis of 16S rRNA genes, and the results showed that AMD microbial communities were geographically distributed and had high variations among different sites. Then an AMD-specific microarray was used to further analyze nine AMD microbial communities, and showed that those nine AMD microbial communities had high variations measured by the number of detected genes, overlapping genes between samples, unique genes, and diversity indices. Statistical analyses indicated that the concentrations of Fe, S, Ca, Mg, Zn, Cu and pH had strong impacts on both phylogenetic and functional diversity, composition, and structure of AMD microbial communities. This study provides insights into our understanding of the geographic distribution, diversity, composition, structure and functional potential of AMD microbial communities and key environmental factors shaping them. This study investigated the geographic distribution of Acid Mine Drainages microbial communities using a 16S rRNA gene-based RFLP method and the diversity, composition and structure of AMD microbial communities phylogenetically and functionally using an AMD-specific microarray which contained 1,072 probes ( 571 related to 16S rRNA and 501 related to functional genes). The functional genes in the microarray were involved in carbon metabolism (158), nitrogen metabolism (72), sulfur metabolism (39), iron metabolism (68), DNA replication and repair (97), metal-resistance (27), membrane-relate gene (16), transposon (13) and IST sequence (11).
Project description:The genus Flaveria has been extensively used as a model to study the evolution of C4 photosynthesis as it contains both C3 and C4 species as well as a number of species that exhibit intermediate types of photosynthesis. The current phylogenetic tree of the Flaveria genus contains 21 of the 23 known Flaveria species and has been constructed using a combination of morphologicial data and three non-coding DNA sequences (nuclear encoded ETS, ITS and chloroplast encoded trnl-F). However, recent studies have suggested that phylogenetic trees inferred using a small number of molecular sequences may often be incorrect. Moreover, studies in other genera have often shown substantial differences between trees inferred using morphological data and those using molecular sequence. To provide new insight into the phylogeny of the genus Flaveria we utilize RNA-Seq data to construct a multi-gene concatenated phylogenetic tree of 17 Flaveria species. Furthermore, we use this new data to identify 14 C4 specific non-synonymous mutation sites, 12 of which (86%) can be independently verified by public sequence data. We propose that the data collection method provided in this study can be used as a generic method for facilitating phylogenetic tree reconstruction in the absence of reference genomes for the target species. 18 Flaveria sample including 11 species are sequenced, other three samples were also sequenced as out-group. In all, 21 samples.
Project description:The pairing of CRISPR/Cas9-based gene editing with massively parallel single-cell readouts now enables large-scale lineage tracing. However, the rapid growth in complexity of data from these assays has outpaced our ability to accurately infer phylogenetic relationships. First, we introduce Cassiopeia - a suite of scalable maximum parsimony approaches for tree reconstruction. Second, we provide a simulation framework for evaluating algorithms and exploring lineage tracer design principles. Finally, we generate the most complex experimental lineage tracing dataset to date, 34,557 human cells continuously traced over 15 generations, and use it for benchmarking phylogenetic inference approaches. We show that Cassiopeia outperforms traditional methods by several metrics and under a wide variety of parameter regimes, and provide insight into the principles for the design of improved Cas9-enabled recorders. Together these should broadly enable large-scale mammalian lineage tracing efforts.Cassiopeia and its benchmarking resources are publicly available at https://www.github.com/YosefLab/Cassiopeia.
Project description:A phylogenetic analysis of seven different species (human, mouse, rat, worm, fly, yeast, and plant) utilizing all (541) basic helix-loop-helix (bHLH) genes identified, including expressed sequence tags (EST), was performed. A super-tree involving six clades and a structural categorization involving the entire coding sequence was established. A nomenclature was developed based on clade distribution to discuss the functional and ancestral relationships of all the genes. The position/location of specific genes on the phylogenetic tree in relation to known bHLH factors allows for predictions of the potential functions of uncharacterized bHLH factors, including EST's. A genomic analysis using microarrays for four different mouse cell types (i.e. Sertoli, Schwann, thymic, and muscle) was performed and considered all known bHLH family members on the microarray for comparison. Cell-specific groups of bHLH genes helped clarify those bHLH genes potentially involved in cell specific differentiation. This phylogenetic and genomic analysis of the bHLH gene family has revealed unique aspects of the evolution and functional relationships of the different genes in the bHLH gene family. PMID: 18557763 We used microarrays to determine bHLH expression in 20d rat Sertoli cells. RNA samples from two control groups (Sertoli cells cultured for 72 h) are compared to two treated groups (Sertoli cells cultured for 72 h with cAMP).
Project description:Copy number variations (CNVs) have been demonstrated as crucial substrates for evolution, adaptation and breed formation. Chinese indigenous cattle breeds exhibit a broad geographical distribution and diverse environmental adaptability. Here, we analyzed the population structure and adaptation to high altitude of Chinese indigenous cattle based on genome-wide CNVs derived from the high-density BovineHD SNP array. We successfully detected the genome-wide CNVs of 318 individuals from 24 Chinese indigenous cattle breeds and 37 yaks as outgroups. A total of 5,818 autosomal CNV regions (683 bp - 4,477,860 bp in size), covering ~14.34% of the bovine genome (UMD3.1), were identified, showing abundant CNV resources. Neighbor-joining clustering, principal component analysis (PCA), and population admixture analysis based on these CNVs support that most Chinese cattle breeds are hybrids of Bos taurus taurus (hereinafter to be referred as Bos taurus) and Bos taurus indicus (Bos indicus). The distribution patterns of the CNVs could to some extent be related to the geographical backgrounds of the habitat of the breeds, and admixture among cattle breeds from different districts. We analyzed the selective signatures of CNVs positively involved in high-altitude adaptation using pairwise Fst analysis within breeds with a strong Bos taurus background (taurine-type breeds) and within Bos taurus×Bos indicus hybrids, respectively. CNV-overlapping genes with strong selection signatures (at top 0.5% of Fst value), including LETM1 (Fst = 0.490), TXNRD2 (Fst=0.440) and STUB1 (Fst=0.420) within taurine-type breeds, and NOXA1 (Fst = 0.233), RUVBL1 (Fst=0.222) and SLC4A3 (Fst=0.154) within hybrids, were potentially involved in the adaptation to hypoxia. Thus, we provide a new profile of population structure from the CNV aspects of Chinese indigenous cattle and new insights into high-altitude adaptation in cattle.
Project description:A phylogenetic analysis of seven different species (human, mouse, rat, worm, fly, yeast, and plant) utilizing all (541) basic helix-loop-helix (bHLH) genes identified, including expressed sequence tags (EST), was performed. A super-tree involving six clades and a structural categorization involving the entire coding sequence was established. A nomenclature was developed based on clade distribution to discuss the functional and ancestral relationships of all the genes. The position/location of specific genes on the phylogenetic tree in relation to known bHLH factors allows for predictions of the potential functions of uncharacterized bHLH factors, including EST's. A genomic analysis using microarrays for four different mouse cell types (i.e. Sertoli, Schwann, thymic, and muscle) was performed and considered all known bHLH family members on the microarray for comparison. Cell-specific groups of bHLH genes helped clarify those bHLH genes potentially involved in cell specific differentiation. This phylogenetic and genomic analysis of the bHLH gene family has revealed unique aspects of the evolution and functional relationships of the different genes in the bHLH gene family. PMID: 18557763 We used microarrays to determine bHLH expression in 20d rat Sertoli cells.
Project description:The genus Flaveria has been extensively used as a model to study the evolution of C4 photosynthesis as it contains both C3 and C4 species as well as a number of species that exhibit intermediate types of photosynthesis. The current phylogenetic tree of the Flaveria genus contains 21 of the 23 known Flaveria species and has been constructed using a combination of morphologicial data and three non-coding DNA sequences (nuclear encoded ETS, ITS and chloroplast encoded trnl-F). However, recent studies have suggested that phylogenetic trees inferred using a small number of molecular sequences may often be incorrect. Moreover, studies in other genera have often shown substantial differences between trees inferred using morphological data and those using molecular sequence. To provide new insight into the phylogeny of the genus Flaveria we utilize RNA-Seq data to construct a multi-gene concatenated phylogenetic tree of 17 Flaveria species. Furthermore, we use this new data to identify 14 C4 specific non-synonymous mutation sites, 12 of which (86%) can be independently verified by public sequence data. We propose that the data collection method provided in this study can be used as a generic method for facilitating phylogenetic tree reconstruction in the absence of reference genomes for the target species.