Project description:The genus Flaveria has been extensively used as a model to study the evolution of C4 photosynthesis as it contains both C3 and C4 species as well as a number of species that exhibit intermediate types of photosynthesis. The current phylogenetic tree of the Flaveria genus contains 21 of the 23 known Flaveria species and has been constructed using a combination of morphologicial data and three non-coding DNA sequences (nuclear encoded ETS, ITS and chloroplast encoded trnl-F). However, recent studies have suggested that phylogenetic trees inferred using a small number of molecular sequences may often be incorrect. Moreover, studies in other genera have often shown substantial differences between trees inferred using morphological data and those using molecular sequence. To provide new insight into the phylogeny of the genus Flaveria we utilize RNA-Seq data to construct a multi-gene concatenated phylogenetic tree of 17 Flaveria species. Furthermore, we use this new data to identify 14 C4 specific non-synonymous mutation sites, 12 of which (86%) can be independently verified by public sequence data. We propose that the data collection method provided in this study can be used as a generic method for facilitating phylogenetic tree reconstruction in the absence of reference genomes for the target species.