Project description: BACKGROUND: Recent advances in genome sequencing technologies and the falling cost of high-throughput sequencing continue to produce a deluge of data for downstream analyses. Evolutionary biologists, among others, often use genomic data to uncover phenotypic diversity and adaptive evolution in protein-coding genes. This requires that multiple sequence alignments (MSA) and phylogenetic trees (PT) be estimated as accurately as possible. However, preparing an initial dataset of multiple sequence file(s) (MSF), and the steps that follow, can be challenging with extensive amounts of data. A tool is therefore needed that removes potential sources of error and automates the time-consuming steps of a typical workflow, with high throughput and optimal MSA and PT estimation. RESULTS: We introduce LMAP_S (Lightweight Multigene Alignment and Phylogeny eStimation), a user-friendly command-line and interactive package designed to handle an improved alignment and phylogeny estimation workflow: MSF preparation, MSA estimation, outlier detection, refinement, consensus, phylogeny estimation, comparison and editing. File and directory organization, execution and manipulation of information are automated, with minimal manual user intervention. LMAP_S was developed for the multi-core workstation environment and provides a unique advantage for processing multiple datasets. The software proved efficient throughout the workflow, including the (unlimited) handling of more than 20 datasets. CONCLUSIONS: We have developed a simple and versatile package, LMAP_S, that enables researchers to effectively estimate MSAs and PTs for multiple datasets in a high-throughput fashion. LMAP_S integrates more than 25 software packages, providing more than 65 algorithm choices overall, distributed across five stages. At minimum, one FASTA file is required within a single input directory. 
To our knowledge, no other software combines MSA and phylogeny estimation with as many alternatives while also providing the means to find optimal MSAs and phylogenies. Moreover, a case study comparing methodologies highlighted the usefulness of our software. LMAP_S has been developed as an open-source package, allowing its integration into more complex open-source bioinformatics pipelines. The LMAP_S package is released under the GPLv3 license and is freely available at https://lmap-s.sourceforge.io/.
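
The stated input requirement (at least one FASTA file within a single input directory) can be illustrated with a small pre-flight check. This is a minimal sketch in Python and is not part of LMAP_S: the function names, the list of file extensions, and the idea of counting records by their `>` header lines are all assumptions made for illustration.

```python
from pathlib import Path

# Assumed set of FASTA-like extensions; not taken from the LMAP_S documentation.
FASTA_SUFFIXES = {".fasta", ".fas", ".fa", ".fna", ".faa"}


def find_fasta_files(input_dir):
    """Return the FASTA-like files found directly inside input_dir, sorted by path."""
    return sorted(
        p for p in Path(input_dir).iterdir()
        if p.is_file() and p.suffix.lower() in FASTA_SUFFIXES
    )


def count_sequences(fasta_path):
    """Count records in a FASTA file by counting header lines starting with '>'."""
    with open(fasta_path, encoding="utf-8") as handle:
        return sum(1 for line in handle if line.startswith(">"))


def check_input_dir(input_dir):
    """Map each FASTA file name to its record count; raise if no FASTA file exists."""
    files = find_fasta_files(input_dir)
    if not files:
        raise ValueError(f"no FASTA file found in {input_dir}")
    return {p.name: count_sequences(p) for p in files}
```

Calling `check_input_dir("my_datasets")` on a hypothetical directory would return a dictionary of file names and sequence counts, which is a quick way to spot empty or misnamed files before starting an alignment run.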

Project description: BACKGROUND: Erinaceidae is a family of small mammals that includes the spiny hedgehogs (Erinaceinae) and the silky-furred moonrats and gymnures (Galericinae). These animals are widely distributed across Eurasia and Africa, from the tundra to the tropics and from deserts to damp forests. Their importance lies in the fact that they are the oldest known living placental mammals and are well represented in the fossil record, a rarity given their size and vulnerability to destruction during fossilization. Although the family has been well studied, its phylogenetic relationships remain controversial. To test previous phylogenetic hypotheses, we combined molecular and morphological data sets, including representatives of all the genera. METHODOLOGY AND PRINCIPAL FINDINGS: The analyses included 3,218 bp of mitochondrial genes, 135 morphological characters, 22 extant erinaceid taxa, and 5 outgroup taxa. Phylogenetic relationships were reconstructed using both partitioned and combined data sets. As in previous analyses, our results strongly support the monophyly of both subfamilies (Galericinae and Erinaceinae), the Hylomys group (including Neotetracus and Neohylomys), and a sister relationship between Atelerix and Erinaceus. We also verified that the extremely long branch lengths within the Galericinae are consistent with their fossil record. Not surprisingly, we found significant incongruence between the phylogenetic signals of the genes and the morphological characters, specifically in the case of Hylomys parvus, Mesechinus, and the relationships between Hemiechinus and Paraechinus. 
CONCLUSIONS: Although we discovered new clues to the evolutionary relationships within the Erinaceidae, our results nonetheless strongly suggest that more robust analyses, employing more complete taxon sampling (including fossils) and multiple unlinked genes, would greatly enhance our understanding of the family. Until then, we have left the nomenclature of the taxa unchanged; hence it does not yet precisely reflect their phylogenetic relationships or the depth of their genetic diversity.

Project description: Introduction: Telomeres are composed of tandem repeat sequences located at the ends of chromosomes and are required to maintain genomic stability. Telomeres can become shorter due to cell division and specific lifestyle factors. Critically shortened telomeres are linked to cellular dysfunction, senescence and aging. A number of studies have used low-resolution techniques to assess telomere length in the placenta. In this study, we applied Single Telomere Length Analysis (STELA), which provides high-resolution, chromosome-specific telomere length profiles, to ask whether we could obtain more detailed information on the length of individual telomeres in the placenta. Methods: Term placentas (37-42 weeks) were collected from women delivering at University Hospital of Wales or Royal Gwent Hospital within 2 h of delivery. Multiple telomere-length distributions were determined using STELA. Intraplacental variation of telomere length was analysed (N = 5). Telomere length distributions were compared between labouring (N = 10) and non-labouring (N = 11) participants. Finally, telomere length was compared between female (N = 17) and male (N = 20) placentas. Results: There were no significant influences of sampling site, mode of delivery or foetal sex on the telomere-length distributions obtained. The mean telomere length was 7.7 kb, ranging from 5.0 kb to 11.7 kb across all samples (N = 42), and was longer than in other human tissues at birth. STELA also revealed considerable telomere length heterogeneity within samples. Conclusions: We have shown that STELA can be used to study telomere length homeostasis in the placenta regardless of sampling site, mode of delivery and foetal sex. Moreover, as each amplicon is derived from a single telomeric molecule, from a single cell, STELA can reveal the full detail of telomere-length distributions, including telomeres within the length ranges observed in senescent cells. 
STELA thus provides a new tool to interrogate the relationship between telomere length and pregnancy complications linked to placental dysfunction.

Dataset Information

Corrigendum to: Phylogeny Estimation Given Sequence Length Heterogeneity.
