Unknown

Dataset Information

0

Optimized splitting of mixed-species RNA sequencing data.


ABSTRACT: Gene expression studies using xenograft transplants or co-culture systems, usually with mixed human and mouse cells, have proven to be valuable to uncover cellular dynamics during development or in disease models. However, the mRNA sequence similarities among species presents a challenge for accurate transcript quantification. To identify optimal strategies for analyzing mixed-species RNA sequencing data, we evaluate both alignment-dependent and alignment-independent methods. Alignment of reads to a pooled reference index is effective, particularly if optimal alignments are used to classify sequencing reads by species, which are re-aligned with individual genomes, generating [Formula: see text] accuracy across a range of species ratios. Alignment-independent methods, such as convolutional neural networks, which extract the conserved patterns of sequences from two species, classify RNA sequencing reads with over 85% accuracy. Importantly, both methods perform well with different ratios of human and mouse reads. While non-alignment strategies successfully partitioned reads by species, a more traditional approach of mixed-genome alignment followed by optimized separation of reads proved to be the more successful with lower error rates.

SUBMITTER: Song X 

PROVIDER: S-EPMC9081140 | biostudies-literature | 2022 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Optimized splitting of mixed-species RNA sequencing data.

Song Xuan X   Gao Hai Yun HY   Herrup Karl K   Hart Ronald P RP  

Journal of bioinformatics and computational biology 20220106 2


Gene expression studies using xenograft transplants or co-culture systems, usually with mixed human and mouse cells, have proven to be valuable to uncover cellular dynamics during development or in disease models. However, the mRNA sequence similarities among species presents a challenge for accurate transcript quantification. To identify optimal strategies for analyzing mixed-species RNA sequencing data, we evaluate both alignment-dependent and alignment-independent methods. Alignment of reads  ...[more]

Similar Datasets

| S-EPMC8547352 | biostudies-literature
| S-EPMC7360662 | biostudies-literature
| S-EPMC7553720 | biostudies-literature
| S-EPMC4029068 | biostudies-literature
| S-EPMC10576752 | biostudies-literature
| S-EPMC8934446 | biostudies-literature
| S-EPMC7894898 | biostudies-literature
| S-EPMC5549930 | biostudies-literature
| S-EPMC6808545 | biostudies-literature
| S-EPMC8526306 | biostudies-literature