Unknown

Dataset Information

0

More Accurate Transcript Assembly via Parameter Advising.


ABSTRACT: Computational tools used for genomic analyses are becoming more accurate but also increasingly sophisticated and complex. This introduces a new problem in that these pieces of software have a large number of tunable parameters that often have a large influence on the results that are reported. We quantify the impact of parameter choice on transcript assembly and take some first steps toward generating a truly automated genomic analysis pipeline by developing a method for automatically choosing input-specific parameter values for reference-based transcript assembly using the Scallop tool. By choosing parameter values for each input, the area under the receiver operator characteristic curve (AUC) when comparing assembled transcripts to a reference transcriptome is increased by an average of 28.9% over using only the default parameter choices on 1595 RNA-Seq samples in the Sequence Read Archive. This approach is general, and when applied to StringTie, it increases the AUC by an average of 13.1% on a set of 65 RNA-Seq experiments from ENCODE. Parameter advisors for both Scallop and StringTie are available on Github.

SUBMITTER: Deblasio D 

PROVIDER: S-EPMC7415876 | biostudies-literature | 2020 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

More Accurate Transcript Assembly via Parameter Advising.

Deblasio Dan D   Kim Kwanho K   Kingsford Carl C  

Journal of computational biology : a journal of computational molecular cell biology 20200421 8


<b>Computational tools used for genomic analyses are becoming more accurate but also increasingly sophisticated and complex. This introduces a new problem in that these pieces of software have a large number of tunable parameters that often have a large influence on the results that are reported. We quantify the impact of parameter choice on transcript assembly and take some first steps toward generating a truly automated genomic analysis pipeline by developing a method for automatically choosin  ...[more]

Similar Datasets

| S-EPMC5870651 | biostudies-literature
| S-EPMC6802306 | biostudies-literature
| S-EPMC8958329 | biostudies-literature
| S-EPMC7782737 | biostudies-literature
| S-EPMC6388693 | biostudies-literature
| S-EPMC5411767 | biostudies-literature
| S-EPMC8602663 | biostudies-literature
| S-EPMC10769104 | biostudies-literature
2013-07-15 | E-MTAB-1730 | biostudies-arrayexpress
| S-EPMC4744958 | biostudies-literature