Unknown

Dataset Information

0

CNIT: a fast and accurate web tool for identifying protein-coding and long non-coding transcripts based on intrinsic sequence composition.


ABSTRACT: As more and more high-throughput data has been produced by next-generation sequencing, it is still a challenge to classify RNA transcripts into protein-coding or non-coding, especially for poorly annotated species. We upgraded our original coding potential calculator, CNCI (Coding-Non-Coding Index), to CNIT (Coding-Non-Coding Identifying Tool), which provides faster and more accurate evaluation of the coding ability of RNA transcripts. CNIT runs ?200 times faster than CNCI and exhibits more accuracy compared with CNCI (0.98 versus 0.94 for human, 0.95 versus 0.93 for mouse, 0.93 versus 0.92 for zebrafish, 0.93 versus 0.92 for fruit fly, 0.92 versus 0.88 for worm, and 0.98 versus 0.85 for Arabidopsis transcripts). Moreover, the AUC values of 11 animal species and 27 plant species showed that CNIT was capable of obtaining relatively accurate identification results for almost all eukaryotic transcripts. In addition, a mobile-friendly web server is now freely available at http://cnit.noncode.org/CNIT.

SUBMITTER: Guo JC 

PROVIDER: S-EPMC6602462 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

CNIT: a fast and accurate web tool for identifying protein-coding and long non-coding transcripts based on intrinsic sequence composition.

Guo Jin-Cheng JC   Fang Shuang-Sang SS   Wu Yang Y   Zhang Jian-Hua JH   Chen Yang Y   Liu Jing J   Wu Bo B   Wu Jia-Rui JR   Li En-Min EM   Xu Li-Yan LY   Sun Liang L   Zhao Yi Y  

Nucleic acids research 20190701 W1


As more and more high-throughput data has been produced by next-generation sequencing, it is still a challenge to classify RNA transcripts into protein-coding or non-coding, especially for poorly annotated species. We upgraded our original coding potential calculator, CNCI (Coding-Non-Coding Index), to CNIT (Coding-Non-Coding Identifying Tool), which provides faster and more accurate evaluation of the coding ability of RNA transcripts. CNIT runs ∼200 times faster than CNCI and exhibits more accu  ...[more]

Similar Datasets

| S-EPMC3783192 | biostudies-literature
| S-EPMC5793834 | biostudies-literature
| S-EPMC3634467 | biostudies-literature
| S-EPMC7660903 | biostudies-literature
| S-EPMC4640205 | biostudies-literature
| S-EPMC2447787 | biostudies-literature
| S-EPMC8633610 | biostudies-literature
| S-EPMC9750101 | biostudies-literature
| S-EPMC3167602 | biostudies-literature
| S-EPMC8262746 | biostudies-literature