Unknown

Dataset Information

0

IOri-Human: identify human origin of replication by incorporating dinucleotide physicochemical properties into pseudo nucleotide composition.


ABSTRACT: The initiation of replication is an extremely important process in DNA life cycle. Given an uncharacterized DNA sequence, can we identify where its origin of replication (ORI) is located? It is no doubt a fundamental problem in genome analysis. Particularly, with the rapid development of genome sequencing technology that results in a huge amount of sequence data, it is highly desired to develop computational methods for rapidly and effectively identifying the ORIs in these genomes. Unfortunately, by means of the existing computational methods, such as sequence alignment or kmer strategies, it could hardly achieve decent success rates. To address this problem, we developed a predictor called "iOri-Human". Rigorous jackknife tests have shown that its overall accuracy and stability in identifying human ORIs are over 75% and 50%, respectively. In the predictor, it is through the pseudo nucleotide composition (an extension of pseudo amino acid composition) that 96 physicochemical properties for the 16 possible constituent dinucleotides have been incorporated to reflect the global sequence patterns in DNA as well as its local sequence patterns. Moreover, a user-friendly web-server for iOri-Human has been established at http://lin.uestc.edu.cn/server/iOri-Human.html, by which users can easily get their desired results without the need to through the complicated mathematics involved.

SUBMITTER: Zhang CJ 

PROVIDER: S-EPMC5342515 | biostudies-literature | 2016 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

iOri-Human: identify human origin of replication by incorporating dinucleotide physicochemical properties into pseudo nucleotide composition.

Zhang Chang-Jian CJ   Tang Hua H   Li Wen-Chao WC   Lin Hao H   Chen Wei W   Chou Kuo-Chen KC  

Oncotarget 20161001 43


The initiation of replication is an extremely important process in DNA life cycle. Given an uncharacterized DNA sequence, can we identify where its origin of replication (ORI) is located? It is no doubt a fundamental problem in genome analysis. Particularly, with the rapid development of genome sequencing technology that results in a huge amount of sequence data, it is highly desired to develop computational methods for rapidly and effectively identifying the ORIs in these genomes. Unfortunately  ...[more]

Similar Datasets

| S-EPMC5085147 | biostudies-literature
| S-EPMC3616736 | biostudies-literature
| S-EPMC5522291 | biostudies-literature
| S-EPMC4055483 | biostudies-literature
| S-EPMC6759546 | biostudies-literature
| S-EPMC3958819 | biostudies-literature
| S-EPMC6983959 | biostudies-literature
| S-EPMC2883347 | biostudies-literature
| S-EPMC4411653 | biostudies-literature
| S-EPMC3506597 | biostudies-literature