Unknown

Dataset Information

0

Non-canonical RNA-DNA differences and other human genomic features are enriched within very short tandem repeats.


ABSTRACT: Very short tandem repeats bear substantial genetic, evolutional, and pathological significance in genome analyses. Here, we compiled a census of tandem mono-nucleotide/di-nucleotide/tri-nucleotide repeats (MNRs/DNRs/TNRs) in GRCh38, which we term "polytracts" in general. Of the human genome, 144.4 million nucleotides (4.7%) are occupied by polytracts, and 0.47 million single nucleotides are identified as polytract hinges, i.e., break-points of tandem polytracts. Preliminary exploration of the census suggested polytract hinge sites and boundaries of AAC polytracts may bear a higher mapping error rate than other polytract regions. Further, we revealed landscapes of polytract enrichment with respect to nearly a hundred genomic features. We found MNRs, DNRs, and TNRs displayed noticeable difference in terms of locational enrichment for miscellaneous genomic features, especially RNA editing events. Non-canonical and C-to-U RNA-editing events are enriched inside and/or adjacent to MNRs, while all categories of RNA-editing events are under-represented in DNRs. A-to-I RNA-editing events are generally under-represented in polytracts. The selective enrichment of non-canonical RNA-editing events within MNR adjacency provides a negative evidence against their authenticity. To enable similar locational enrichment analyses in relation to polytracts, we developed a software Polytrap which can handle 11 reference genomes. Additionally, we compiled polytracts of four model organisms into a Track Hub which can be integrated into USCS Genome Browser as an official track for convenient visualization of polytracts.

SUBMITTER: Yu H 

PROVIDER: S-EPMC7302867 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Non-canonical RNA-DNA differences and other human genomic features are enriched within very short tandem repeats.

Yu Hui H   Zhao Shilin S   Ness Scott S   Kang Huining H   Sheng Quanhu Q   Samuels David C DC   Oyebamiji Olufunmilola O   Zhao Ying-Yong YY   Guo Yan Y  

PLoS computational biology 20200608 6


Very short tandem repeats bear substantial genetic, evolutional, and pathological significance in genome analyses. Here, we compiled a census of tandem mono-nucleotide/di-nucleotide/tri-nucleotide repeats (MNRs/DNRs/TNRs) in GRCh38, which we term "polytracts" in general. Of the human genome, 144.4 million nucleotides (4.7%) are occupied by polytracts, and 0.47 million single nucleotides are identified as polytract hinges, i.e., break-points of tandem polytracts. Preliminary exploration of the ce  ...[more]

Similar Datasets

| S-EPMC5026258 | biostudies-literature
| S-EPMC7549362 | biostudies-literature
| S-EPMC5894186 | biostudies-literature
2018-03-01 | E-MTAB-6411 | biostudies-arrayexpress
| S-EPMC2773041 | biostudies-literature
| S-EPMC4725869 | biostudies-literature
| S-EPMC8161180 | biostudies-literature
| S-EPMC5103432 | biostudies-literature
| S-EPMC4090240 | biostudies-literature
| S-EPMC6206671 | biostudies-literature