Unknown

Dataset Information

0

Genome-wide in silico identification and analysis of cis natural antisense transcripts (cis-NATs) in ten species.


ABSTRACT: We developed a fast, integrative pipeline to identify cis natural antisense transcripts (cis-NATs) at genome scale. The pipeline mapped mRNAs and ESTs in UniGene to genome sequences in GoldenPath to find overlapping transcripts and combining information from coding sequence, poly(A) signal, poly(A) tail and splicing sites to deduce transcription orientation. We identified cis-NATs in 10 eukaryotic species, including 7830 candidate sense-antisense (SA) genes in 3915 SA pairs in human. The abundance of SA genes is remarkably low in worm and does not seem to be caused by the prevalence of operons. Hundreds of SA pairs are conserved across different species, even maintaining the same overlapping patterns. The convergent SA class is prevalent in fly, worm and sea squirt, but not in human or mouse as reported previously. The percentage of SA genes among imprinted genes in human and mouse is 24-47%, a range between the two previous reports. There is significant shortage of SA genes on Chromosome X in human and mouse but not in fly or worm, supporting X-inactivation in mammals as a possible cause. SA genes are over-represented in the catalytic activities and basic metabolism functions. All candidate cis-NATs can be downloaded from http://nats.cbi.pku.edu.cn/download/.

SUBMITTER: Zhang Y 

PROVIDER: S-EPMC1524920 | biostudies-literature | 2006

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genome-wide in silico identification and analysis of cis natural antisense transcripts (cis-NATs) in ten species.

Zhang Yong Y   Liu X Shirley XS   Liu Qing-Rong QR   Wei Liping L  

Nucleic acids research 20060718 12


We developed a fast, integrative pipeline to identify cis natural antisense transcripts (cis-NATs) at genome scale. The pipeline mapped mRNAs and ESTs in UniGene to genome sequences in GoldenPath to find overlapping transcripts and combining information from coding sequence, poly(A) signal, poly(A) tail and splicing sites to deduce transcription orientation. We identified cis-NATs in 10 eukaryotic species, including 7830 candidate sense-antisense (SA) genes in 3915 SA pairs in human. The abundan  ...[more]

Similar Datasets

| S-EPMC5946162 | biostudies-literature
| S-EPMC1088958 | biostudies-literature
| S-EPMC7445176 | biostudies-literature
| S-EPMC3287164 | biostudies-literature
| S-EPMC1369008 | biostudies-literature
2019-02-16 | GSE116553 | GEO
| S-EPMC7910453 | biostudies-literature
| S-EPMC4170719 | biostudies-literature
| S-EPMC2262095 | biostudies-literature
| S-EPMC4463847 | biostudies-other