
Dataset Information


Ab initio identification of transcription start sites (TSSs) in the Rhesus macaque genome by histone modification and RNA-Seq


ABSTRACT: We addressed the lack of experimentally supported transcript annotations in the Rhesus macaque genome by ab initio identification of transcription start sites (TSSs). We took advantage of the ability of histone H3 lysine 4 trimethylation (H3K4me3) to mark TSSs, together with the recently developed ChIP-Seq and RNA-Seq technologies, to survey transcript structures in the macaque brain. We then integrated these two types of newly generated data with genomic sequence features and extended a TSS prediction algorithm, which allowed us to predict ab initio and verify 16,833 previously electronically annotated TSSs at 500 bp resolution and to predict ~10,000 new TSSs. The ChIP-Seq, RNA-Seq and small RNA-Seq data (previously uploaded to GEO as GSM450615 by our collaborator) were integrated with genomic sequence features, and a state-of-the-art TSS prediction algorithm was extended and improved for this purpose.

ORGANISM(S): Macaca mulatta

SUBMITTER: Dali Han 

PROVIDER: E-GEOD-24538 | biostudies-arrayexpress

REPOSITORIES: biostudies-arrayexpress


Publications

Ab initio identification of transcription start sites in the Rhesus macaque genome by histone modification and RNA-Seq.

Liu Yi, Han Dali, Han Yixing, Yan Zheng, Xie Bin, Li Jing, Qiao Nan, Hu Haiyang, Khaitovich Philipp, Gao Yuan, Han Jing-Dong J

Nucleic Acids Research, 2010-10-14, issue 4


Rhesus macaque is a widely used primate model organism. Its genome annotations are, however, still largely comparative computational predictions derived mainly from human genes, which precludes studies of macaque-specific genes, gene isoforms or their regulation. Here we took advantage of the ability of histone H3 lysine 4 trimethylation (H3K4me3) to mark transcription start sites (TSSs), together with the recently developed ChIP-Seq and RNA-Seq technologies, to survey the transcript structures. We generated 14 ...

Similar Datasets

2010-10-19 | GSE24538 | GEO
| PRJNA132563 | ENA
2014-10-17 | E-GEOD-55862 | biostudies-arrayexpress
2015-12-21 | E-GEOD-75398 | biostudies-arrayexpress
2016-03-01 | E-GEOD-61910 | biostudies-arrayexpress
2013-07-01 | E-MTAB-1700 | biostudies-arrayexpress
2014-07-27 | E-GEOD-56356 | biostudies-arrayexpress
2013-07-23 | E-GEOD-49114 | biostudies-arrayexpress
2016-03-01 | GSE78771 | GEO
2016-03-01 | E-GEOD-78771 | biostudies-arrayexpress