Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

Targeted RT-PCR assays spanning unannotated splice junctions sequenced by Roche 454.


ABSTRACT: The ENCODE projects seeks to identify and characterize functional elements in the human genome. Throughout the scale-up phase of ENCODE, the transcriptome group has generate Long RNA-Seq, Small RNA-Seq, Cap-Analysis of Gene Expression (CAGE), and RNA-PET short read data on the Illumina platform for ~ 40 different human primary and transformed cell lines in replicate. From these data several high-resolution and discrete features/elements have been mined out (5’ caps, splice junctions, polyadenylation sites, small RNAs, etc…). However, because these data are obtained from short-read data, we have only limited “connectivity” information. For example, from the long RNA-Seq data, which was sequenced in mate-pair fashion with average insert sizes ~ 200 bp, we know that the sequence from mate 1 is physically linked to the sequence in mate 2. We don’t know the sequence in between and we don’t know how this mate-pair is connected to other mate-pairs in the context of longer transcripts in vivo. To date, this information is gleaned from models generated in silico: In our case, by the program Cufflinks. Consequently, we have a collection of transcript models exhibiting a vast array of local complexity assembled from short read data that need to be experimentally tested. Alternatively, one can “cut to the chase” and use a more raw/elemental form of the data as a basis for additional experimentation to clone out the longer sequences generated in vivo. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf 454 Data from HepG2, HUVEC, and H1 ES cells

ORGANISM(S): Homo sapiens

SUBMITTER: julien lagarde 

PROVIDER: E-GEOD-38886 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

altmetric image

Publications

Landscape of transcription in human cells.

Djebali Sarah S   Davis Carrie A CA   Merkel Angelika A   Dobin Alex A   Lassmann Timo T   Mortazavi Ali A   Tanzer Andrea A   Lagarde Julien J   Lin Wei W   Schlesinger Felix F   Xue Chenghai C   Marinov Georgi K GK   Khatun Jainab J   Williams Brian A BA   Zaleski Chris C   Rozowsky Joel J   Röder Maik M   Kokocinski Felix F   Abdelhamid Rehab F RF   Alioto Tyler T   Antoshechkin Igor I   Baer Michael T MT   Bar Nadav S NS   Batut Philippe P   Bell Kimberly K   Bell Ian I   Chakrabortty Sudipto S   Chen Xian X   Chrast Jacqueline J   Curado Joao J   Derrien Thomas T   Drenkow Jorg J   Dumais Erica E   Dumais Jacqueline J   Duttagupta Radha R   Falconnet Emilie E   Fastuca Meagan M   Fejes-Toth Kata K   Ferreira Pedro P   Foissac Sylvain S   Fullwood Melissa J MJ   Gao Hui H   Gonzalez David D   Gordon Assaf A   Gunawardena Harsha H   Howald Cedric C   Jha Sonali S   Johnson Rory R   Kapranov Philipp P   King Brandon B   Kingswood Colin C   Luo Oscar J OJ   Park Eddie E   Persaud Kimberly K   Preall Jonathan B JB   Ribeca Paolo P   Risk Brian B   Robyr Daniel D   Sammeth Michael M   Schaffer Lorian L   See Lei-Hoon LH   Shahab Atif A   Skancke Jorgen J   Suzuki Ana Maria AM   Takahashi Hazuki H   Tilgner Hagen H   Trout Diane D   Walters Nathalie N   Wang Huaien H   Wrobel John J   Yu Yanbao Y   Ruan Xiaoan X   Hayashizaki Yoshihide Y   Harrow Jennifer J   Gerstein Mark M   Hubbard Tim T   Reymond Alexandre A   Antonarakis Stylianos E SE   Hannon Gregory G   Giddings Morgan C MC   Ruan Yijun Y   Wold Barbara B   Carninci Piero P   Guigó Roderic R   Gingeras Thomas R TR  

Nature 20120901 7414


Eukaryotic cells make many types of primary and processed RNAs that are found either in specific subcellular compartments or throughout the cells. A complete catalogue of these RNAs is not yet available and their characteristic subcellular localizations are also poorly understood. Because RNA represents the direct output of the genetic information encoded by genomes and a significant proportion of a cell's regulatory capabilities are focused on its synthesis, processing, transport, modification  ...[more]

Similar Datasets

2012-06-26 | GSE38886 | GEO
| phs001255 | dbGaP
2022-05-30 | E-MTAB-11351 | biostudies-arrayexpress
2022-05-31 | PXD033870 | Pride
2020-01-23 | PXD014844 | Pride
2010-01-24 | E-GEOD-19986 | biostudies-arrayexpress
2021-02-02 | PXD023373 | Pride
2012-04-02 | E-GEOD-36025 | biostudies-arrayexpress
2010-01-25 | GSE19986 | GEO
| PRJEB67879 | ENA