Unknown

Dataset Information

0

Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences.


ABSTRACT: The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http:mgc.nci.nih.gov).

SUBMITTER: Strausberg RL 

PROVIDER: S-EPMC139241 | biostudies-literature | 2002 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences.

Strausberg Robert L RL   Feingold Elise A EA   Grouse Lynette H LH   Derge Jeffery G JG   Klausner Richard D RD   Collins Francis S FS   Wagner Lukas L   Shenmen Carolyn M CM   Schuler Gregory D GD   Altschul Stephen F SF   Zeeberg Barry B   Buetow Kenneth H KH   Schaefer Carl F CF   Bhat Narayan K NK   Hopkins Ralph F RF   Jordan Heather H   Moore Troy T   Max Steve I SI   Wang Jun J   Hsieh Florence F   Diatchenko Luda L   Marusina Kate K   Farmer Andrew A AA   Rubin Gerald M GM   Hong Ling L   Stapleton Mark M   Soares M Bento MB   Bonaldo Maria F MF   Casavant Tom L TL   Scheetz Todd E TE   Brownstein Michael J MJ   Usdin Ted B TB   Toshiyuki Shiraki S   Carninci Piero P   Prange Christa C   Raha Sam S SS   Loquellano Naomi A NA   Peters Garrick J GJ   Abramson Rick D RD   Mullahy Sara J SJ   Bosak Stephanie A SA   McEwan Paul J PJ   McKernan Kevin J KJ   Malek Joel A JA   Gunaratne Preethi H PH   Richards Stephen S   Worley Kim C KC   Hale Sarah S   Garcia Angela M AM   Gay Laura J LJ   Hulyk Stephen W SW   Villalon Debbie K DK   Muzny Donna M DM   Sodergren Erica J EJ   Lu Xiuhua X   Gibbs Richard A RA   Fahey Jessica J   Helton Erin E   Ketteman Mark M   Madan Anuradha A   Rodrigues Stephanie S   Sanchez Amy A   Whiting Michelle M   Madan Anup A   Young Alice C AC   Shevchenko Yuriy Y   Bouffard Gerard G GG   Blakesley Robert W RW   Touchman Jeffrey W JW   Green Eric D ED   Dickson Mark C MC   Rodriguez Alex C AC   Grimwood Jane J   Schmutz Jeremy J   Myers Richard M RM   Butterfield Yaron S N YS   Krzywinski Martin I MI   Skalska Ursula U   Smailus Duane E DE   Schnerch Angelique A   Schein Jacqueline E JE   Jones Steven J M SJ   Marra Marco A MA  

Proceedings of the National Academy of Sciences of the United States of America 20021211 26


The National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-OR  ...[more]

Similar Datasets

| S-EPMC3315734 | biostudies-literature
| S-EPMC1088967 | biostudies-literature
| S-EPMC151182 | biostudies-literature
| S-EPMC403704 | biostudies-literature
| S-EPMC403723 | biostudies-literature
| S-EPMC403720 | biostudies-literature
| S-EPMC3174253 | biostudies-literature
| S-EPMC2996955 | biostudies-literature
| S-EPMC393292 | biostudies-literature
| S-EPMC3017805 | biostudies-literature