Project description:DNA sequence and annotation of the entire human chromosome 7, encompassing nearly 158 million nucleotides of DNA and 1917 gene structures, are presented. To generate a higher order description, additional structural features such as imprinted genes, fragile sites, and segmental duplications were integrated at the level of the DNA sequence with medical genetic data, including 440 chromosome rearrangement breakpoints associated with disease. This approach enabled the discovery of candidate genes for developmental diseases including autism.
Project description:Chromosome 9 is highly structurally polymorphic. It contains the largest autosomal block of heterochromatin, which is heteromorphic in 6-8% of humans, whereas pericentric inversions occur in more than 1% of the population. The finished euchromatic sequence of chromosome 9 comprises 109,044,351 base pairs and represents >99.6% of the region. Analysis of the sequence reveals many intra- and interchromosomal duplications, including segmental duplications adjacent to both the centromere and the large heterochromatic block. We have annotated 1,149 genes, including genes implicated in male-to-female sex reversal, cancer and neurodegenerative disease, and 426 pseudogenes. The chromosome contains the largest interferon gene cluster in the human genome. There is also a region of exceptionally high gene and G + C content including genes paralogous to those in the major histocompatibility complex. We have also detected recently duplicated genes that exhibit different rates of sequence divergence, presumably reflecting natural selection.
Project description:The human X chromosome has a unique biology that was shaped by its evolution as the sex chromosome shared by males and females. We have determined 99.3% of the euchromatic sequence of the X chromosome. Our analysis illustrates the autosomal origin of the mammalian sex chromosomes, the stepwise process that led to the progressive loss of recombination between X and Y, and the extent of subsequent degradation of the Y chromosome. LINE1 repeat elements cover one-third of the X chromosome, with a distribution that is consistent with their proposed role as way stations in the process of X-chromosome inactivation. We found 1,098 genes in the sequence, of which 99 encode proteins expressed in testis and in various tumour types. A disproportionately high number of mendelian diseases are documented for the X chromosome. Of this number, 168 have been explained by mutations in 113 X-linked genes, which in many cases were characterized with the aid of the DNA sequence.
Project description:Chromosome 13 is the largest acrocentric human chromosome. It carries genes involved in cancer including the breast cancer type 2 (BRCA2) and retinoblastoma (RB1) genes, is frequently rearranged in B-cell chronic lymphocytic leukaemia, and contains the DAOA locus associated with bipolar disorder and schizophrenia. We describe completion and analysis of 95.5 megabases (Mb) of sequence from chromosome 13, which contains 633 genes and 296 pseudogenes. We estimate that more than 95.4% of the protein-coding genes of this chromosome have been identified, on the basis of comparison with other vertebrate genome sequences. Additionally, 105 putative non-coding RNA genes were found. Chromosome 13 has one of the lowest gene densities (6.5 genes per Mb) among human chromosomes, and contains a central region of 38 Mb where the gene density drops to only 3.1 genes per Mb.
Project description:A human cDNA encoding a protein homologous to the Escherichia coli DNA topoisomerase I subfamily of enzymes has been identified through cloning and sequencing. Expressing the cloned human cDNA in yeast (delta)top1 cells lacking endogenous DNA topoisomerase I yielded an activity in cell extracts that specifically reduces the number of supercoils in a highly negatively supercoiled DNA. On the basis of these results, the human gene containing the cDNA sequence has been denoted TOP3, and the protein it encodes has been denoted DNA topoisomerase III. Screening of a panel of human-rodent somatic hybrids and fluorescence in situ hybridization of cloned TOP3 genomic DNA to metaphase chromosomes indicate that human TOP3 is a single-copy gene located at chromosome 17p11.2-12.
Project description:Chromosomal rearrangements are frequently monitored by fluorescence in situ hybridization (FISH) using large, recombinant DNA probes consisting of contiguous genomic intervals that are often distant from disease loci. We developed smaller, targeted, single-copy probes directly from the human genome sequence. These single-copy FISH (scFISH) probes were designed by computational sequence analysis of approximately 100-kb genomic sequences. ScFISH probes are produced by long PCR, then purified, labeled, and hybridized individually or in combination to human chromosomes. Preannealing or blocking with unlabeled, repetitive DNA is unnecessary, as scFISH probes lack repetitive DNA sequences. The hybridization results are analogous to conventional FISH, except that shorter probes can be readily visualized. Combinations of probes from the same region gave single hybridization signals on metaphase chromosomes. ScFISH probes are produced directly from genomic DNA, and thus more quickly than by recombinant DNA techniques. We developed single-copy probes for three chromosomal regions-the CDC2L1 (chromosome 1p36), MAGEL2 (chromosome 15q11.2), and HIRA (chromosome 22q11.2) genes-and show their utility for FISH. The smallest probe tested was 2290 bp in length. To assess the potential utility of scFISH for high-resolution analysis, we determined chromosomal distributions of such probes. Single-copy intervals of this length or greater are separated by an average of 29.2 and 22.3 kb on chromosomes 21 and 22, respectively. This indicates that abnormalities seen on metaphase chromosomes could be characterized with scFISH probes at a resolution greater than previously possible.
Project description:Chromosome 17 is unusual among the human chromosomes in many respects. It is the largest human autosome with orthology to only a single mouse chromosome, mapping entirely to the distal half of mouse chromosome 11. Chromosome 17 is rich in protein-coding genes, having the second highest gene density in the genome. It is also enriched in segmental duplications, ranking third in density among the autosomes. Here we report a finished sequence for human chromosome 17, as well as a structural comparison with the finished sequence for mouse chromosome 11, the first finished mouse chromosome. Comparison of the orthologous regions reveals striking differences. In contrast to the typical pattern seen in mammalian evolution, the human sequence has undergone extensive intrachromosomal rearrangement, whereas the mouse sequence has been remarkably stable. Moreover, although the human sequence has a high density of segmental duplication, the mouse sequence has a very low density. Notably, these segmental duplications correspond closely to the sites of structural rearrangement, demonstrating a link between duplication and rearrangement. Examination of the main classes of duplicated segments provides insight into the dynamics underlying expansion of chromosome-specific, low-copy repeats in the human genome.