Project description:Naturally occurring networks exhibit quantitative features revealing underlying growth mechanisms. Numerous network mechanisms have recently been proposed to reproduce specific properties such as degree distributions or clustering coefficients. We present a method for inferring the mechanism most accurately capturing a given network topology, exploiting discriminative tools from machine learning. The Drosophila melanogaster protein network is confidently and robustly (to noise and training data subsampling) classified as a duplication-mutation-complementation network over preferential attachment, small-world, and a duplication-mutation mechanism without complementation. Systematic classification, rather than statistical study of specific properties, provides a discriminative approach to understand the design of complex networks.
Project description:BackgroundPotassium channels are the largest and most diverse type of ion channel found in nature. The completion of the sequencing of the genomes of Drosophila melanogaster and Anopheles gambiae, which belong to the same order, the Diptera, allows us to compare and contrast K+-channel genes and gene families present within the genomes of two dipterans.ResultsThis study identifies at least eight voltage-gated K+-channel genes in Anopheles, as well as three Slo-family, three Eag-family and six inward rectifier K+-channel genes. The genomic organization of K+-channel genes from Drosophila and Anopheles is well conserved. The sequence identity of the most similar K+-channel gene products between these two species ranges from 42% to 98%, with a mean value of 85%. Although most K+-channel genes in Drosophila and Anopheles are present in a 1:1 ratio, Anopheles has more genes in three K+-channel types, namely KQT, Kv3, and inward rectifier channels. Microsynteny between the genes flanking K+-channel genes in Drosophila and Anopheles was seldom observed; however, most of the K+-channel genes are indeed located at positions which a previous genome-wide comparison has designated as homologous chromosomal regions.ConclusionsThe Anopheles genome encodes more voltage-gated and inward rectifier K+-channel genes than that of Drosophila. Despite the conservation of intron-exon boundaries, orthologs of genes flanking K+-channel genes in Drosophila are generally not found adjacent to the Anopheles K+-channel orthologs, suggesting that extensive translocation of genes has occurred since the divergence of these two organisms.
Project description:We performed genome-wide sequence comparisons at the protein coding level between the genome sequences of Drosophila melanogaster and Anopheles gambiae. Such comparisons detect evolutionarily conserved regions (ecores) that can be used for a qualitative and quantitative evaluation of the available annotations of both genomes. They also provide novel candidate features for annotation. The percentage of ecores mapping outside annotations in the A. gambiae genome is about fourfold higher than in D. melanogaster. The A. gambiae genome assembly also contains a high proportion of duplicated ecores, possibly resulting from artefactual sequence duplications in the genome assembly. The occurrence of 4063 ecores in the D. melanogaster genome outside annotations suggests that some genes are not yet or only partially annotated. The present work illustrates the power of comparative genomics approaches towards an exhaustive and accurate establishment of gene models and gene catalogues in insect genomes.
Project description:Generating reference maps of interactome networks illuminates genetic studies by providing a protein-centric approach to finding new components of existing pathways, complexes, and processes. We apply state-of-the-art methods to identify binary protein-protein interactions (PPIs) for Drosophila melanogaster. Four all-by-all yeast two-hybrid (Y2H) screens of > 10,000 Drosophila proteins result in the 'FlyBi' dataset of 8723 PPIs among 2939 proteins. Testing subsets of data from FlyBi and previous PPI studies using an orthogonal assay allows for normalization of data quality; subsequent integration of FlyBi and previous data results in an expanded binary Drosophila reference interaction network, DroRI, comprising 17,232 interactions among 6511 proteins. We use FlyBi data to generate an autophagy network, then validate in vivo using autophagy-related assays. The deformed wings (dwg) gene encodes a protein that is both a regulator and a target of autophagy. Altogether, these resources provide a foundation for building new hypotheses regarding protein networks and function.
Project description:BackgroundIn eukaryotic cells, oxidative phosphorylation (OXPHOS) uses the products of both nuclear and mitochondrial genes to generate cellular ATP. Interspecies comparative analysis of these genes, which appear to be under strong functional constraints, may shed light on the evolutionary mechanisms that act on a set of genes correlated by function and subcellular localization of their products.ResultsWe have identified and annotated the Drosophila melanogaster, D. pseudoobscura and Anopheles gambiae orthologs of 78 nuclear genes encoding mitochondrial proteins involved in oxidative phosphorylation by a comparative analysis of their genomic sequences and organization. We have also identified 47 genes in these three dipteran species each of which shares significant sequence homology with one of the above-mentioned OXPHOS orthologs, and which are likely to have originated by duplication during evolution. Gene structure and intron length are essentially conserved in the three species, although gain or loss of introns is common in A. gambiae. In most tissues of D. melanogaster and A. gambiae the expression level of the duplicate gene is much lower than that of the original gene, and in D. melanogaster at least, its expression is almost always strongly testis-biased, in contrast to the soma-biased expression of the parent gene.ConclusionsQuickly achieving an expression pattern different from the parent genes may be required for new OXPHOS gene duplicates to be maintained in the genome. This may be a general evolutionary mechanism for originating phenotypic changes that could lead to species differentiation.
Project description:BackgroundSpecific genomic loci, termed Piwi-interacting RNA (piRNA) clusters, manufacture piRNAs that serve as guides for the inactivation of complementary transposable elements (TEs). The piRNA pathway has been accurately detailed in Drosophila melanogaster, while it remains poorly examined in other insects. This pathway is increasingly recognized as critical for germline development and reproduction. Understanding of the piRNA functions in mosquitoes could offer an opportunity for disease vector control by the reduction of their reproductive potential.ResultsTo analyze the similarities and differences in this pathway between Drosophila and mosquito, we performed an in-depth analysis of the genomic loci producing piRNAs and their targets in the African malaria vector Anopheles gambiae. We identified 187 piRNA clusters in the An. gambiae genome and 155 piRNA clusters in the D. melanogaster genome. We demonstrate that many more piRNA clusters in the mosquito compared with the fruit fly are uni-directionally transcribed and are located outside pericentromeric heterochromatin. About 11 % of the An. gambiae piRNA population map to gene transcripts. This is a noticeable increase compared with the ~6 % of the piRNA population mapped to genes in D. melanogaster. A subset of the piRNA-enriched genes in An. gambiae has functions related to reproduction and development. At least 24 and 65 % of the mapped piRNAs correspond to genomic TE sequences in An. gambiae and D. melanogaster, respectively. DNA transposons and non-LTR retrotransposons are more abundant in An. gambiae, while LTR retrotransposons are more abundant in D. melanogaster. Yet, piRNAs predominantly target LTR retrotransposons in both species, which may point to a distinct feature of these elements compared to the other classes of TEs concerning their silencing by the piRNA pathway.ConclusionsHere, we demonstrate that piRNA-producing loci have more ubiquitous distribution in the An. gambiae genome than in the genome of D. melanogaster. Also, protein-coding genes have an increased role in production of piRNAs in the germline of this mosquito. Genes involved in germline and embryonic development of An. gambiae generate a substantial portion of piRNAs, suggesting a role of the piRNA pathway in the epigenetic regulation of the reproductive processes in the African malaria vector.
Project description:The African mosquito Anopheles gambiae is the major vector of human malaria. We report a genome-wide survey of mosquito gene expression profiles clustered temporally into developmental programs and spatially into adult tissue-specific patterns. Global expression analysis shows that genes that belong to related functional categories or that encode the same or functionally linked protein domains are associated with characteristic developmental programs or tissue patterns. Comparative analysis of our data together with data published from Drosophila melanogaster reveal an overall strong and positive correlation of developmental expression between orthologous genes. The degree of correlation varies, depending on association of orthologs with certain developmental programs or functional groups. Interestingly, the similarity of gene expression is not correlated with the coding sequence similarity of orthologs, indicating that expression profiles and coding sequences evolve independently. In addition to providing a comprehensive view of temporal and spatial gene expression during the A. gambiae life cycle, this large-scale comparative transcriptomic analysis has detected important evolutionary features of insect transcriptomes.
Project description:Determining the composition of protein complexes is an essential step toward understanding the cell as an integrated system. Using coaffinity purification coupled to mass spectrometry analysis, we examined protein associations involving nearly 5,000 individual, FLAG-HA epitope-tagged Drosophila proteins. Stringent analysis of these data, based on a statistical framework designed to define individual protein-protein interactions, led to the generation of a Drosophila protein interaction map (DPiM) encompassing 556 protein complexes. The high quality of the DPiM and its usefulness as a paradigm for metazoan proteomes are apparent from the recovery of many known complexes, significant enrichment for shared functional attributes, and validation in human cells. The DPiM defines potential novel members for several important protein complexes and assigns functional links to 586 protein-coding genes lacking previous experimental annotation. The DPiM represents, to our knowledge, the largest metazoan protein complex map and provides a valuable resource for analysis of protein complex evolution.
Project description:Genome evolution entails changes in the DNA sequence of genes and intergenic regions, changes in gene numbers, and also changes in gene order along the chromosomes. Genes are reshuffled by chromosomal rearrangements such as deletions/insertions, inversions, translocations, and transpositions. Here we report a comparative study of genome organization in the main African malaria vector, Anopheles gambiae, relative to the recently determined sequence of the Drosophila melanogaster genome. The ancestral lines of these two dipteran insects are thought to have separated approximately 250 Myr, a long period that makes this genome comparison especially interesting. Sequence comparisons have identified 113 pairs of putative orthologs of the two species. Chromosomal mapping of orthologous genes reveals that each polytene chromosome arm has a homolog in the other species. Between 41% and 73% of the known orthologous genes remain linked in the respective homologous chromosomal arms, with the remainder translocated to various nonhomologous arms. Within homologous arms, gene order is extensively reshuffled, but a limited degree of conserved local synteny (microsynteny) can be recognized.
Project description:Many biological molecules are assembled into supramolecules that are essential to perform complicated functions in the cell. However, experimental information about the structures of supramolecules is not sufficient at this point. We developed a method of predicting and modeling the structures of supramolecules in a biological network by combining structural data of the Protein Data Bank (PDB) and interaction data in IntAct databases. Templates for binary complexes in IntAct were extracted from PDB. Modeling was attempted by assembling binary complexes with superposed shared subunits. A total of 3,197 models were constructed, and 1,306 (41% of the total) contained at least one subunit absent from experimental structures. The models also suggested 970 (25% of the total) experimentally undetected subunit interfaces, and 41 human disease-related amino acid variants were mapped onto these model-suggested interfaces. The models demonstrated that protein-protein interaction network modeling is useful to fill the information gap between biological networks and structures.