
Dataset Information


Definitive demonstration by synthesis of genome annotation completeness.


ABSTRACT: We develop a method for completing the genetics of natural living systems by which the absence of expected future discoveries can be established. We demonstrate the method using bacteriophage øX174, the first DNA genome to be sequenced. As with many well-studied natural organisms, closely related genome sequences are available: 23 Bullavirinae genomes related to øX174. Using bioinformatic tools, we first identified 315 potential open reading frames (ORFs) within the genome, including the 11 established essential genes and 82 highly conserved ORFs that have no known gene products or assigned functions. Using genome-scale design and synthesis, we made a mutant genome in which all 11 essential genes are simultaneously disrupted, leaving intact only the 82 conserved but cryptic ORFs. The resulting genome is not viable. Cell-free gene expression followed by mass spectrometry revealed only a single peptide expressed from both the cryptic ORF and wild-type genomes, suggesting a potential new gene. A second synthetic genome in which 71 conserved cryptic ORFs were simultaneously disrupted is viable but with ~50% reduced fitness relative to the wild type. However, rather than finding any new genes, repeated evolutionary adaptation revealed a single point mutation that modulates expression of gene H, a known essential gene, and fully suppresses the fitness defect. Taken together, we conclude that the annotation of currently functional ORFs for the øX174 genome is formally complete. More broadly, we show that sequencing and bioinformatics followed by synthesis-enabled reverse genomics, proteomics, and evolutionary adaptation can definitively establish the sufficiency and completeness of natural genome annotations.
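For readers unfamiliar with what a "potential ORF" is, the following is a minimal Python sketch of ORF enumeration across both strands and all three reading frames. It is not the authors' pipeline: the function names (`find_orfs`, `reverse_complement`), the ATG-only start rule, and the minimum-length cutoff are all assumptions made for illustration. It also treats the sequence as linear, whereas the øX174 genome is circular, so ORFs spanning the origin would be missed.

```python
# Illustrative sketch only; names, start-codon rule, and length cutoff are
# assumptions, not the method used in the study.
STOPS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=3, starts=("ATG",)):
    """Enumerate ORFs on both strands of a linear sequence.

    Returns (strand, start, end) tuples. Coordinates are 0-based,
    end-exclusive, and relative to the strand that was scanned.
    min_codons counts the start and stop codons.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None  # position of the first start codon seen in this frame
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon in starts:
                    start = i
                elif start is not None and codon in STOPS:
                    if (i + 3 - start) // 3 >= min_codons:
                        orfs.append((strand, start, i + 3))
                    start = None  # resume the search after this stop codon
    return orfs


if __name__ == "__main__":
    # Toy sequence containing one short ORF (ATG AAA TAG) on the forward strand.
    print(find_orfs("CCATGAAATAGCC"))
```

A real analysis would typically also allow alternative bacterial start codons and apply a conservation filter across related genomes, as the abstract describes for the 23 Bullavirinae sequences.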

SUBMITTER: Jaschke PR 

PROVIDER: S-EPMC6883844 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature


Publications

Definitive demonstration by synthesis of genome annotation completeness.

Paul R. Jaschke, Gabrielle A. Dotson, Kay S. Hung, Diane Liu, Drew Endy

Proceedings of the National Academy of Sciences of the United States of America, 2019-11-12, issue 48



Similar Datasets

| S-EPMC6466665 | biostudies-literature
| S-EPMC6437941 | biostudies-literature
2009-12-01 | PRD000122 | Pride
| PRJEB65083 | ENA
| PRJEB68156 | ENA
| S-EPMC4571086 | biostudies-literature
| S-EPMC6380598 | biostudies-literature
| S-EPMC341809 | biostudies-other
| S-EPMC4778648 | biostudies-other
| S-EPMC7488777 | biostudies-literature