Dataset Information

ArrayOme: a program for estimating the sizes of microarray-visualized bacterial genomes.

ABSTRACT: ArrayOme is a new program that calculates the size of genomes represented by microarray-based probes and facilitates recognition of key bacterial strains carrying large numbers of novel genes. Protein-coding sequences (CDS) that are contiguous on annotated reference templates and classified as 'Present' in the test strain by hybridization to microarrays are merged into inferred contigs (ICs). These ICs are then extended to account for flanking intergenic sequences. Finally, the lengths of all extended ICs are summed to yield the 'microarray-visualized genome (MVG)' size. We tested and validated ArrayOme using both experimental and in silico-generated genomic hybridization data. MVG sizing of five sequenced Escherichia coli and Shigella strains resulted in an accuracy of 97-99%, as compared to true genome sizes, when the comprehensive ShE.coli meta-array gene sequences (6239 CDS) were used for in silico hybridization analysis. However, the E. coli CFT073 genome size was underestimated by 14% as this meta-array lacked probes for many CFT073 CDS. ArrayOme permits rapid recognition of discordances between PFGE-measured genome and MVG sizes, thereby enabling high-throughput identification of strains rich in novel genes. Gene discovery studies focused on these strains will greatly facilitate characterization of the global gene pool accessible to individual bacterial species.
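The three steps named in the abstract (merge contiguous 'Present' CDS, extend for flanking intergenic sequence, sum the lengths) can be illustrated with a minimal Python sketch. This is not ArrayOme's actual code. The `CDS` record, the `mvg_size` function, and in particular the extension rule are assumptions made for illustration: the abstract does not say how far each contig is extended, so the sketch arbitrarily adds half of the intergenic gap on each side. It also treats the reference template as a single linear sequence.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class CDS:
    start: int     # 1-based inclusive start on the reference template
    end: int       # 1-based inclusive end on the reference template
    present: bool  # True if called 'Present' by microarray hybridization


def mvg_size(cds_list: List[CDS], template_length: int) -> int:
    """Estimate a microarray-visualized genome size (hypothetical sketch).

    cds_list must be sorted by start coordinate on one linear template.
    """
    # Step 1: merge runs of consecutive 'Present' CDS into contigs,
    # stored as (first_index, last_index) pairs.
    runs = []
    run_start = None
    for i, cds in enumerate(cds_list):
        if cds.present:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            runs.append((run_start, i - 1))
            run_start = None
    if run_start is not None:
        runs.append((run_start, len(cds_list) - 1))

    total = 0
    for first, last in runs:
        left = cds_list[first].start
        right = cds_list[last].end

        # Step 2: extend into flanking intergenic sequence.
        # ASSUMPTION: take half of the gap to the neighbouring CDS
        # (or to the template end); the real rule may differ.
        prev_end = cds_list[first - 1].end if first > 0 else 0
        next_start = (
            cds_list[last + 1].start
            if last + 1 < len(cds_list)
            else template_length + 1
        )
        left -= max(0, left - prev_end - 1) // 2
        right += max(0, next_start - right - 1) // 2

        # Step 3: add the extended contig length to the running total.
        total += right - left + 1
    return total


if __name__ == "__main__":
    # Toy template of 1000 bp with four CDS, one of them 'Absent'.
    toy = [
        CDS(101, 200, True),
        CDS(251, 400, True),
        CDS(501, 600, False),
        CDS(701, 800, True),
    ]
    print(mvg_size(toy, 1000))
```

Comparing such an estimate with a PFGE-measured genome size is the discordance check the abstract describes: a measured size well above the MVG size suggests sequence with no matching probes on the array.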

SUBMITTER: Ou HY

PROVIDER: S-EPMC546176 | biostudies-literature | 2005

REPOSITORIES: biostudies-literature

Publications

ArrayOme: a program for estimating the sizes of microarray-visualized bacterial genomes.

Ou HY, Smith R, Lucchini S, Hinton J, Chaudhuri RR, Pallen M, Barer MR, Rajakumar K

Nucleic Acids Research, 7 January 2005, issue 1

PMID: 15640440

Similar Datasets

Challenges in estimating effective population sizes from metagenome-assembled genomes.
| S-EPMC10797056 | biostudies-literature

Estimating variable effective population sizes from multiple genomes: a sequentially markov conditional sampling distribution approach.
| S-EPMC3697970 | biostudies-literature

Estimating atomic sizes with Raman spectroscopy.
| S-EPMC3601407 | biostudies-other

A process for analysis of microarray comparative genomics hybridisation studies for bacterial genomes.
| S-EPMC2262894 | biostudies-literature

Estimating effect sizes in genome-wide association studies.
| S-EPMC3923086 | biostudies-literature

Estimating cross-population genetic correlations of causal effect sizes.
| S-EPMC6375794 | biostudies-literature

Estimating colony sizes of emerging bats using acoustic recordings.
| S-EPMC4821278 | biostudies-literature

Estimating average single-neuron visual receptive field sizes by fMRI.
| S-EPMC6442598 | biostudies-literature

What Influences Saturation? Estimating Sample Sizes in Focus Group Research.
| S-EPMC6635912 | biostudies-literature