Ontology highlight
ABSTRACT:
SUBMITTER: Akhter S
PROVIDER: S-EPMC3539204 | biostudies-literature | 2013
REPOSITORIES: biostudies-literature
Akhter Sajia S Bailey Barbara A BA Salamon Peter P Aziz Ramy K RK Edwards Robert A RA
Scientific reports 20130108
All sequence data contain inherent information that can be measured by Shannon's uncertainty theory. Such measurement is valuable in evaluating large data sets, such as metagenomic libraries, to prioritize their analysis and annotation, thus saving computational resources. Here, Shannon's index of complete phage and bacterial genomes was examined. The information content of a genome was found to be highly dependent on the genome length, GC content, and sequence word size. In metagenomic sequence ...[more]