Unknown

Dataset Information

0

Automated finishing with autofinish.


ABSTRACT: Currently, the genome sequencing community is producing shotgun sequence data at a very high rate, but finishing (collecting additional directed sequence data to close gaps and improve the quality of the data) is not matching that rate. One reason for the difference is that shotgun sequencing is highly automated but finishing is not: Most finishing decisions, such as which directed reads to obtain and which specialized sequencing techniques to use, are made by people. If finishing rates are to increase to match shotgun sequencing rates, most finishing decisions also must be automated. The Autofinish computer program (which is part of the computer software package) does this by automatically choosing finishing reads. Autofinish is able to suggest most finishing reads required for completion of each sequencing project, greatly reducing the amount of human attention needed. sometimes completely finishes the project, with no human decisions required. It cannot solve the most complex problems, so we recommend that Autofinish be allowed to suggest reads for the first three rounds of finishing, and if the project still is not finished completely, a human finisher complete the work. We compared this Autofinish-Hybrid method of finishing against a human finisher in five different projects with a variety of shotgun depths by finishing each project twice--once with each method. This comparison shows that the Autofinish-Hybrid method saves many hours over a human finisher alone, while using roughly the same number and type of reads and closing gaps at roughly the same rate. Autofinish currently is in production use at several large sequencing centers. It is designed to be adaptable to the finishing strategy of the lab--it can finish using some or all of the following: resequencing reads, reverses, custom primer walks on either subclone templates or whole clone templates, PCR, or minilibraries. Autofinish has been used for finishing cDNA, genomic clones, and whole bacterial genomes (see http://www.phrap.org).

SUBMITTER: Gordon D 

PROVIDER: S-EPMC311035 | biostudies-literature | 2001 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Automated finishing with autofinish.

Gordon D D   Desmarais C C   Green P P  

Genome research 20010401 4


Currently, the genome sequencing community is producing shotgun sequence data at a very high rate, but finishing (collecting additional directed sequence data to close gaps and improve the quality of the data) is not matching that rate. One reason for the difference is that shotgun sequencing is highly automated but finishing is not: Most finishing decisions, such as which directed reads to obtain and which specialized sequencing techniques to use, are made by people. If finishing rates are to i  ...[more]

Similar Datasets

| S-EPMC3731737 | biostudies-literature
| S-EPMC9196995 | biostudies-literature
| PRJEB45792 | ENA
| S-EPMC2441464 | biostudies-literature
2014-06-01 | GSE53366 | GEO
| S-EPMC2825053 | biostudies-literature
| S-EPMC5596377 | biostudies-literature
2013-02-12 | E-ERAD-77 | biostudies-arrayexpress
| S-EPMC4515008 | biostudies-literature
| S-EPMC4896287 | biostudies-literature