
Dataset Information


Incidence of "quasi-ditags" in catalogs generated by Serial Analysis of Gene Expression (SAGE).


ABSTRACT: BACKGROUND: Serial Analysis of Gene Expression (SAGE) is a functional genomic technique that quantitatively analyzes the cellular transcriptome. The analysis of SAGE libraries relies on the identification of ditags from sequencing files; however, the software used to examine SAGE libraries cannot distinguish between authentic and false ditags ("quasi-ditags"). RESULTS: We provide examples of quasi-ditags that originate from cloning and sequencing artifacts (i.e. genomic contamination or random combinations of nucleotides) and that are included in SAGE libraries. We have employed a mathematical model to predict the frequency of quasi-ditags in random nucleotide sequences, and our data show that clones containing two or fewer ditags (which include chromosomal cloning artifacts) should be excluded from the analysis of SAGE catalogs. CONCLUSIONS: Cloning and sequencing artifacts contaminating SAGE libraries could be eliminated using a simple pre-screening procedure to increase the reliability of the data.
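The abstract does not describe the authors' mathematical model, so the following is only an illustrative sketch of the idea, not their method. It assumes that ditag-calling software treats any stretch between two consecutive anchoring sites as a ditag when its length falls in an accepted window. The anchor `CATG` (the NlaIII site commonly used in SAGE), the 20 to 24 nt window, the clone length, the uniform base composition, and the function names `count_quasi_ditags` and `simulate` are all assumptions introduced here for illustration.

```python
import random

# Assumption: CATG (NlaIII site) is the anchoring sequence that flanks ditags.
ANCHOR = "CATG"


def count_quasi_ditags(seq, anchor=ANCHOR, min_len=20, max_len=24):
    """Count stretches between consecutive anchor sites whose length
    lies in [min_len, max_len] (hypothetical acceptance window)."""
    # Collect the start position of every anchor occurrence.
    sites = []
    pos = seq.find(anchor)
    while pos != -1:
        sites.append(pos)
        pos = seq.find(anchor, pos + 1)

    # A candidate ditag is the spacer between two neighbouring anchors.
    count = 0
    for left, right in zip(sites, sites[1:]):
        spacer = right - (left + len(anchor))
        if min_len <= spacer <= max_len:
            count += 1
    return count


def simulate(n_clones=2000, clone_len=500, seed=0):
    """Return the quasi-ditag count for each of n_clones random sequences
    (uniform A/C/G/T composition, an assumption of this sketch)."""
    rng = random.Random(seed)
    counts = []
    for _ in range(n_clones):
        seq = "".join(rng.choice("ACGT") for _ in range(clone_len))
        counts.append(count_quasi_ditags(seq))
    return counts


if __name__ == "__main__":
    counts = simulate()
    # Share of purely random clones that would pass a "more than 2 ditags" filter.
    passing = sum(1 for c in counts if c > 2) / len(counts)
    print(f"mean quasi-ditags per random clone: {sum(counts) / len(counts):.3f}")
    print(f"fraction of random clones with > 2 quasi-ditags: {passing:.4f}")
```

Under these assumptions, the script gives a rough sense of how often random sequence alone yields ditag-like stretches, which is the kind of quantity a pre-screening threshold on ditags per clone would be weighed against. The numbers it prints depend entirely on the assumed window and clone length and should not be read as the paper's results.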

SUBMITTER: Anisimov SV 

PROVIDER: S-EPMC526221 | biostudies-literature | 2004 Oct

REPOSITORIES: biostudies-literature


Publications

Incidence of "quasi-ditags" in catalogs generated by Serial Analysis of Gene Expression (SAGE).

Anisimov, Sergey V; Sharov, Alexei A

BMC Bioinformatics, 2004-10-18



Similar Datasets

| S-EPMC2121609 | biostudies-literature
| S-EPMC3081805 | biostudies-literature
| S-EPMC4616010 | biostudies-literature
| S-EPMC517707 | biostudies-literature
| S-EPMC311149 | biostudies-literature
| S-EPMC27040 | biostudies-literature
| S-EPMC1950885 | biostudies-other
| S-EPMC2527533 | biostudies-literature
| S-EPMC1261262 | biostudies-literature
| S-EPMC2896172 | biostudies-literature