Unknown

Dataset Information

0

A novel approach for transcription factor analysis using SELEX with high-throughput sequencing (TFAST).


ABSTRACT:

Background

In previous work, we designed a modified aptamer-free SELEX-seq protocol (afSELEX-seq) for the discovery of transcription factor binding sites. Here, we present original software, TFAST, designed to analyze afSELEX-seq data, validated against our previously generated afSELEX-seq dataset and a model dataset. TFAST is designed with a simple graphical interface (Java) so that it can be installed and executed without extensive expertise in bioinformatics. TFAST completes analysis within minutes on most personal computers.

Methodology

Once afSELEX-seq data are aligned to a target genome, TFAST identifies peaks and, uniquely, compares peak characteristics between cycles. TFAST generates a hierarchical report of graded peaks, their associated genomic sequences, binding site length predictions, and dummy sequences.

Principal findings

Including additional cycles of afSELEX-seq improved TFAST's ability to selectively identify peaks, leading to 7,274, 4,255, and 2,628 peaks identified in two-, three-, and four-cycle afSELEX-seq. Inter-round analysis by TFAST identified 457 peaks as the strongest candidates for true binding sites. Separating peaks by TFAST into classes of worst, second-best and best candidate peaks revealed a trend of increasing significance (e-values 4.5 × 10(12), 2.9 × 10(-46), and 1.2 × 10(-73)) and informational content (11.0, 11.9, and 12.5 bits over 15 bp) of discovered motifs within each respective class. TFAST also predicted a binding site length (28 bp) consistent with non-computational experimentally derived results for the transcription factor PapX (22 to 29 bp).

Conclusions/significance

TFAST offers a novel and intuitive approach for determining DNA binding sites of proteins subjected to afSELEX-seq. Here, we demonstrate that TFAST, using afSELEX-seq data, rapidly and accurately predicted sequence length and motif for a putative transcription factor's binding site.

SUBMITTER: Reiss DJ 

PROVIDER: S-EPMC3430675 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

A novel approach for transcription factor analysis using SELEX with high-throughput sequencing (TFAST).

Reiss Daniel J DJ   Howard Frederick M FM   Mobley Harry L T HL  

PloS one 20120803 8


<h4>Background</h4>In previous work, we designed a modified aptamer-free SELEX-seq protocol (afSELEX-seq) for the discovery of transcription factor binding sites. Here, we present original software, TFAST, designed to analyze afSELEX-seq data, validated against our previously generated afSELEX-seq dataset and a model dataset. TFAST is designed with a simple graphical interface (Java) so that it can be installed and executed without extensive expertise in bioinformatics. TFAST completes analysis  ...[more]

Similar Datasets

| S-EPMC2820082 | biostudies-literature
| S-EPMC6544647 | biostudies-literature
| S-EPMC4070925 | biostudies-literature
| S-EPMC390317 | biostudies-literature
| S-EPMC3686578 | biostudies-literature
| S-EPMC6694822 | biostudies-literature
| S-EPMC4824076 | biostudies-literature
| S-EPMC5031971 | biostudies-literature
| S-EPMC2838866 | biostudies-other
| S-EPMC4666376 | biostudies-literature