Dataset Information

Detection of microRNAs in color space.

ABSTRACT:

Motivation

Deep sequencing provides inexpensive opportunities to characterize the transcriptional diversity of known genomes. The AB SOLiD technology generates millions of short sequencing reads in color space; that is, the raw data is a sequence of colors, where each color represents 2 nt and each nucleotide is represented by two consecutive colors. This strategy is purported to have several advantages, including increased ability to distinguish sequencing errors from polymorphisms. Several programs have been developed to map short reads to genomes in color space. However, a number of previously unexplored technical issues arise when using SOLiD technology to characterize microRNAs.
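To make the two-base encoding concrete, the following Python sketch encodes a nucleotide sequence into colors and decodes it again. It assumes the standard SOLiD dinucleotide color code, in which each color (0 to 3) equals the XOR of the 2-bit codes of two adjacent bases (A=0, C=1, G=2, T=3). The function names are illustrative and do not come from the paper.

```python
# Minimal sketch of two-base (color-space) encoding.
# Assumption: standard SOLiD color code, color = XOR of 2-bit base codes.
BASE_TO_BITS = {"A": 0, "C": 1, "G": 2, "T": 3}
BITS_TO_BASE = "ACGT"


def encode_colors(seq):
    """Return one color per adjacent pair of nucleotides in seq."""
    codes = [BASE_TO_BITS[base] for base in seq.upper()]
    return "".join(str(a ^ b) for a, b in zip(codes, codes[1:]))


def decode_colors(first_base, colors):
    """Rebuild the nucleotide sequence from a known first base and its colors."""
    current = BASE_TO_BITS[first_base.upper()]
    seq = [BITS_TO_BASE[current]]
    for color in colors:
        current ^= int(color)  # each color moves us from one base to the next
        seq.append(BITS_TO_BASE[current])
    return "".join(seq)


if __name__ == "__main__":
    print(encode_colors("ATGGC"))        # reference sequence
    print(encode_colors("ATCGC"))        # same sequence with one substitution
    print(decode_colors("A", "3103"))    # decoding the reference colors
    print(decode_colors("A", "3203"))    # decoding with one miscalled color
```

In this sketch a single-nucleotide substitution changes two adjacent colors, whereas a single miscalled color alters every decoded base downstream of it. Decoding also requires a known first base, which is one reason the first nucleotide of a read needs special care when working in color space.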

Results

Here we explore these technical difficulties. First, since the sequenced reads are longer than the biological sequences, every read is expected to contain linker fragments. The color-calling error rate increases toward the 3′ end of the read such that recognizing the linker sequence for removal becomes problematic. Second, mapping in color space may lead to the loss of the first nucleotide of each read. We propose a sequential trimming and mapping approach to map small RNAs. Using our strategy, we reanalyze three published insect small RNA deep sequencing datasets and characterize 22 new microRNAs.
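The idea of sequential trimming and mapping can be sketched as follows. This is one plausible reading of the abstract, not the SeqTrimMap script itself: each read is mapped at full length, and if it fails to map, one position is removed from its 3′ end and mapping is retried until a minimum length is reached. The toy mapper below uses exact substring matching in nucleotide space, whereas a real pipeline would call a color-space-aware aligner. The function name, the `min_len` default, and the example sequences are all assumptions made for illustration.

```python
# Hedged sketch of a sequential trim-and-map loop (not the authors' script).
# Assumptions: exact-match "mapping" via str.find, hypothetical min_len of 18.

def sequential_trim_map(reads, reference, min_len=18):
    """Map each read, trimming one position from the 3' end after each failure.

    Returns a list of (read, position, mapped_length) tuples; position is
    None and mapped_length is 0 when no prefix of at least min_len maps.
    """
    results = []
    for read in reads:
        hit = (read, None, 0)
        length = len(read)
        while length >= min_len:
            position = reference.find(read[:length])  # toy exact-match mapper
            if position != -1:
                hit = (read, position, length)
                break
            length -= 1  # drop one position from the 3' end and retry
        results.append(hit)
    return results


if __name__ == "__main__":
    small_rna = "TGAGGTAGTAGGTTGTATAGTT"      # illustrative 22 nt sequence
    linker = "CGCCTTGG"                        # hypothetical linker fragment
    reference = "GGGG" + small_rna + "AAAA"    # toy reference
    for result in sequential_trim_map([small_rna + linker], reference):
        print(result)
```

Because trimming stops at the longest prefix that maps, the linker never has to be recognized explicitly, which sidesteps the rising error rate toward the 3′ end of the read. The trade-off is that a prefix may occasionally extend by chance into the linker when the linker happens to match the flanking reference sequence.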

Availability and implementation

A bash shell script to perform the sequential trimming and mapping procedure, called SeqTrimMap, is available at: http://www.mirbase.org/tools/seqtrimmap/

Contact

antonio.marco@manchester.ac.uk

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Marco A

PROVIDER: S-EPMC3268249 | biostudies-literature | 2012 Feb

REPOSITORIES: biostudies-literature

Publications

Detection of microRNAs in color space.

Marco A, Griffiths-Jones S

Bioinformatics (Oxford, England), 2011-12-09, issue 3

PMID: 22171334

Similar Datasets

Delving Deeper Into Color Space. | S-EPMC6109856 | biostudies-literature

A dichotomy color quantization algorithm for the HSI color space. | S-EPMC10199057 | biostudies-literature

MicroRNAs Responding to Space Radiation. | S-EPMC7555309 | biostudies-literature

Color Space Geometry Uncovered with Magnetoencephalography. | S-EPMC9036622 | biostudies-literature

Color Space Geometry Uncovered with Magnetoencephalography. | S-EPMC7878424 | biostudies-literature

Color image segmentation based on different color space models using automatic GrabCut. | S-EPMC4165205 | biostudies-literature

A hybrid color space for skin detection using genetic algorithm heuristic search and principal component analysis technique. | S-EPMC4534136 | biostudies-literature

Color Space Transformation-Based Smartphone Algorithm for Colorimetric Urinalysis. | S-EPMC6175489 | biostudies-literature

CUSHAW3: sensitive and accurate base-space and color-space short-read alignment with hybrid seeding. | S-EPMC3899341 | biostudies-literature

CIELAB Color Space as a Field for Tracking Color-Changing Chemical Reactions of Polymeric pH Indicators. | S-EPMC11360014 | biostudies-literature