Unknown

Dataset Information

0

DeML: robust demultiplexing of Illumina sequences using a likelihood-based approach.


ABSTRACT:

Motivation

Pooling multiple samples increases the efficiency and lowers the cost of DNA sequencing. One approach to multiplexing is to use short DNA indices to uniquely identify each sample. After sequencing, reads must be assigned in silico to the sample of origin, a process referred to as demultiplexing. Demultiplexing software typically identifies the sample of origin using a fixed number of mismatches between the read index and a reference index set. This approach may fail or misassign reads when the sequencing quality of the indices is poor.

Results

We introduce deML, a maximum likelihood algorithm that demultiplexes Illumina sequences. deML computes the likelihood of an observed index sequence being derived from a specified sample. A quality score which reflects the probability of the assignment being correct is generated for each read. Using these quality scores, even very problematic datasets can be demultiplexed and an error threshold can be set.

Availability and implementation

deML is freely available for use under the GPL (http://bioinf.eva.mpg.de/deml/).

SUBMITTER: Renaud G 

PROVIDER: S-EPMC4341068 | biostudies-literature | 2015 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

deML: robust demultiplexing of Illumina sequences using a likelihood-based approach.

Renaud Gabriel G   Stenzel Udo U   Maricic Tomislav T   Wiebe Victor V   Kelso Janet J  

Bioinformatics (Oxford, England) 20141030 5


<h4>Motivation</h4>Pooling multiple samples increases the efficiency and lowers the cost of DNA sequencing. One approach to multiplexing is to use short DNA indices to uniquely identify each sample. After sequencing, reads must be assigned in silico to the sample of origin, a process referred to as demultiplexing. Demultiplexing software typically identifies the sample of origin using a fixed number of mismatches between the read index and a reference index set. This approach may fail or misassi  ...[more]

Similar Datasets

| S-EPMC7077088 | biostudies-literature
| S-EPMC6907891 | biostudies-literature
| S-EPMC4080998 | biostudies-literature
| S-EPMC7285394 | biostudies-literature
| S-EPMC7210008 | biostudies-literature
| S-EPMC6305257 | biostudies-literature
| S-EPMC6586021 | biostudies-literature
| S-EPMC8322939 | biostudies-literature
| S-EPMC8484467 | biostudies-literature
| S-EPMC3471323 | biostudies-literature