Unknown

Dataset Information

0

Simulation of microarray data with realistic characteristics.


ABSTRACT:

Background

Microarray technologies have become common tools in biological research. As a result, a need for effective computational methods for data analysis has emerged. Numerous different algorithms have been proposed for analyzing the data. However, an objective evaluation of the proposed algorithms is not possible due to the lack of biological ground truth information. To overcome this fundamental problem, the use of simulated microarray data for algorithm validation has been proposed.

Results

We present a microarray simulation model which can be used to validate different kinds of data analysis algorithms. The proposed model is unique in the sense that it includes all the steps that affect the quality of real microarray data. These steps include the simulation of biological ground truth data, applying biological and measurement technology specific error models, and finally simulating the microarray slide manufacturing and hybridization. After all these steps are taken into account, the simulated data has realistic biological and statistical characteristics. The applicability of the proposed model is demonstrated by several examples.

Conclusion

The proposed microarray simulation model is modular and can be used in different kinds of applications. It includes several error models that have been proposed earlier and it can be used with different types of input data. The model can be used to simulate both spotted two-channel and oligonucleotide based single-channel microarrays. All this makes the model a valuable tool for example in validation of data analysis algorithms.

SUBMITTER: Nykter M 

PROVIDER: S-EPMC1574357 | biostudies-literature | 2006 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Simulation of microarray data with realistic characteristics.

Nykter Matti M   Aho Tommi T   Ahdesmäki Miika M   Ruusuvuori Pekka P   Lehmussola Antti A   Yli-Harja Olli O  

BMC bioinformatics 20060718


<h4>Background</h4>Microarray technologies have become common tools in biological research. As a result, a need for effective computational methods for data analysis has emerged. Numerous different algorithms have been proposed for analyzing the data. However, an objective evaluation of the proposed algorithms is not possible due to the lack of biological ground truth information. To overcome this fundamental problem, the use of simulated microarray data for algorithm validation has been propose  ...[more]

Similar Datasets

| S-EPMC5003477 | biostudies-literature
| S-EPMC2542380 | biostudies-literature
| S-EPMC6582938 | biostudies-literature
| S-EPMC6239949 | biostudies-literature
| S-EPMC7805882 | biostudies-literature
| S-EPMC2654463 | biostudies-literature
| S-EPMC5882714 | biostudies-literature
| S-EPMC3472240 | biostudies-literature
| S-EPMC5415003 | biostudies-literature