Unknown

Dataset Information

0

Multivariate two-part statistics for analysis of correlated mass spectrometry data from multiple biological specimens.


ABSTRACT:

Motivation

High through-put mass spectrometry (MS) is now being used to profile small molecular compounds across multiple biological sample types from the same subjects with the goal of leveraging information across biospecimens. Multivariate statistical methods that combine information from all biospecimens could be more powerful than the usual univariate analyses. However, missing values are common in MS data and imputation can impact between-biospecimen correlation and multivariate analysis results.

Results

We propose two multivariate two-part statistics that accommodate missing values and combine data from all biospecimens to identify differentially regulated compounds. Statistical significance is determined using a multivariate permutation null distribution. Relative to univariate tests, the multivariate procedures detected more significant compounds in three biological datasets. In a simulation study, we showed that multi-biospecimen testing procedures were more powerful than single-biospecimen methods when compounds are differentially regulated in multiple biospecimens but univariate methods can be more powerful if compounds are differentially regulated in only one biospecimen.

Availability and implementation

We provide R functions to implement and illustrate our method as supplementary information CONTACT: sltaylor@ucdavis.eduSupplementary information: Supplementary data are available at Bioinformatics online.

SUBMITTER: Taylor SL 

PROVIDER: S-EPMC6075023 | biostudies-literature | 2017 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Multivariate two-part statistics for analysis of correlated mass spectrometry data from multiple biological specimens.

Taylor Sandra L SL   Ruhaak L Renee LR   Weiss Robert H RH   Kelly Karen K   Kim Kyoungmi K  

Bioinformatics (Oxford, England) 20160904 1


<h4>Motivation</h4>High through-put mass spectrometry (MS) is now being used to profile small molecular compounds across multiple biological sample types from the same subjects with the goal of leveraging information across biospecimens. Multivariate statistical methods that combine information from all biospecimens could be more powerful than the usual univariate analyses. However, missing values are common in MS data and imputation can impact between-biospecimen correlation and multivariate an  ...[more]

Similar Datasets

| S-EPMC6750869 | biostudies-literature
| S-EPMC8613810 | biostudies-literature
| S-EPMC4655102 | biostudies-literature
| S-EPMC9047006 | biostudies-literature
| S-EPMC9631095 | biostudies-literature
2005-09-20 | GSE2744 | GEO
| S-EPMC7568363 | biostudies-literature
| S-EPMC10775142 | biostudies-literature
| S-EPMC5862252 | biostudies-literature
| S-EPMC6494713 | biostudies-literature