Dataset Information

Application of Multiple Imputation for Missing Values in Three-Way Three-Mode Multi-Environment Trial Data.

ABSTRACT: It is a common occurrence in plant breeding programs to observe missing values in three-way three-mode multi-environment trial (MET) data. We proposed modifications of models for estimating missing observations for these data arrays, and developed a novel approach in terms of hierarchical clustering. Multiple imputation (MI) was used in four ways, multiple agglomerative hierarchical clustering, normal distribution model, normal regression model, and predictive mean match. The later three models used both Bayesian analysis and non-Bayesian analysis, while the first approach used a clustering procedure with randomly selected attributes and assigned real values from the nearest neighbour to the one with missing observations. Different proportions of data entries in six complete datasets were randomly selected to be missing and the MI methods were compared based on the efficiency and accuracy of estimating those values. The results indicated that the models using Bayesian analysis had slightly higher accuracy of estimation performance than those using non-Bayesian analysis but they were more time-consuming. However, the novel approach of multiple agglomerative hierarchical clustering demonstrated the overall best performances.

SUBMITTER: Tian T

PROVIDER: S-EPMC4686903 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Application of Multiple Imputation for Missing Values in Three-Way Three-Mode Multi-Environment Trial Data.

Tian Ting T McLachlan Geoffrey J GJ Dieters Mark J MJ Basford Kaye E KE

PloS one 20151221 12

It is a common occurrence in plant breeding programs to observe missing values in three-way three-mode multi-environment trial (MET) data. We proposed modifications of models for estimating missing observations for these data arrays, and developed a novel approach in terms of hierarchical clustering. Multiple imputation (MI) was used in four ways, multiple agglomerative hierarchical clustering, normal distribution model, normal regression model, and predictive mean match. The later three models ...[more]

PMID: 26689369

Dataset Information

Application of Multiple Imputation for Missing Values in Three-Way Three-Mode Multi-Environment Trial Data.

Publications

Application of Multiple Imputation for Missing Values in Three-Way Three-Mode Multi-Environment Trial Data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Multiple imputation for missing values through conditional Semiparametric odds ratio models.
| S-EPMC3135790 | biostudies-literature

Bayesian profiling multiple imputation for missing hemoglobin values in electronic health records.
| S-EPMC9600600 | biostudies-literature

Missing Values in Longitudinal Proteome Dynamics Studies: Making a Case for Data Multiple Imputation.
| S-EPMC11385379 | biostudies-literature

Imputation of Missing Values for Multi-Biospecimen Metabolomics Studies: Bias and Effects on Statistical Validity.
| S-EPMC9317643 | biostudies-literature

Multiple imputation with missing data indicators.
| S-EPMC9205685 | biostudies-literature

Multiple imputation strategies for missing event times in a multi-state model analysis.
| S-EPMC7616776 | biostudies-literature

Addressing Missing Data Mechanism Uncertainty using Multiple-Model Multiple Imputation: Application to a Longitudinal Clinical Trial.
| S-EPMC3596844 | biostudies-literature

Advanced methods for missing values imputation based on similarity learning.
| S-EPMC8323724 | biostudies-literature

Multi-scale variational autoencoder for imputation of missing values in untargeted metabolomics using whole-genome sequencing data.
| S-EPMC11324385 | biostudies-literature

Multiple imputation methods for missing multilevel ordinal outcomes.
| S-EPMC10169455 | biostudies-literature