It is a common occurrence in plant breeding programs to observe missing values in three-way three-mode multi-environment trial (MET) data. We proposed modifications of models for estimating missing observations for these data arrays, and developed a novel approach in terms of hierarchical clustering. Multiple imputation (MI) was used in four ways, multiple agglomerative hierarchical clustering, normal distribution model, normal regression model, and predictive mean match. The later three models used both Bayesian analysis and non-Bayesian analysis, while the first approach used a clustering procedure with randomly selected attributes and assigned real values from the nearest neighbour to the one with missing observations. Different proportions of data entries in six complete datasets were randomly selected to be missing and the MI methods were compared based on the efficiency and accuracy of estimating those values. The results indicated that the models using Bayesian analysis had slightly higher accuracy of estimation performance than those using non-Bayesian analysis but they were more time-consuming. However, the novel approach of multiple agglomerative hierarchical clustering demonstrated the overall best performances.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4686903PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0144370PLOS

Publication Analysis

Top Keywords

hierarchical clustering
12
multiple imputation
8
missing values
8
values three-way
8
three-way three-mode
8
three-mode multi-environment
8
multi-environment trial
8
missing observations
8
novel approach
8
multiple agglomerative
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!