Characteristic features of statistical models and machine learning methods derived from pest and disease monitoring datasets.

R Soc Open Sci

Research Center for Agricultural Information and Technology, National Agriculture and Food Research Organization 105-0003, 2-14-1 Kowa Nishi-Shimbashi Building, Nishi-Shimbashi, Minato, Tokyo, Japan.

Published: June 2023

While many studies have used traditional statistical methods when analysing monitoring data to predict future population dynamics of crop pests and diseases, increasing studies have used machine learning methods. The characteristic features of these methods have not been fully elucidated and arranged. We compared the prediction performance between two statistical and seven machine learning methods using 203 monitoring datasets recorded over several decades on four major crops in Japan and meteorological and geographical information as the explanatory variables. The decision tree and random forest of machine learning were found to be most efficient, while regression models of statistical and machine learning methods were relatively inferior. The best two methods were better for biased and scarce data, while the statistical Bayesian model was better for larger dataset sizes. Therefore, researchers should consider data characteristics when selecting the most appropriate method.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10300670PMC
http://dx.doi.org/10.1098/rsos.230079DOI Listing

Publication Analysis

Top Keywords

machine learning
20
learning methods
16
characteristic features
8
monitoring datasets
8
statistical machine
8
methods
7
statistical
5
machine
5
learning
5
features statistical
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!