Background: Radix Bupleuri has been widely used for its plentiful pharmacological effects. But it is hard to evaluate their safety and efficacy because the concentrations of components are tightly affected by the surrounding environment. Thus, Radix Bupleuri samples from different regions and varieties were collected. Based on the experimental and computational Raman spectrum, machine learning is emphasized for certain obscured characteristics; for example, linear discriminant analysis (LDA), support vector machine (SVM), eXtreme gradient boosting (XGBoost) and light gradient boosting machine (LightGBM).

Results: After dimension reduction by LDA, models of SVM, XGBoost and LightGBM were trained for classification and regression prediction of Bupleurum production regions. Support vector classifiers achieved the best accuracy of 98% and an F1 score above 0.96 on the test set. Support vector regression has a good fitting performance with an R score above 0.90 and a relatively low mean square error. However, complex models were prone to overfitting, resulting in poor generalization ability.

Conclusion: Among these machine learning models, the typical LDA-SVM models, consistent with the high-performance liquid chromatography results, demonstrate great performance and stability. We envision that this rapid classification and regression technique can be extended to predictions for other herbs. © 2024 Society of Chemical Industry.

Download full-text PDF

Source
http://dx.doi.org/10.1002/jsfa.14012DOI Listing

Publication Analysis

Top Keywords

machine learning
12
radix bupleuri
12
support vector
12
gradient boosting
8
classification regression
8
machine
5
rapid raman
4
raman spectroscopy
4
spectroscopy analysis
4
analysis assisted
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!