This study exhaustively explores leaf features seeking diagnostic characters to aid the classification (assigning cases to groups, i.e. populations to taxa) in a polyploid plant-species complex. A challenging case study was selected: Veronica subsection Pentasepalae, a taxonomically intricate group. The "divide and conquer" approach was implemented-that is, a difficult primary dataset was split into more manageable subsets. Three techniques were explored: two data-mining tools (artificial neural networks and decision trees) and one unsupervised discriminant analysis. However, only the decision trees and discriminant analysis were finally used to select diagnostic traits. A previously established classification hypothesis based on other data sources was used as a starting point. A guided discriminant analysis (i.e. involving manual character selection) was used to produce a grouping scheme fitting this hypothesis so that it could be taken as a reference. Sequential unsupervised multivariate analysis enabled the recognition of all species and infraspecific taxa; however, a suboptimal classification rate was achieved. Decision trees resulted in better classification rates than unsupervised multivariate analysis, but three complete taxa were misidentified (not present in terminal nodes). The variable selection led to a different grouping scheme in the case of decision trees. The resulting groups displayed low misclassification rates when analyzed using artificial neural networks. The decision trees as well as the discriminant analysis are recommended in the search of diagnostic characters. Due to the high sensitivity that artificial neural networks have to the combination of input/output layers, they are proposed as evaluation tools for morphometric studies. The "divide and conquer" principle is a promising strategy, providing success in the present case study.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6025878 | PMC |
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0199818 | PLOS |
Sci Rep
December 2024
School of Big Data, Fuzhou University of International Studies and Trade, Fuzhou, 350202, China.
The traditional machine learning methods such as decision tree (DT), random forest (RF), and support vector machine (SVM) have low classification performance. This paper proposes an algorithm for the dry bean dataset and obesity levels dataset that can balance the minority class and the majority class and has a clustering function to improve the traditional machine learning classification accuracy and various performance indicators such as precision, recall, f1-score, and area under curve (AUC) for imbalanced data. The key idea is to use the advantages of borderline-synthetic minority oversampling technique (BLSMOTE) to generate new samples using samples on the boundary of minority class samples to reduce the impact of noise on model building, and the advantages of K-means clustering to divide data into different groups according to similarities or common features.
View Article and Find Full Text PDFSci Rep
December 2024
New Technology Research Institute, BYD Auto Industry Co., Ltd., Shenzhen, 518118, China.
Effective road terrain recognition is crucial for enhancing the driving safety, passability, and comfort of autonomous vehicles. This study addresses the challenges of accurately identifying diverse road surfaces using deep learning in complex environments. We introduce a novel end-to-end Tire Noise Recognition Residual Network (TNResNet) integrated with a time-frequency attention module, designed to capture and leverage time-frequency information from tire noise signals for road terrain classification.
View Article and Find Full Text PDFJ Glob Health
December 2024
Hunan Key Laboratory of Molecular Epidemiology, School of Medicine, Hunan Normal University, Changsha, Hu Nan, China.
Background: Since 2019, China has implemented Public Health and Social Measures (PHSMs) to manage the coronavirus disease 2019 (COVID-19) outbreak. As the threat from SARS-CoV-2 diminished, these measures were relaxed, leading to increased respiratory infections and strained health care resources by mid-2023.
Methods: The study utilised WHO's FluNet and Oxford's COVID-19 Government Response Tracker to assess how policy shifts have affected influenza.
Front Nutr
December 2024
Department of Systems Biology and Bioinformatics, Institute of Computer Science, University of Rostock, Rostock, Germany.
Introduction: Disease-related malnutrition is common but often underdiagnosed in patients with chronic gastrointestinal diseases, such as liver cirrhosis, short bowel and intestinal insufficiency, and chronic pancreatitis. To improve malnutrition diagnosis in these patients, an evaluation of the current Global Leadership Initiative on Malnutrition (GLIM) diagnostic criteria, and possibly the implementation of additional criteria, is needed.
Aim: This study aimed to identify previously unknown and potentially specific features of malnutrition in patients with different chronic gastrointestinal diseases and to validate the relevance of the GLIM criteria for clinical practice using machine learning (ML).
PLoS One
December 2024
School of Systems Engineering, Kochi University of Technology, Kami, Kochi, Japan.
This study conducts a comprehensive analysis of gender inequality in Sri Lanka, focusing on the relationship between key socioeconomic factors and the Gender Inequality Index (GII) from 1990 to 2022. By applying machine learning techniques, including Decision Trees and Ensemble methods, the study investigates the influence of economic indicators such as GDP per capita, government expenditure, government revenue, and unemployment rates on gender disparities. The analysis reveals that higher GDP and government revenues are associated with reduced gender inequality, while greater unemployment rates exacerbate disparities.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!