Supervised machine learning applied to inertial measurements from bio-loggers has become a standard tool in behavioural ecology for identifying behavioural modes. Several design choices can affect the accuracy of this identification. One such choice is whether to include segments consisting of more than a single behaviour (mixed segments) in the model training data. The common practice at present is to exclude such segments from model training. In this paper we test the hypothesis that including mixed segments in model training improves accuracy, because the model becomes better at identifying them in the test data. We test this hypothesis using a series of data simulations on four datasets of accelerometer data coupled with behaviour observations, obtained from four study species (Damaraland mole-rats, meerkats, olive baboons, polar bears). Results show that when a substantial proportion of the test data consists of mixed segments (above ~10%), including mixed segments in model training improves classification accuracy. These results were consistent across the four study species and robust to changes in segment length, sample size, and degree of mixture within the mixed segments. However, we also find that in some cases (particularly in baboons) models trained with mixed segments classify test data containing only single-behaviour (pure) segments less accurately than models trained without mixed segments. Based on these results, we recommend including mixed segments in model training when the classification model is expected to deal with a substantial proportion of mixed segments (> 10%); otherwise, including them is unnecessary but also not harmful. The exception is when there is reason to assume that the training data contain a higher rate of mixed segments than the actual (unobserved) data to be classified. Such a situation may occur particularly when training data are collected in captivity and used to classify data from the wild. In this case, excess inclusion of mixed segments in training data should probably be avoided.
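The idea of a mixed segment and of a controlled proportion of mixed segments in a dataset can be illustrated with a minimal Python sketch. It is not the simulation code used in the paper. The function names (`make_mixed_segment`, `add_mixed_segments`), the rule of splicing two pure segments at a single cut point, and the rule of labelling a mixed segment by its majority behaviour are all assumptions made for illustration; the abstract does not specify how mixed segments were constructed or labelled.

```python
import random


def make_mixed_segment(seg_a, seg_b, frac_a):
    """Splice two equal-length pure segments into one mixed segment.

    Assumption: the first `frac_a` share of samples comes from seg_a and
    the remainder from seg_b, joined at a single cut point.
    """
    n = len(seg_a)
    if n != len(seg_b) or n < 2:
        raise ValueError("segments must have equal length of at least 2")
    if not 0.0 < frac_a < 1.0:
        raise ValueError("frac_a must lie strictly between 0 and 1")
    # Clamp the cut so both behaviours contribute at least one sample.
    cut = min(max(round(n * frac_a), 1), n - 1)
    return list(seg_a[:cut]) + list(seg_b[cut:])


def add_mixed_segments(pure, mixed_share, frac_a, rng):
    """Return (samples, label, is_mixed) tuples with a target mixed share.

    `pure` is a list of (samples, label) pairs. Mixed segments are added
    until they make up roughly `mixed_share` of the returned dataset.
    Assumption: a mixed segment takes the label of its majority behaviour.
    """
    if not 0.0 <= mixed_share < 1.0:
        raise ValueError("mixed_share must be in [0, 1)")
    if len({label for _, label in pure}) < 2:
        raise ValueError("need at least two distinct behaviours")
    n_mixed = round(len(pure) * mixed_share / (1.0 - mixed_share))
    dataset = [(samples, label, False) for samples, label in pure]
    made = 0
    while made < n_mixed:
        (seg_a, lab_a), (seg_b, lab_b) = rng.sample(pure, 2)
        if lab_a == lab_b:
            continue  # a mixed segment needs two different behaviours
        label = lab_a if frac_a >= 0.5 else lab_b
        dataset.append((make_mixed_segment(seg_a, seg_b, frac_a), label, True))
        made += 1
    return dataset


# Toy usage: 90 constant-valued pure segments, 10% mixed share, 70/30 mixture.
rng = random.Random(0)
pure = [([0.0] * 20, "rest")] * 45 + [([1.0] * 20, "walk")] * 45
data = add_mixed_segments(pure, mixed_share=0.10, frac_a=0.7, rng=rng)
```

In a comparison of the kind the abstract describes, one classifier would be trained on the pure segments only and another on a dataset like `data`, and both would then be scored on test sets with varying `mixed_share`.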


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11165886
DOI: http://dx.doi.org/10.1186/s40462-024-00485-7

