With the recent advances in deep learning, wearable sensors have increasingly been used in automated animal activity recognition. However, there are two major challenges in improving recognition performance: multi-modal feature fusion and imbalanced data modeling. In this study, to improve classification performance for equine activities while tackling these two challenges, we developed a cross-modality interaction network (CMI-Net) involving a dual convolutional neural network architecture and a cross-modality interaction module (CMIM). The CMIM adaptively recalibrated the temporal- and axis-wise features in each modality by leveraging multi-modal information to achieve deep intermodality interaction. A class-balanced (CB) focal loss was adopted to supervise the training of CMI-Net and alleviate the class imbalance problem. Motion data were acquired from six neck-attached inertial measurement units on six horses. The CMI-Net was trained and verified with leave-one-out cross-validation. The results demonstrated that our CMI-Net outperformed existing algorithms with high precision (79.74%), recall (79.57%), F1-score (79.02%), and accuracy (93.37%). The adoption of CB focal loss improved the performance of CMI-Net, with increases of 2.76%, 4.16%, and 3.92% in precision, recall, and F1-score, respectively. In conclusion, CMI-Net and CB focal loss effectively enhanced equine activity classification performance using imbalanced multi-modal sensor data.
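The class-balanced focal loss mentioned in the abstract combines focal down-weighting of easy examples with per-class weights derived from the effective number of samples (Cui et al., 2019). Below is a minimal PyTorch sketch of that loss; the function name, beta, and gamma values are illustrative assumptions, not the settings reported in the paper.

```python
# Hedged sketch of a class-balanced (CB) focal loss; hyperparameters are
# illustrative assumptions, not the paper's reported configuration.
import torch
import torch.nn.functional as F

def cb_focal_loss(logits, targets, samples_per_class, beta=0.999, gamma=2.0):
    """Class-balanced focal loss for imbalanced multi-class training.

    logits: (N, C) raw model outputs; targets: (N,) integer class labels;
    samples_per_class: (C,) training-sample counts used for re-weighting.
    """
    # Effective-number re-weighting: w_c = (1 - beta) / (1 - beta ** n_c)
    effective_num = 1.0 - torch.pow(beta, samples_per_class.float())
    weights = (1.0 - beta) / effective_num
    weights = weights / weights.sum() * len(samples_per_class)  # normalise to sum C

    # Focal term: (1 - p_t)^gamma down-weights easy, well-classified examples
    log_probs = F.log_softmax(logits, dim=1)
    pt = log_probs.exp().gather(1, targets.unsqueeze(1)).squeeze(1)
    log_pt = log_probs.gather(1, targets.unsqueeze(1)).squeeze(1)
    focal = torch.pow(1.0 - pt, gamma) * (-log_pt)

    # Weight each sample by its class weight and average over the batch
    return (weights[targets] * focal).mean()

# Example usage with dummy data (7 activity classes, batch of 4):
# loss = cb_focal_loss(torch.randn(4, 7), torch.tensor([0, 2, 2, 6]),
#                      torch.tensor([500, 120, 80, 60, 40, 30, 10]))
```

The effective-number weighting grows the loss contribution of rare classes, while the focal term keeps the abundant, easily classified activities from dominating the gradient.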
| Download full-text PDF | Source |
|---|---|
| http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434387 | PMC |
| http://dx.doi.org/10.3390/s21175818 | DOI Listing |
Physiol Behav
January 2025
Department of Anesthesiology, Nihon University School of Dentistry at Matsudo, 2-870-1 Sakaecho-Nishi, Matsudo, Chiba, 271-8587, Japan.
Cross-modal interactions between sensory modalities may be necessary for recognizing food being chewed inside the oral cavity, which cannot be seen, so as to avoid damaging the tongue and/or oral mucosa. The present study used functional near-infrared spectroscopy to investigate whether the food properties of hardness and size influence activity in the posterior parietal cortex and visual cortex during chewing in healthy individuals. An increase in food hardness enhanced both posterior parietal cortex and visual cortex activities, and an increase in food size enhanced activities in the same regions.
Brief Bioinform
November 2024
Department of Electronic Engineering, Tsinghua University, 100084 Beijing, China.
Single-cell multi-omics techniques, which enable the simultaneous measurement of multiple modalities such as RNA gene expression and Assay for Transposase-Accessible Chromatin (ATAC) within individual cells, have become powerful tools for deciphering the intricate complexity of cellular systems. Most current methods rely on motif databases to establish cross-modality relationships between genes from RNA-seq data and peaks from ATAC-seq data. However, these approaches are constrained by incomplete database coverage, particularly for novel or poorly characterized relationships.
Commun Biol
January 2025
School of Psychology, Shenzhen University, Shenzhen, China.
Speech processing involves a complex interplay between sensory and motor systems in the brain, essential for early language development. Recent studies have extended this sensory-motor interaction to visual word processing, emphasizing the connection between reading and handwriting during literacy acquisition. Here we show how language-motor areas encode motoric and sensory features of language stimuli during auditory and visual perception, using functional magnetic resonance imaging (fMRI) combined with representational similarity analysis.
Sensors (Basel)
January 2025
Faculty of Applied Sciences, Macao Polytechnic University, Macao SAR 999078, China.
Visible-infrared person re-identification (VI-ReID) is a challenging cross-modality retrieval task to match a person across different spectral camera views. Most existing works focus on learning shared feature representations from the final embedding space of advanced networks to alleviate modality differences between visible and infrared images. However, exclusively relying on high-level semantic information from the network's final layers can restrict shared feature representations and overlook the benefits of low-level details.
Comput Biol Med
January 2025
National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, 518060, China; Computer Science Department, Misr Higher Institute of Commerce and Computers, Mansoura, 35516, Egypt.
Multimodal neuroimaging data, including magnetic resonance imaging (MRI) and positron emission tomography (PET), provides complementary information about the brain that can aid in Alzheimer's disease (AD) diagnosis. However, most existing deep learning methods still rely on patch-based extraction from neuroimaging data, which typically yields suboptimal performance due to its isolation from the subsequent network and does not effectively capture the varying scales of structural changes in the cerebrum. Moreover, these methods often simply concatenate multimodal data, ignoring the interactions between them that can highlight discriminative regions and thereby improve the diagnosis of AD.