Cardiotocography (CTG) monitoring is an important medical diagnostic tool for evaluating fetal well-being in late pregnancy. Intelligent CTG classification based on Fetal Heart Rate (FHR) signals is a challenging research area that can assist obstetricians in making clinical decisions, thereby improving the efficiency and accuracy of pregnancy management. Most existing methods focus on a single modality and therefore inevitably suffer from limitations such as incomplete or redundant source-domain feature extraction and poor repeatability. This study focuses on modeling multimodal learning for Fetal Distress Diagnosis (FDD), which poses three major challenges: unaligned multimodalities; failure to learn and fuse the causality and inclusion between multimodal biomedical data; and modality sensitivity, that is, difficulty in performing a task when modalities are absent. To address these three issues, we propose a Multimodal Medical Information Fusion framework named MMIF, in which the Category Constrained-Parallel ViT model (CCPViT) is first proposed to explore multimodal learning tasks and address the misalignment between modalities. Based on CCPViT, a cross-attention-based image-text joint component is introduced to establish a Multimodal Representation Alignment Network model (MRAN), explore the deep-level interactive representation between cross-modal data, and assist multimodal learning. Furthermore, we design a simple-structured FDD test model based on the highly modality-aligned MMIF, realizing task delegation from multimodal model training (image and text) to unimodal pathological diagnosis (image). Extensive experiments, including model parameter sensitivity analysis, cross-modal alignment assessment, and pathological diagnostic accuracy evaluation, were conducted to demonstrate our models' superior performance and effectiveness.
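
The abstract does not give the internals of the cross-attention-based image-text joint component, so the following is only a generic sketch of scaled dot-product cross-attention, in which image tokens act as queries over text tokens. It is not the MRAN implementation; the function name `cross_attention`, the projection matrices `w_q`, `w_k`, `w_v`, and all shapes are assumptions chosen for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def cross_attention(image_tokens, text_tokens, w_q, w_k, w_v):
    """Generic single-head cross-attention (illustrative, not the paper's model).

    image_tokens: (n_img, d_model) array used to build the queries.
    text_tokens:  (n_txt, d_model) array used to build keys and values.
    Returns the text-informed image representation and the attention weights.
    """
    q = image_tokens @ w_q  # (n_img, d_head)
    k = text_tokens @ w_k   # (n_txt, d_head)
    v = text_tokens @ w_v   # (n_txt, d_head)
    scores = q @ k.T / np.sqrt(q.shape[-1])  # (n_img, n_txt)
    weights = softmax(scores, axis=-1)       # each row sums to 1
    return weights @ v, weights


# Hypothetical sizes: 4 image patches, 3 text tokens, embedding width 8.
rng = np.random.default_rng(0)
d_model = 8
image_tokens = rng.normal(size=(4, d_model))
text_tokens = rng.normal(size=(3, d_model))
w_q, w_k, w_v = (rng.normal(size=(d_model, d_model)) for _ in range(3))

fused, weights = cross_attention(image_tokens, text_tokens, w_q, w_k, w_v)
```

Here each row of `weights` describes how strongly one image token attends to each text token, and `fused` has one row per image token, which is what allows a downstream image-side model to carry text-derived information.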

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9676934
DOI: http://dx.doi.org/10.3389/fphys.2022.1021400

Publication Analysis

Top Keywords

multimodal learning: 16
multimodal: 9
learning fetal: 8
fetal distress: 8
distress diagnosis: 8
multimodal medical: 8
medical fusion: 8
fusion framework: 8
model: 5
fetal: 4

Similar Publications

Given the heterogeneous nature of attention-deficit/hyperactivity disorder (ADHD) and the absence of established biomarkers, accurate diagnosis and effective treatment remain a challenge in clinical practice. This study investigates the predictive utility of multimodal data, including eye tracking, EEG, actigraphy, and behavioral indices, in differentiating adults with ADHD from healthy individuals. Using a support vector machine model, we analyzed independent training (n = 50) and test (n = 36) samples from two clinically controlled studies.

Ensemble learning driven Kolmogorov-Arnold Networks-based Lung Cancer classification.

PLoS One

December 2024

Department of Biochemistry, S S Hospital, S S Institute of Medical Sciences & Research Centre, Rajiv Gandhi University of Health Sciences, Davangere, Karnataka, India.

Early Lung Cancer (LC) detection is essential for reducing the global mortality rate. The limitations of traditional diagnostic techniques cause challenges in identifying LC using medical imaging data. In this study, we aim to develop a robust LC detection model.

This study aims to explore the feasibility of applying the "Three-Low" technique (low injection rate, low iodine contrast volume, low radiation dose) in coronary CT angiography (CCTA). We prospectively collected data from 90 patients who underwent CCTA at our hospital between 2021 and 2024. The patients were randomly assigned to either the experimental group (n = 45) or the control group (n = 45).

Comparing auditory and visual aspects of multisensory working memory using bimodally matched feature patterns.

Exp Brain Res

December 2024

Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, CNY 149, 13th St, Charlestown, MA, 02129, USA.

Working memory (WM) reflects the transient maintenance of information in the absence of external input, which can be attained via multiple senses separately or simultaneously. Pertaining to WM, the prevailing literature suggests the dominance of vision over other sensory systems. However, this imbalance may be stemming from challenges in finding comparable stimuli across modalities.

Classification of cervical cancer using Dense CapsNet with Seg-UNet and denoising autoencoders.

Sci Rep

December 2024

Decisions LAB, Department of Law, Economics and Human Sciences, University Mediterranea of Reggio Calabria, Via dei Bianchi, 2, 89131, Reggio Calabria, Italy.

Cervical cancer is one of the deadly diseases that affects women, which requires periodic examinations to identify and treat any cancerous tumors at a preliminary stage. The most prevalent examination tool for cervical cancer prompt identification is the cervical smear (Pap smear) testing; however, due to human negligence, this examination method has an elevated probability of negative findings. Cervical cancer classification using machine learning (ML) and deep learning (DL) has been extensively studied to enhance the conventional diagnostic process.
