Regulating Modality Utilization within Multimodal Fusion Networks.

Saurav Singh Eli Saber Panos P Markopoulos Jamison Heard

Sensors (Basel)

Department of Electrical & Microelectronic Engineering, Rochester Institute of Technology, Rochester, NY 14623, USA.

Published: September 2024

Multimodal fusion networks play a pivotal role in leveraging diverse sources of information for enhanced machine learning applications in aerial imagery. However, current approaches often suffer from a bias towards certain modalities, diminishing the potential benefits of multimodal data. This paper addresses this issue by proposing a novel modality utilization-based training method for multimodal fusion networks. The method aims to guide the network's utilization on its input modalities, ensuring a balanced integration of complementary information streams, effectively mitigating the overutilization of dominant modalities. The method is validated on multimodal aerial imagery classification and image segmentation tasks, effectively maintaining modality utilization within ±10% of the user-defined target utilization and demonstrating the versatility and efficacy of the proposed method across various applications. Furthermore, the study explores the robustness of the fusion networks against noise in input modalities, a crucial aspect in real-world scenarios. The method showcases better noise robustness by maintaining performance amidst environmental changes affecting different aerial imagery sensing modalities. The network trained with 75.0% EO utilization achieves significantly better accuracy (81.4%) in noisy conditions (noise variance = 0.12) compared to traditional training methods with 99.59% EO utilization (73.7%). Additionally, it maintains an average accuracy of 85.0% across different noise levels, outperforming the traditional method's average accuracy of 81.9%. Overall, the proposed approach presents a significant step towards harnessing the full potential of multimodal data fusion in diverse machine learning applications such as robotics, healthcare, satellite imagery, and defense applications.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11435562	PMC
http://dx.doi.org/10.3390/s24186054	DOI Listing

Publication Analysis

Top Keywords

fusion networks

multimodal fusion

aerial imagery

modality utilization

machine learning

learning applications

multimodal data

input modalities

average accuracy

utilization

Similar Publications

scMMAE: masked cross-attention network for single-cell multimodal omics fusion to enhance unimodal omics.

Brief Bioinform

November 2024

Guangdong Provincial Key Laboratory of Mathematical and Neural Dynamical Systems, Great Bay University, No. 16 Daxue Rd, Songshanhu District, Dongguan, Guangdong, 523000, China.

Dian Meng Yu Feng Kaishen Yuan Zitong Yu Qin Cao

Multimodal omics provide deeper insight into the biological processes and cellular functions, especially transcriptomics and proteomics. Computational methods have been proposed for the integration of single-cell multimodal omics of transcriptomics and proteomics. However, existing methods primarily concentrate on the alignment of different omics, overlooking the unique information inherent in each omics type.

View Article and Find Full Text PDF

Similar Publications

Color fundus photograph-based diabetic retinopathy grading via label relaxed collaborative learning on deep features and radiomics features.

Front Cell Dev Biol

January 2025

Department of Medical Informatics, Nantong University, Nantong, Jiangsu, China.

Chao Zhang Guanglei Sheng Jie Su Lian Duan

Introduction: Diabetic retinopathy (DR) has long been recognized as a common complication of diabetes, making accurate automated grading of its severity essential. Color fundus photographs play a crucial role in the grading of DR. With the advancement of artificial intelligence technologies, numerous researchers have conducted studies on DR grading based on deep features and radiomic features extracted from color fundus photographs.

View Article and Find Full Text PDF

Similar Publications

Advancing precision agriculture with deep learning enhanced SIS-YOLOv8 for Solanaceae crop monitoring.

Front Plant Sci

January 2025

College of Information Technology, Jilin Agricultural University, Changchun, China.

Ruiqian Qin Yiming Wang Xiaoyan Liu Helong Yu

Introduction: Potatoes and tomatoes are important Solanaceae crops that require effective disease monitoring for optimal agricultural production. Traditional disease monitoring methods rely on manual visual inspection, which is inefficient and prone to subjective bias. The application of deep learning in image recognition has led to object detection models such as YOLO (You Only Look Once), which have shown high efficiency in disease identification.

View Article and Find Full Text PDF

Similar Publications

A deep learning approach for early prediction of breast cancer neoadjuvant chemotherapy response on multistage bimodal ultrasound images.

BMC Med Imaging

January 2025

Department of Ultrasound, Renji Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030, China.

Jiang Xie Jinzhu Wei Huachan Shi Zhe Lin Jinsong Lu

Neoadjuvant chemotherapy (NAC) is a systemic and systematic chemotherapy regimen for breast cancer patients before surgery. However, NAC is not effective for everyone, and the process is excruciating. Therefore, accurate early prediction of the efficacy of NAC is essential for the clinical diagnosis and treatment of patients.

View Article and Find Full Text PDF

Similar Publications

Embracing the Future of Clinical Trials in Radiotherapy: An NRG Oncology CIRO Technology Retreat Whitepaper on Pioneering Technologies and AI-Driven Solutions.

Int J Radiat Oncol Biol Phys

January 2025

National Cancer Institute, Bethesda, MD. Electronic address:

Ying Xiao Stanley Benedict Yunfeng Cui Carri Glide-Hurst Stephen Graves

This white paper examines the potential of pioneering technologies and artificial intelligence (AI)-driven solutions in advancing clinical trials involving radiotherapy. As the field of radiotherapy evolves, the integration of cutting-edge approaches such as radiopharmaceutical dosimetry, FLASH radiotherapy, image-guided radiation therapy (IGRT), and AI promises to improve treatment planning, patient care, and outcomes. Additionally, recent advancements in quantum science, linear energy transfer/relative biological effect (LET/RBE), and the combination of radiotherapy and immunotherapy create new avenues for innovation in clinical trials.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!