Multimodal reasoning based on knowledge graph embedding for specific diseases.

Bioinformatics

Institute of Biomedical Sciences and School of Basic Medical Science, Shanghai Medical College, Fudan University, Shanghai 200032, China.

Published: April 2022

Motivation: Knowledge graphs (KGs) are becoming increasingly important in the biomedical field. Deriving new and reliable knowledge from existing knowledge via KG embedding is a cutting-edge method. Some approaches incorporate a variety of additional information to aid reasoning, known as multimodal reasoning. However, few works based on existing biomedical KGs focus on specific diseases.

Results: This work develops a construction and multimodal reasoning process for Specific Disease Knowledge Graphs (SDKGs). We construct SDKG-11, an SDKG set comprising five cancers, six non-cancer diseases, a combined Cancer5 and a combined Diseases11, aiming to discover new reliable knowledge and to provide universal pre-trained knowledge for those specific disease fields. SDKG-11 is built through original triplet extraction, standard entity set construction, entity linking and relation linking. We implement multimodal reasoning by reverse-hyperplane projection for SDKGs, based on structure, category and description embeddings. Multimodal reasoning improves on pre-existing models across all SDKGs, using the entity prediction task as the evaluation protocol. We verify the model's reliability in discovering new knowledge by manually proofreading predicted drug-gene, gene-disease and disease-drug pairs. Using the embedding results as initialization parameters for biomolecular interaction classification, we demonstrate the universality of the embedding models.
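The abstract does not give the exact scoring function behind "reverse-hyperplane projection." As an illustration only, a TransH-style hyperplane-projection score, a common basis for this family of KG embedding models, can be sketched as follows; all names, dimensions and candidate sets here are hypothetical, not the authors' implementation.

```python
import numpy as np

def project_to_hyperplane(e, w):
    """Project embedding e onto the hyperplane with normal vector w."""
    w = w / np.linalg.norm(w)          # ensure the normal is unit length
    return e - np.dot(e, w) * w        # remove the component along w

def score(h, r, t, w_r):
    """Plausibility score of triple (h, r, t); lower means more plausible."""
    h_p = project_to_hyperplane(h, w_r)
    t_p = project_to_hyperplane(t, w_r)
    return np.linalg.norm(h_p + r - t_p)

rng = np.random.default_rng(0)
dim = 8
h, r, t, w_r = (rng.normal(size=dim) for _ in range(4))

# Entity prediction protocol: rank candidate tail entities by score
# and report where the true tail lands (e.g. hits@k, mean rank).
candidates = rng.normal(size=(5, dim))
ranking = np.argsort([score(h, r, c, w_r) for c in candidates])
print("best candidate index:", ranking[0])
```

In a multimodal variant, h and t would combine structure, category and description embeddings before projection; the evaluation then ranks every entity in the KG, not just five random candidates.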

Availability And Implementation: The constructed SDKG-11 and the implementation in TensorFlow are available from https://github.com/ZhuChaoY/SDKG-11.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9004655
DOI: http://dx.doi.org/10.1093/bioinformatics/btac085


Similar Publications

DICCR: Double-gated intervention and confounder causal reasoning for vision-language navigation.

Neural Netw

December 2024

School of Computer and Electronic Information, Guangxi University, University Road, Nanning, 530004, Guangxi, China. Electronic address:

Vision-language navigation (VLN) is a challenging task that requires agents to capture the correlation between different modalities from redundant information according to instructions, and then make sequential decisions on visual scenes and text instructions in the action space. Recent research has focused on extracting visual features and enhancing text knowledge, ignoring the potential bias in multi-modal data and the problem of spurious correlations between vision and text. Therefore, this paper studies the relationship structure between multi-modal data from the perspective of causality and weakens the potential correlation between different modalities through cross-modal causality reasoning.


Integrating Vision and Olfaction via Multi-Modal LLM for Robotic Odor Source Localization.

Sensors (Basel)

December 2024

Department of Computer Science, Louisiana Tech University, 201 Mayfield Ave, Ruston, LA 71272, USA.

Odor source localization (OSL) technology allows autonomous agents like mobile robots to localize a target odor source in an unknown environment. This is achieved by an OSL navigation algorithm that processes an agent's sensor readings to calculate action commands to guide the robot to locate the odor source. Compared to traditional 'olfaction-only' OSL algorithms, our proposed OSL algorithm integrates vision and olfaction sensor modalities to localize odor sources even if olfaction sensing is disrupted by non-unidirectional airflow or vision sensing is impaired by environmental complexities.


To enhance enterprises' interactive exploration capabilities for unstructured chart data, this paper proposes a multimodal chart question-answering method. Facing the challenge of recognizing curved and irregular text in charts, we introduce Gaussian heatmap encoding technology to achieve character-level precise text annotation. Additionally, we combine a key point detection algorithm to extract numerical information from the charts and convert it into structured table data.
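Gaussian heatmap encoding, as named in the chart question-answering summary above, typically renders a 2D Gaussian peak at each character center so a detector can regress character locations. A minimal sketch of that encoding (the function name, sizes and centers are hypothetical, not from the paper):

```python
import numpy as np

def gaussian_heatmap(height, width, centers, sigma=2.0):
    """Render a heatmap with a 2D Gaussian peak at each (x, y) character center."""
    ys, xs = np.mgrid[0:height, 0:width]
    heatmap = np.zeros((height, width))
    for cx, cy in centers:
        g = np.exp(-((xs - cx) ** 2 + (ys - cy) ** 2) / (2 * sigma ** 2))
        heatmap = np.maximum(heatmap, g)   # overlapping characters keep the max
    return heatmap

# Three character centers along one line of curved chart text.
hm = gaussian_heatmap(32, 64, centers=[(10, 16), (30, 16), (50, 16)])
print(hm.shape, hm.max())  # peaks reach 1.0 at each center
```

Because the Gaussian is smooth, this tolerates curved and irregular text better than hard box labels, which is the motivation the summary cites.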


Dense Paraphrasing for multimodal dialogue interpretation.

Front Artif Intell

December 2024

Computer Science Department, Brandeis University, Waltham, MA, United States.

Multimodal dialogue involving multiple participants presents complex computational challenges, primarily due to the rich interplay of diverse communicative modalities including speech, gesture, action, and gaze. These modalities interact in complex ways that traditional dialogue systems often struggle to accurately track and interpret. To address these challenges, we extend the textual enrichment strategy of Dense Paraphrasing (DP), by translating each nonverbal modality into linguistic expressions.


The integration of large language models (LLMs) into clinical diagnostics has the potential to transform doctor-patient interactions. However, the readiness of these models for real-world clinical application remains inadequately tested. This paper introduces the Conversational Reasoning Assessment Framework for Testing in Medicine (CRAFT-MD) approach for evaluating clinical LLMs.

