Given the visual-semantic hierarchy between images and texts, hyperbolic embeddings have been employed for visual-semantic representation learning, leveraging the advantages of hierarchy modeling in hyperbolic space. This approach demonstrates notable advantages in zero-shot learning tasks. However, unlike general image-text alignment tasks, textual data in the medical domain often comprises complex sentences describing various conditions or diseases, posing challenges for vision language models to comprehend free-text medical reports. Consequently, we propose a novel pretraining method specifically for medical image-text data in hyperbolic space. This method uses structured radiology reports, which consist of a set of triplets, and then converts these triplets into sentences through prompt engineering. To address the challenge that diseases or symptoms generally occur in local regions, we introduce a global + local image feature extraction module. By leveraging the hierarchy modeling advantages of hyperbolic space, we employ entailment loss to model the partial order relationship between images and texts. Experimental results show that our method exhibits better generalization and superior performance compared to baseline methods in various zero-shot tasks and different datasets.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11891115 | PMC |
http://dx.doi.org/10.1007/s13755-025-00341-x | DOI Listing |
Nanomaterials (Basel)
February 2025
Department of Physics, Changzhi University, Changzhi 046011, China.
Metasurface-based longitudinal modulation introduces the propagation distance as a new degree of freedom, extending the light modulation with metasurfaces from 2D to 3D space. However, relevant longitudinal studies have been constrained to designing the metasurface of half-wave plate (HWP) meta-atoms and generating either non-focused or two-channel vortex and vector beams. In this study, we propose a metasurface composed of quarter-wave plate (QWP) meta-atoms to generate the longitudinal multi-channel focused vortex and vector beams.
View Article and Find Full Text PDFHealth Inf Sci Syst
December 2025
Department of Anesthesiology, Shanghai Chest Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030 China.
Given the visual-semantic hierarchy between images and texts, hyperbolic embeddings have been employed for visual-semantic representation learning, leveraging the advantages of hierarchy modeling in hyperbolic space. This approach demonstrates notable advantages in zero-shot learning tasks. However, unlike general image-text alignment tasks, textual data in the medical domain often comprises complex sentences describing various conditions or diseases, posing challenges for vision language models to comprehend free-text medical reports.
View Article and Find Full Text PDFPhys Rev Lett
February 2025
Joint Quantum Institute, University of Maryland, College Park, Maryland 20783 USA.
The interplay between disorder and quantum interference leads to a wide variety of physical phenomena including celebrated Anderson localization-the complete absence of diffusive transport due to quantum interference between different particle trajectories. In two dimensions, any amount of disorder is thought to induce localization of all states at long enough length scales. In this Letter, we present an argument providing a mechanism for disrupting localization: by tuning the underlying curvature of the manifold on which diffusion takes place.
View Article and Find Full Text PDFIEEE J Biomed Health Inform
March 2025
Characterizing age-related alterations in brain networks is crucial for understanding aging trajectories and identifying deviations indicative of neurodegenerative disorders, such as Alzheimer's disease. In this study, we developed a Fully Hyperbolic Neural Network (FHNN) to embed functional brain connectivity graphs derived from magnetoencephalography (MEG) data into low dimensions on a Lorentz model of hyperbolic space. Using this model, we computed hyperbolic embeddings of the MEG brain networks of 587 individuals from the Cambridge Centre for Ageing and Neuroscience (Cam-CAN) dataset.
View Article and Find Full Text PDFPLoS One
March 2025
Faculty of Biology, Insitute of Organismic and Molecular Evolution, Johannes Gutenberg University, Biozentrum I, Mainz, Germany.
Protein-protein interactions (PPIs) form a complex network called "interactome" that regulates many functions in the cell. In recent years, there is an increasing accumulation of evidence supporting the existence of a hyperbolic geometry underlying the network representation of complex systems such as the interactome. In particular, it has been shown that the embedding of the human Protein-Interaction Network (hPIN) in hyperbolic space (H2) captures biologically relevant information.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!