Learning to Embed Semantic Similarity for Joint Image-Text Retrieval.

IEEE Trans Pattern Anal Mach Intell

Published: December 2022

We present a deep learning approach for learning the joint semantic embeddings of images and captions in a euclidean space, such that the semantic similarity is approximated by the L distances in the embedding space. For that, we introduce a metric learning scheme that utilizes multitask learning to learn the embedding of identical semantic concepts using a center loss. By introducing a differentiable quantization scheme into the end-to-end trainable network, we derive a semantic embedding of semantically similar concepts in euclidean space. We also propose a novel metric learning formulation using an adaptive margin hinge loss, that is refined during the training phase. The proposed scheme was applied to the MS-COCO, Flicke30K and Flickr8K datasets, and was shown to compare favorably with contemporary state-of-the-art approaches.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2021.3132163DOI Listing

Publication Analysis

Top Keywords

semantic similarity
8
euclidean space
8
metric learning
8
learning
6
semantic
5
learning embed
4
embed semantic
4
similarity joint
4
joint image-text
4
image-text retrieval
4

Similar Publications

In order to solve the limitations of flipped classroom in personalized teaching and interactive effect improvement, this paper designs a new model of flipped classroom in colleges and universities based on Virtual Reality (VR) by combining the algorithm of Contrastive Language-Image Pre-Training (CLIP). Through cross-modal data fusion, the model deeply combines students' operation behavior with teaching content, and improves teaching effect through intelligent feedback mechanism. The test data shows that the similarity between video and image modes reaches 0.

View Article and Find Full Text PDF

ARCH: Large-scale knowledge graph via aggregated narrative codified health records analysis.

J Biomed Inform

January 2025

Harvard T.H. Chan School of Public Health, 677 Huntington Ave, Boston, 02115, MA, USA; VA Boston Healthcare System, 150 S Huntington Ave, Boston, 02130, MA, USA. Electronic address:

Objective: Electronic health record (EHR) systems contain a wealth of clinical data stored as both codified data and free-text narrative notes (NLP). The complexity of EHR presents challenges in feature representation, information extraction, and uncertainty quantification. To address these challenges, we proposed an efficient Aggregated naRrative Codified Health (ARCH) records analysis to generate a large-scale knowledge graph (KG) for a comprehensive set of EHR codified and narrative features.

View Article and Find Full Text PDF

Unlabelled: We investigated the impact of participation in post-secondary university education (PSE) on the academic knowledge of adult students with severe intellectual disability and extensive support needs (SIDESN) vs. a similar group of controls who did not participate in PSE. We also examined whether the PSE would result in a "near transfer" to basic crystallized (facts and information) and fluid (problems involving executive functions and working memory) cognitive abilities, the contribution of background characteristics and crystallized and fluid abilities to their academic knowledge, semantic fluency and temporal relations.

View Article and Find Full Text PDF

People with concealable stigmatized identities may strategically share or hide cues to their identity. They may likewise seek or avoid interpersonal invisibility (i.e.

View Article and Find Full Text PDF

Modern dialogue systems rely on emotion recognition in conversation (ERC) as a core element enabling empathetic and human-like interactions. However, the weak correlation between emotions and semantics poses significant challenges to emotion recognition in dialogue. Semantically similar utterances can express different types of emotions, depending on the context or speaker.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!