In recent decades, supervised cross-modal hashing methods have attracted considerable attention due to their high search efficiency on large-scale multimedia databases. Many of these methods exploit semantic correlations among heterogeneous modalities by constructing a similarity matrix or by building a common semantic space with collective matrix factorization. However, in existing methods the similarity matrix may sacrifice scalability and limits how much semantic information can be preserved in the hash codes. Meanwhile, matrix factorization methods cannot embed the main modality-specific information into the hash codes. To address these issues, we propose a novel supervised cross-modal hashing method called random online hashing (ROH) in this article. ROH uses a linear bridging strategy to simplify the pairwise similarity factorization problem into a linear optimization problem. Specifically, a bridging matrix is introduced to establish a bidirectional linear relation between hash codes and labels, which preserves more semantic similarity in the hash codes and significantly reduces the semantic distances between hash codes of samples with similar labels. Additionally, a novel maximum eigenvalue direction (MED) embedding method is proposed to identify the direction of maximum eigenvalue of the original features and preserve critical information in modality-specific hash codes. Finally, to handle real-time data dynamically, an online structure is adopted so that newly arriving data chunks can be processed without considering pairwise constraints. Extensive experimental results on three benchmark datasets demonstrate that the proposed ROH outperforms several state-of-the-art cross-modal hashing methods.
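
As a rough illustration of what "direction of maximum eigenvalue" can mean for a feature matrix, the sketch below finds the dominant eigenvector of the feature covariance by power iteration and derives one sign bit per sample from the projection onto it. This is not the ROH or MED algorithm from the article, whose details the abstract does not give; the function names (`center`, `top_eigen_direction`, `sign_bits`), the starting vector, and the toy data are all assumptions made for illustration.

```python
def center(X):
    """Subtract the per-feature mean from every row of X (a list of rows)."""
    n, d = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(d)]
    return [[row[j] - means[j] for j in range(d)] for row in X]


def top_eigen_direction(X, iters=100):
    """Approximate the dominant eigenvector of Xc^T Xc by power iteration.

    Assumption: the starting vector is not orthogonal to the dominant
    eigenvector, which holds for generic data but is not guaranteed.
    """
    Xc = center(X)
    d = len(Xc[0])
    # Unnormalized covariance matrix C = Xc^T Xc (d x d).
    C = [[sum(r[i] * r[j] for r in Xc) for j in range(d)] for i in range(d)]
    v = [1.0 / (i + 1) for i in range(d)]  # arbitrary nonzero start (assumed)
    for _ in range(iters):
        w = [sum(C[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = sum(x * x for x in w) ** 0.5
        if norm == 0.0:
            break  # degenerate data: all features are constant
        v = [x / norm for x in w]
    return v


def sign_bits(X, v):
    """One hash bit (+1 or -1) per sample: the sign of its projection onto v."""
    Xc = center(X)
    return [1 if sum(a * b for a, b in zip(row, v)) >= 0 else -1 for row in Xc]


# Toy features that vary mostly along the first axis (hypothetical data).
X = [[2.0, 0.1], [-2.0, -0.1], [3.0, 0.0], [-3.0, 0.0], [1.0, 0.2], [-1.0, -0.2]]
v = top_eigen_direction(X)
print(v, sign_bits(X, v))
```

A single direction yields only one bit per sample; a practical hashing scheme would need several projections and, in the supervised setting the abstract describes, a learned relation between the codes and the labels.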

Source
http://dx.doi.org/10.1109/TNNLS.2023.3330975

Publication Analysis

Top Keywords

hash codes: 24
cross-modal hashing: 12
random online: 8
online hashing: 8
supervised cross-modal: 8
hashing methods: 8
similarity matrix: 8
matrix factorization: 8
modality-specific hash: 8
maximum eigenvalue: 8

Similar Publications

Article Synopsis
  • The study addresses challenges in retrieving microscopic images of osteosarcoma by using advanced deep hashing techniques and attention mechanisms, which enhance both efficiency and accuracy in image retrieval.
  • The algorithm employs various preprocessing methods and a WRN-AM model for feature extraction, achieving a high classification accuracy of 93.2% and a mean Average Precision (mAP) of 97.09% with 64-bit hash codes.
  • This innovative method not only improves the retrieval process for healthcare professionals, aiding in faster diagnosis and treatment planning, but also benefits researchers by enhancing the utilization of medical image data for further advancements in the field.

Text-Enhanced Graph Attention Hashing for Cross-Modal Retrieval.

Entropy (Basel)

October 2024

College of Computer Science and Technology, Xinjiang University, Urumqi 830046, China.

Deep hashing technology, known for its low-cost storage and rapid retrieval, has become a focal point in cross-modal retrieval research as multimodal data continue to grow. However, existing supervised methods often overlook noisy labels and multiscale features in different modal datasets, leading to higher information entropy in the generated hash codes and features, which reduces retrieval performance. The variation in text annotation information across datasets further increases the information entropy during text feature extraction, resulting in suboptimal outcomes.

Parameter Adaptive Contrastive Hashing for multimedia retrieval.

Neural Netw

February 2025

Big Data Institute, School of Computer Science and Engineering, Central South University, Changsha, Hunan, 410000, China.

With the emergence of massive amounts of multi-source heterogeneous data on the Internet, quickly retrieving useful information from this data has become a hot research topic. Because hash learning methods are efficient and fast, they have become a mainstream approach to multimedia retrieval. However, unsupervised multimedia hash learning methods still face two challenges: they are difficult to tune because of their excessive number of hyperparameters, and they lack precise guidance on semantic similarity.

In this research, we present PerceptHashing, a technique designed to categorize million-scale agricultural scenic images by incorporating human gaze shifting paths (GSPs) into a hashing framework. For each agricultural image, we identify visually and semantically significant object patches, such as fields, crops, and water bodies. These patches are linked to form a graphlet, establishing a network of spatially adjacent patches, and a GSP is then extracted using an active learning algorithm.

The post-processing of quantum key distribution mainly consists of error correction and privacy amplification. The error correction algorithms and privacy amplification methods used in existing quantum key distribution systems are completely unrelated to each other. Based on the correspondence between error-correcting codes and hash function families, we propose, for the first time, the idea of time-division multiplexing for error correction and privacy amplification.
