FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval.

IEEE Trans Neural Netw Learn Syst

Published: November 2022

Cross-modal hashing, favored for its effectiveness and efficiency, has received wide attention to facilitating efficient retrieval across different modalities. Nevertheless, most existing methods do not sufficiently exploit the discriminative power of semantic information when learning the hash codes while often involving time-consuming training procedure for handling the large-scale dataset. To tackle these issues, we formulate the learning of similarity-preserving hash codes in terms of orthogonally rotating the semantic data, so as to minimize the quantization loss of mapping such data to hamming space and propose an efficient fast discriminative discrete hashing (FDDH) approach for large-scale cross-modal retrieval. More specifically, FDDH introduces an orthogonal basis to regress the targeted hash codes of training examples to their corresponding semantic labels and utilizes the ε -dragging technique to provide provable large semantic margins. Accordingly, the discriminative power of semantic information can be explicitly captured and maximized. Moreover, an orthogonal transformation scheme is further proposed to map the nonlinear embedding data into the semantic subspace, which can well guarantee the semantic consistency between the data feature and its semantic representation. Consequently, an efficient closed-form solution is derived for discriminative hash code learning, which is very computationally efficient. In addition, an effective and stable online learning strategy is presented for optimizing modality-specific projection functions, featuring adaptivity to different training sizes and streaming data. The proposed FDDH approach theoretically approximates the bi-Lipschitz continuity, runs sufficiently fast, and also significantly improves the retrieval performance over the state-of-the-art methods. The source code is released at https://github.com/starxliu/FDDH.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TNNLS.2021.3076684	DOI Listing

Publication Analysis

Top Keywords

hash codes

fast discriminative

discriminative discrete

discrete hashing

large-scale cross-modal

cross-modal retrieval

discriminative power

semantic

power semantic

fddh approach

Similar Publications

Enhanced Image Retrieval Using Multiscale Deep Feature Fusion in Supervised Hashing.

J Imaging

January 2025

RCAM Laboratory, Telecommunications Department, Sidi Bel Abbes University, Sidi Bel Abbes 22000, Algeria.

Amina Belalia Kamel Belloulata Adil Redaoui

In recent years, deep-network-based hashing has gained prominence in image retrieval for its ability to generate compact and efficient binary representations. However, most existing methods predominantly focus on high-level semantic features extracted from the final layers of networks, often neglecting structural details that are crucial for capturing spatial relationships within images. Achieving a balance between preserving structural information and maximizing retrieval accuracy is the key to effective image hashing and retrieval.

View Article and Find Full Text PDF

Similar Publications

Deep hashing and attention mechanism-based image retrieval of osteosarcoma scans for diagnosis of bone cancer.

J Bone Oncol

December 2024

College of Engineering, Huaqiao University, Quanzhou 362021, China.

Taisheng Zeng Yuguang Ye Yusi Chen Daxin Zhu Yifeng Huang

Article Synopsis

- The study addresses challenges in retrieving microscopic images of osteosarcoma by using advanced deep hashing techniques and attention mechanisms, which enhance both efficiency and accuracy in image retrieval.
- The algorithm employs various preprocessing methods and a WRN-AM model for feature extraction, achieving a high classification accuracy of 93.2% and a mean Average Precision (mAP) of 97.09% with 64-bit hash codes.
- This innovative method not only improves the retrieval process for healthcare professionals, aiding in faster diagnosis and treatment planning, but also benefits researchers by enhancing the utilization of medical image data for further advancements in the field.

View Article and Find Full Text PDF

Similar Publications

Text-Enhanced Graph Attention Hashing for Cross-Modal Retrieval.

Entropy (Basel)

October 2024

College of Computer Science and Technology, Xinjiang University, Urumqi 830046, China.

Qiang Zou Shuli Cheng Anyu Du Jiayi Chen

Deep hashing technology, known for its low-cost storage and rapid retrieval, has become a focal point in cross-modal retrieval research as multimodal data continue to grow. However, existing supervised methods often overlook noisy labels and multiscale features in different modal datasets, leading to higher information entropy in the generated hash codes and features, which reduces retrieval performance. The variation in text annotation information across datasets further increases the information entropy during text feature extraction, resulting in suboptimal outcomes.

View Article and Find Full Text PDF

Similar Publications

Parameter Adaptive Contrastive Hashing for multimedia retrieval.

Neural Netw

February 2025

Big Data Institute, School of Computer Science and Engineering, Central South University, ChangSha, Hunan, 410000, China. Electronic address:

Yunfei Chen Yitian Long Zhan Yang Jun Long

With the emergence of massive amounts of multi-source heterogeneous data on the Internet, quickly retrieving effective information from this extensive data has become a hot research topic. Due to the efficiency and speed of hash learning methods in multimedia retrieval, they have become a mainstream method for multimedia retrieval. However, unsupervised multimedia hash learning methods still face challenges with the difficulties of tuning due to the excessive number of hyperparameters and the lack of precise guidance on semantic similarity.

View Article and Find Full Text PDF

Similar Publications

A novel binary hashing for agricultural scenery classification.

Sci Rep

November 2024

College of Computer Sciences, Beijing Technology and Business University, Beijing, 102488, China.

Han Wang Jun-He Liu Yi Yang

In this research, we present PerceptHashing, a technique designed to categorize million-scale agricultural scenic images by incorporating human gaze shifting paths (GSPs) into a hashing framework. For each agricultural image, we identify visually and semantically significant object patches, such as fields, crops, and water bodies. These patches are linked to form a graphlet, establishing a network of spatially adjacent patches, and a GSP is then extracted using an active learning algorithm.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!