Cross-Modal Hashing With Feature Semi-Interaction and Semantic Ranking for Remote Sensing Ship Image Retrieval
2024 (English) In: IEEE Transactions on Geoscience and Remote Sensing, ISSN 0196-2892, E-ISSN 1558-0644, Vol. 62, article id 4701915. Article in journal (Refereed), Published
Abstract [en]
Cross-modal hashing plays a pivotal role in large-scale remote sensing (RS) ship image retrieval. RS ship images often exhibit similar overall appearance with subtle differences. Existing hashing methods typically employ feature non-interaction strategies to generate common hash codes, which may not effectively capture the correlations between cross-modal ship images to reduce intermodality discrepancies. To address this issue, we propose a novel cross-modal hashing approach based on feature semi-interaction and semantic ranking (FSISR) for RS ship image retrieval. Our FSISR approach not only captures intricate correlations between different ship image modalities, but also enables the construction of hash tables for large-scale retrieval. FSISR comprises a feature semi-interaction module and a semantic ranking objective function. The semi-interaction module utilizes clustering centers from one modality to learn the correlations between two modalities and generate robust shared representations. The objective function, which consists of a shared semantic alignment loss and a margin-free ranking loss, optimizes these representations in a common Hamming space. The alignment loss employs a shared semantic layer to preserve label-level similarity, while the ranking loss incorporates hard examples to establish a margin-free loss that captures similarity ranking relationships. We evaluate the performance of our method on benchmark datasets and demonstrate its effectiveness for cross-modal RS ship image retrieval. Code: https://github.com/sunyuxi/FSISR.
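The retrieval step in a common Hamming space, as mentioned in the abstract, can be illustrated with a minimal sketch. This is not the authors' FSISR implementation: the function names, the 8-bit code length, and the example codes are all hypothetical, and the codes are assumed to have already been produced by some hashing model. It only shows how binary codes from two modalities might be compared and ranked by Hamming distance.

```python
def hamming(a: int, b: int) -> int:
    """Number of differing bits between two hash codes stored as ints."""
    return bin(a ^ b).count("1")


def rank_by_hamming(query: int, database: dict[str, int]) -> list[tuple[str, int]]:
    """Return (item_id, distance) pairs sorted by increasing Hamming distance."""
    scored = [(item_id, hamming(query, code)) for item_id, code in database.items()]
    # Ties are broken by item id so the ordering is deterministic.
    return sorted(scored, key=lambda pair: (pair[1], pair[0]))


# Hypothetical 8-bit codes for images from one modality (made-up values).
database = {
    "ship_a": 0b10110010,
    "ship_b": 0b10110011,
    "ship_c": 0b01001101,
}

# Hypothetical code of a query image from the other modality.
query_code = 0b10110000

ranking = rank_by_hamming(query_code, database)
for item_id, distance in ranking:
    print(item_id, distance)
```

Because the comparison reduces to an XOR and a bit count, codes of this kind can also serve as keys in a hash table, which is what makes the approach attractive at large scale.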
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024. Vol. 62, article id 4701915
Keywords [en]
Marine vehicles, Semantics, Codes, Image retrieval, Correlation, Linear programming, Visualization, Cross-modal remote sensing (RS) ship images, deep supervised hashing, learning to hash, multisource RS images, RS ship image retrieval
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:kth:diva-345157
DOI: 10.1109/TGRS.2024.3368194
ISI: 001173985500014
Scopus ID: 2-s2.0-85186075542
OAI: oai:DiVA.org:kth-345157
DiVA, id: diva2:1849610
Note
QC 20240408
2024-04-08, 2024-04-08, 2025-02-07. Bibliographically approved