Cross-Modal Hashing With Feature Semi-Interaction and Semantic Ranking for Remote Sensing Ship Image Retrieval
Harbin Inst Technol, Dept Comp Sci, Shenzhen 518055, Peoples R China. ORCID iD: 0000-0002-3040-5880
Harbin Inst Technol, Dept Comp Sci, Shenzhen 518055, Peoples R China.
Soochow Univ, Sch Elect & Informat Engn, Suzhou 215006, Peoples R China. ORCID iD: 0000-0001-6284-3044
Univ Murcia, Dept Comp Sci & Syst, Murcia 30100, Spain.
2024 (English). In: IEEE Transactions on Geoscience and Remote Sensing, ISSN 0196-2892, E-ISSN 1558-0644, Vol. 62, article id 4701915. Article in journal (Refereed). Published.
Abstract [en]

Cross-modal hashing plays a pivotal role in large-scale remote sensing (RS) ship image retrieval. RS ship images often share a similar overall appearance and differ only in subtle details. Existing hashing methods typically generate common hash codes without any feature interaction between modalities, so they may fail to capture the correlations between cross-modal ship images that are needed to reduce intermodality discrepancies. To address this issue, we propose a novel cross-modal hashing approach based on feature semi-interaction and semantic ranking (FSISR) for RS ship image retrieval. Our FSISR approach not only captures intricate correlations between different ship image modalities, but also enables the construction of hash tables for large-scale retrieval. FSISR comprises a feature semi-interaction module and a semantic ranking objective function. The semi-interaction module uses clustering centers from one modality to learn the correlations between the two modalities and generate robust shared representations. The objective function optimizes these representations in a common Hamming space and consists of a shared semantic alignment loss and a margin-free ranking loss. The alignment loss employs a shared semantic layer to preserve label-level similarity, while the ranking loss incorporates hard examples to capture similarity ranking relationships without a margin. We evaluate our method on benchmark datasets and demonstrate its effectiveness for cross-modal RS ship image retrieval. Code: https://github.com/sunyuxi/FSISR.
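
For readers unfamiliar with retrieval in a common Hamming space, the following minimal sketch shows how binary hash codes can be ranked by Hamming distance once they exist. It is not the FSISR method or its published code: the function names `hamming_distance` and `rank_by_hamming`, the item identifiers, and the 8-bit codes are all hypothetical and chosen only for illustration. In a real system the codes would be produced by a trained hashing network for each modality.

```python
from typing import Dict, List, Tuple


def hamming_distance(a: int, b: int) -> int:
    """Count the bit positions in which two integer-encoded hash codes differ."""
    return bin(a ^ b).count("1")


def rank_by_hamming(query_code: int,
                    database_codes: Dict[str, int]) -> List[Tuple[str, int]]:
    """Return (item_id, distance) pairs sorted from nearest to farthest.

    Ties are broken by item_id so the ordering is deterministic.
    """
    scored = [(item_id, hamming_distance(query_code, code))
              for item_id, code in database_codes.items()]
    return sorted(scored, key=lambda pair: (pair[1], pair[0]))


# Hypothetical 8-bit codes for images from one modality (assumption, not real data).
database = {
    "ship_a": 0b10110010,
    "ship_b": 0b10110011,
    "ship_c": 0b01001101,
}

# Hypothetical code for a query image from the other modality.
query = 0b10110110

ranking = rank_by_hamming(query, database)
print(ranking)
```

Because both modalities are mapped into the same Hamming space, the query code from one modality can be compared directly with database codes from the other using only XOR and bit counting, which is what makes hash-based retrieval attractive at large scale.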

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024. Vol. 62, article id 4701915
Keywords [en]
Marine vehicles, Semantics, Codes, Image retrieval, Correlation, Linear programming, Visualization, Cross-modal remote sensing (RS) ship images, deep supervised hashing, learning to hash, multisource RS images, RS ship image retrieval
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:kth:diva-345157
DOI: 10.1109/TGRS.2024.3368194
ISI: 001173985500014
Scopus ID: 2-s2.0-85186075542
OAI: oai:DiVA.org:kth-345157
DiVA, id: diva2:1849610
Note

QC 20240408

Available from: 2024-04-08. Created: 2024-04-08. Last updated: 2025-02-07. Bibliographically approved.

Open Access in DiVA

No full text in DiVA


Authority records

Ban, Yifang; Hafner, Sebastian

Search in DiVA

By author/editor
Sun, Yuxi; Kang, Jian; Ban, Yifang; Hafner, Sebastian; Plaza, Antonio
By organisation
Geoinformatics