kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
RWKVSR: Receptance Weighted Key-Value Network for Hyperspectral Image Super-Resolution
Guangzhou University, School of Electronic and Communication Engineering, Guangzhou, China.
Guangzhou University, School of Electronic and Communication Engineering, Guangzhou, China.
Chinese Academy of Sciences, Aerospace Information Research Institute, Beijing, China, 100094; University of Macau, Department of Computer and Information Science, Macau, China.
Guangzhou University, School of Electronic and Communication Engineering, Guangzhou, China.
Show others and affiliations
2025 (English)In: IEEE transactions on circuits and systems for video technology (Print), ISSN 1051-8215, E-ISSN 1558-2205Article in journal (Refereed) Epub ahead of print
Abstract [en]

Deep learning has achieved significant success in hyperspectral image super-resolution (HSISR) by leveraging advanced feature extraction techniques to reconstruct high-resolution images from low-resolution counterparts. However, existing methods predominantly utilize 2D/3D convolutions or Transformer architectures, which are often hindered by limited receptive fields, quadratic computational complexity, and inadequate fusion of spatial-spectral dependencies. To address these challenges, this paper proposes RWKVSR, a novel lightweight network that integrates a Receptance Weighted Key-Value (RWKV) architecture for efficient HSISR. The proposed RWKVSR comprises of three key components: (1) A linear-complexity RWKV module replacing quadratic self-attention, enabling efficient global spectral-spatial modeling; (2) A Spectral-Spatial Residual Module (SSRM) employing anisotropic, direction-separable 3D convolutions to hierarchically extract multi-scale features while enhancing local-global interactions; and (3) A Hyperspectral Frequency Loss (HFL) optimizing spectral consistency by prioritizing high-frequency structural alignment between reconstructed and ground-truth images in the frequency domain. Extensive experiments conducted on the CAVE and Harvard datasets demonstrate that RWKVSR outperforms the existing state-of-the-art methods, effectively balancing accuracy and efficiency, and providing a practical solution for high-quality HSI reconstruction.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2025.
Keywords [en]
Deep Learning, Hyperspectral image Super-resolution, Receptance Weighted Key-Value Network, Transformer
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:kth:diva-373151DOI: 10.1109/TCSVT.2025.3626779Scopus ID: 2-s2.0-105020750828OAI: oai:DiVA.org:kth-373151DiVA, id: diva2:2015425
Note

QC 20251121

Available from: 2025-11-21 Created: 2025-11-21 Last updated: 2025-11-21Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Ban, Yifang

Search in DiVA

By author/editor
Ban, Yifang
By organisation
Geoinformatics
In the same journal
IEEE transactions on circuits and systems for video technology (Print)
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 55 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf