kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Visual Instance Retrieval with Deep Convolutional Networks
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0003-2784-7300
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0002-4266-6746
2019 (English)In: Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, ISSN 1342-6907, Vol. 73, no 5, p. 956-964Article in journal (Refereed) Published
Abstract [en]

This paper provides an extensive study on the availability of image representations based on convolutional networks (ConvNets) for the task of visual instance retrieval.Besides the choice of convolutional layers, we present an efficient pipeline exploiting multi-scale schemes to extract local features, in particular, by taking geometric invariancc into explicit account, i.e.positions, scales and spatial consistency.In our experiments using five standard image retrieval datasets, we demonstrate that generic ConvNet image representations can outperform other state-of-the-art methods if they are extracted appropriately. 

Place, publisher, year, edition, pages
Institute of Image Information and Television Engineers , 2019. Vol. 73, no 5, p. 956-964
Keywords [en]
Convolutional network, Learning representation, Multi-resolution search, Visual instance retrieval, Convolution, Image retrieval, Convolutional networks, Image representations, Instance retrieval, Local feature, Multi-scales, Spatial consistency, Standard images, Image representation
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:kth:diva-328992DOI: 10.3169/ITEJ.73.956Scopus ID: 2-s2.0-85142357643OAI: oai:DiVA.org:kth-328992DiVA, id: diva2:1767298
Note

QC 20230614

Available from: 2023-06-14 Created: 2023-06-14 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Razavian, Ali SharifSullivan, JosephineCarlsson, StefanMaki, Atsuto

Search in DiVA

By author/editor
Razavian, Ali SharifSullivan, JosephineCarlsson, StefanMaki, Atsuto
By organisation
Robotics, Perception and Learning, RPL
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 114 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf