kth.sePublications
System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
ADCNN: Towards learning adaptive dilation for convolutional neural networks
Beijing Jiaotong Univ, Sch Software Engn, Beijing 100044, Peoples R China..
Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA..
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.
Beijing Jiaotong Univ, Sch Software Engn, Beijing 100044, Peoples R China..
Show others and affiliations
2022 (English)In: Pattern Recognition, ISSN 0031-3203, E-ISSN 1873-5142, Vol. 123, article id 108369Article in journal (Refereed) Published
Abstract [en]

Dilated convolution kernels are constrained by their shared dilation, keeping them from being aware of diverse spatial contents at different locations. We address such limitations by formulating the dilation as trainable weights with respect to individual positions. We propose Adaptive Dilation Convolutional Neural Networks (ADCNN), a light-weighted extension that allows convolutional kernels to adjust their dilation value based on different contents at the pixel level. Unlike previous content-adaptive models, ADCNN dynamically infers pixel-wise dilation via modeling feed-forward inter-patterns, which provides a new perspective for developing adaptive network structures other than sampling kernel spaces. Our evaluation results indicate ADCNNs can be easily integrated into various backbone networks and consistently outperform their regular counterparts on various visual tasks.

Place, publisher, year, edition, pages
Elsevier BV , 2022. Vol. 123, article id 108369
Keywords [en]
Adaptive dilated convolution, Representation learning, Image classification
National Category
Computer Sciences Computer graphics and computer vision Communication Systems
Identifiers
URN: urn:nbn:se:kth:diva-305118DOI: 10.1016/j.patcog.2021.108369ISI: 000711834400003Scopus ID: 2-s2.0-85117736740OAI: oai:DiVA.org:kth-305118DiVA, id: diva2:1613311
Note

QC 20211122

Available from: 2021-11-22 Created: 2021-11-22 Last updated: 2025-02-01Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Hu, Hao

Search in DiVA

By author/editor
Hu, Hao
By organisation
Robotics, Perception and Learning, RPL
In the same journal
Pattern Recognition
Computer SciencesComputer graphics and computer visionCommunication Systems

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 173 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf