ADCNN: Towards learning adaptive dilation for convolutional neural networks Show others and affiliations
2022 (English) In: Pattern Recognition, ISSN 0031-3203, E-ISSN 1873-5142, Vol. 123, article id 108369Article in journal (Refereed) Published
Abstract [en]
Dilated convolution kernels are constrained by their shared dilation, keeping them from being aware of diverse spatial contents at different locations. We address such limitations by formulating the dilation as trainable weights with respect to individual positions. We propose Adaptive Dilation Convolutional Neural Networks (ADCNN), a light-weighted extension that allows convolutional kernels to adjust their dilation value based on different contents at the pixel level. Unlike previous content-adaptive models, ADCNN dynamically infers pixel-wise dilation via modeling feed-forward inter-patterns, which provides a new perspective for developing adaptive network structures other than sampling kernel spaces. Our evaluation results indicate ADCNNs can be easily integrated into various backbone networks and consistently outperform their regular counterparts on various visual tasks.
Place, publisher, year, edition, pages Elsevier BV , 2022. Vol. 123, article id 108369
Keywords [en]
Adaptive dilated convolution, Representation learning, Image classification
National Category
Computer Sciences Computer graphics and computer vision Communication Systems
Identifiers URN: urn:nbn:se:kth:diva-305118 DOI: 10.1016/j.patcog.2021.108369 ISI: 000711834400003 Scopus ID: 2-s2.0-85117736740 OAI: oai:DiVA.org:kth-305118 DiVA, id: diva2:1613311
Note QC 20211122
2021-11-222021-11-222025-02-01 Bibliographically approved