Öppna denna publikation i ny flik eller fönster >>Visa övriga...
2022 (Engelska)Ingår i: Pattern Recognition, ISSN 0031-3203, E-ISSN 1873-5142, Vol. 123, artikel-id 108369Artikel i tidskrift (Refereegranskat) Published
Abstract [en]
Dilated convolution kernels are constrained by their shared dilation, keeping them from being aware of diverse spatial contents at different locations. We address such limitations by formulating the dilation as trainable weights with respect to individual positions. We propose Adaptive Dilation Convolutional Neural Networks (ADCNN), a light-weighted extension that allows convolutional kernels to adjust their dilation value based on different contents at the pixel level. Unlike previous content-adaptive models, ADCNN dynamically infers pixel-wise dilation via modeling feed-forward inter-patterns, which provides a new perspective for developing adaptive network structures other than sampling kernel spaces. Our evaluation results indicate ADCNNs can be easily integrated into various backbone networks and consistently outperform their regular counterparts on various visual tasks.
Ort, förlag, år, upplaga, sidor
Elsevier BV, 2022
Nyckelord
Adaptive dilated convolution, Representation learning, Image classification
Nationell ämneskategori
Datavetenskap (datalogi) Datorgrafik och datorseende Kommunikationssystem
Identifikatorer
urn:nbn:se:kth:diva-305118 (URN)10.1016/j.patcog.2021.108369 (DOI)000711834400003 ()2-s2.0-85117736740 (Scopus ID)
Anmärkning
QC 20211122
2021-11-222021-11-222025-02-01Bibliografiskt granskad