kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Function Space and Critical Points of Linear Convolutional Networks
KTH, School of Engineering Sciences (SCI), Mathematics (Dept.).ORCID iD: 0000-0002-4627-8812
Departments of Mathematics and Statistics, UCLA, Los Angeles, CA 90095 USA; Max Planck Institute for Mathematics in the Sciences, Leipzig, 04103, Germany.
Amazon AWS AI Labs, New York, NY, USA.
2024 (English)In: SIAM Journal on Applied Algebra and Geometry, E-ISSN 2470-6566, Vol. 8, no 2, p. 333-362Article in journal (Refereed) Published
Abstract [en]

We study the geometry of linear networks with one-dimensional convolutional layers. The function spaces of these networks can be identified with semialgebraic families of polynomials admitting sparse factorizations. We analyze the impact of the network's architecture on the function space's dimension, boundary, and singular points. We also describe the critical points of the network's parameterization map. Furthermore, we study the optimization problem of training a network with the squared error loss. We prove that for architectures where all strides are larger than one and generic data, the nonzero critical points of that optimization problem are smooth interior points of the function space. This property is known to be false for dense linear networks and linear convolutional networks with stride one.

Place, publisher, year, edition, pages
Society for Industrial & Applied Mathematics (SIAM) , 2024. Vol. 8, no 2, p. 333-362
Keywords [en]
critical points, neural network, semialgebraic set
National Category
Mathematics
Identifiers
URN: urn:nbn:se:kth:diva-347621DOI: 10.1137/23M1565504ISI: 001231979000001Scopus ID: 2-s2.0-85195041549OAI: oai:DiVA.org:kth-347621DiVA, id: diva2:1869216
Note

QC 20240613

Available from: 2024-06-12 Created: 2024-06-12 Last updated: 2024-06-13Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Kohn, Kathlén

Search in DiVA

By author/editor
Kohn, Kathlén
By organisation
Mathematics (Dept.)
Mathematics

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 85 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf