The Power of Linear Combinations: Learning with Random Convolutions

Gavrikov, Paul; Keuper, Janis

doi:10.48550/arXiv.2301.11360

search hit 14 of 566

Back to Result List

The Power of Linear Combinations: Learning with Random Convolutions

Paul Gavrikov, Janis Keuper

Following the traditional paradigm of convolutional neural networks (CNNs), modern CNNs manage to keep pace with more recent, for example transformer-based, models by not only increasing model depth and width but also the kernel size. This results in large amounts of learnable model parameters that need to be handled during training. While following the convolutional paradigm with the accordingFollowing the traditional paradigm of convolutional neural networks (CNNs), modern CNNs manage to keep pace with more recent, for example transformer-based, models by not only increasing model depth and width but also the kernel size. This results in large amounts of learnable model parameters that need to be handled during training. While following the convolutional paradigm with the according spatial inductive bias, we question the significance of \emph{learned} convolution filters. In fact, our findings demonstrate that many contemporary CNN architectures can achieve high test accuracies without ever updating randomly initialized (spatial) convolution filters. Instead, simple linear combinations (implemented through efficient 1×1 convolutions) suffice to effectively recombine even random filters into expressive network operators. Furthermore, these combinations of random filters can implicitly regularize the resulting operations, mitigating overfitting and enhancing overall performance and robustness. Conversely, retaining the ability to learn filter updates can impair network performance. Lastly, although we only observe relatively small gains from learning 3×3 convolutions, the learning gains increase proportionally with kernel size, owing to the non-idealities of the independent and identically distributed (\textit{i.i.d.}) nature of default initialization techniques.…

Metadaten
Document Type:	Article (unreviewed)
Zitierlink:	https://opus.hs-offenburg.de/8404
Bibliografische Angaben
Title (English):	The Power of Linear Combinations: Learning with Random Convolutions
Author:	Paul Gavrikov Staff Member ORCiD GND, Janis Keuper Staff Member ORCiD GND
Year of Publication:	2023
Page Number:	14, 6
DOI:	https://doi.org/10.48550/arXiv.2301.11360
Language:	English
Inhaltliche Informationen
Institutes:	Fakultät Elektrotechnik, Medizintechnik und Informatik (EMI) (ab 04/2019)
	Forschung / IMLA - Institute for Machine Learning and Analytics
Institutes:	Bibliografie
Formale Angaben
Relevance:	Keine Relevanz
Open Access:	Open Access
	Diamond
Licence (German):	Creative Commons - CC BY-SA - Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International
Comment:	Preprint
ArXiv Id:	http://arxiv.org/abs/2301.11360

Open Access

The Power of Linear Combinations: Learning with Random Convolutions

Export metadata

Additional Services

Statistics