Rethinking 1x1 Convolutions: Can we train CNNs with Frozen Random Filters?

Modern CNNs are learning the weights of vast numbers of convolutional operators. In this paper, we raise the fundamental question if this is actually necessary. We show that even in the extreme case of only randomly initializing and never updating spatial filters, certain CNN architectures can be trained to surpass the accuracy of standard training. By reinterpreting the notion of pointwiseModern CNNs are learning the weights of vast numbers of convolutional operators. In this paper, we raise the fundamental question if this is actually necessary. We show that even in the extreme case of only randomly initializing and never updating spatial filters, certain CNN architectures can be trained to surpass the accuracy of standard training. By reinterpreting the notion of pointwise ($1\times 1$) convolutions as an operator to learn linear combinations (LC) of frozen (random) spatial filters, we are able to analyze these effects and propose a generic LC convolution block that allows tuning of the linear combination rate. Empirically, we show that this approach not only allows us to reach high test accuracies on CIFAR and ImageNet but also has favorable properties regarding model robustness, generalization, sparsity, and the total number of necessary weights. Additionally, we propose a novel weight sharing mechanism, which allows sharing of a single weight tensor between all spatial convolution layers to massively reduce the number of weights.…

Metadaten
Document Type:	Article (unreviewed)
Zitierlink:	https://opus.hs-offenburg.de/8433
Bibliografische Angaben
Title (English):	Rethinking 1x1 Convolutions: Can we train CNNs with Frozen Random Filters?
Author:	Paul Gavrikov Staff Member ORCiD GND, Janis Keuper Staff Member ORCiD GND
Year of Publication:	2023
First Page:	1
Last Page:	17
URL:	https://www.researchgate.net/publication/367529908
Language:	English
Inhaltliche Informationen
Institutes:	Fakultät Elektrotechnik, Medizintechnik und Informatik (EMI) (ab 04/2019)
	Forschung / IMLA - Institute for Machine Learning and Analytics
Institutes:	Bibliografie
Formale Angaben
Relevance:	Keine Relevanz
Open Access:	Open Access
	Diamond
Licence (German):	Creative Commons - CC BY-SA - Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International
Comment:	Preprint
ArXiv Id:	http://arxiv.org/abs/2301.11360

Open Access