Volltext-Downloads (blau) und Frontdoor-Views (grau)
  • search hit 9 of 566
Back to Result List

Rethinking 1x1 Convolutions: Can we train CNNs with Frozen Random Filters?

  • Modern CNNs are learning the weights of vast numbers of convolutional operators. In this paper, we raise the fundamental question if this is actually necessary. We show that even in the extreme case of only randomly initializing and never updating spatial filters, certain CNN architectures can be trained to surpass the accuracy of standard training. By reinterpreting the notion of pointwiseModern CNNs are learning the weights of vast numbers of convolutional operators. In this paper, we raise the fundamental question if this is actually necessary. We show that even in the extreme case of only randomly initializing and never updating spatial filters, certain CNN architectures can be trained to surpass the accuracy of standard training. By reinterpreting the notion of pointwise ($1\times 1$) convolutions as an operator to learn linear combinations (LC) of frozen (random) spatial filters, we are able to analyze these effects and propose a generic LC convolution block that allows tuning of the linear combination rate. Empirically, we show that this approach not only allows us to reach high test accuracies on CIFAR and ImageNet but also has favorable properties regarding model robustness, generalization, sparsity, and the total number of necessary weights. Additionally, we propose a novel weight sharing mechanism, which allows sharing of a single weight tensor between all spatial convolution layers to massively reduce the number of weights.show moreshow less

Export metadata

Additional Services

Search Google Scholar

Statistics

frontdoor_oas
Metadaten
Document Type:Article (unreviewed)
Zitierlink: https://opus.hs-offenburg.de/8433
Bibliografische Angaben
Title (English):Rethinking 1x1 Convolutions: Can we train CNNs with Frozen Random Filters?
Author:Paul GavrikovStaff MemberORCiDGND, Janis KeuperStaff MemberORCiDGND
Year of Publication:2023
First Page:1
Last Page:17
URL:https://www.researchgate.net/publication/367529908
Language:English
Inhaltliche Informationen
Institutes:Fakultät Elektrotechnik, Medizintechnik und Informatik (EMI) (ab 04/2019)
Forschung / IMLA - Institute for Machine Learning and Analytics
Institutes:Bibliografie
Formale Angaben
Relevance:Keine Relevanz
Open Access: Open Access 
 Diamond 
Licence (German):License LogoCreative Commons - CC BY-SA - Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International
Comment:
Preprint
ArXiv Id:http://arxiv.org/abs/2301.11360