Hyper-convolutions via implicit kernels for medical image analysis.

Med Image Anal

School of Electrical and Computer Engineering, Cornell University, Ithaca, NY, USA; Cornell Tech, NYC, NY, USA; Department of Radiology, Weill Cornell Medical School, NYC, NY, USA. Electronic address:

Published: May 2023

The convolutional neural network (CNN) is one of the most commonly used architectures for computer vision tasks. The key building block of a CNN is the convolutional kernel that aggregates information from the pixel neighborhood and shares weights across all pixels. A standard CNN's capacity, and thus its performance, is directly related to the number of learnable kernel weights, which is determined by the number of channels and the kernel size (support). In this paper, we present the hyper-convolution, a novel building block that implicitly encodes the convolutional kernel using spatial coordinates. Unlike a regular convolutional kernel, whose weights are independently learned, hyper-convolution kernel weights are correlated through an encoder that maps spatial coordinates to their corresponding values. Hyper-convolutions decouple kernel size from the total number of learnable parameters, enabling a more flexible architecture design. We demonstrate in our experiments that replacing regular convolutions with hyper-convolutions can improve performance with less parameters, and increase robustness against noise. We provide our code here: https://github.com/tym002/Hyper-Convolution.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.media.2023.102796DOI Listing

Publication Analysis

Top Keywords

convolutional kernel
12
kernel weights
12
building block
8
number learnable
8
kernel size
8
spatial coordinates
8
kernel
7
hyper-convolutions implicit
4
implicit kernels
4
kernels medical
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!