Global covariance pooling (GCP) as an effective alternative to global average pooling has shown good capacity to improve deep convolutional neural networks (CNNs) in a variety of vision tasks. Although promising performance, it is still an open problem on how GCP (especially its post-normalization) works in deep learning. In this paper, we make the effort towards understanding the effect of GCP on deep learning from an optimization perspective. Specifically, we first analyze behavior of GCP with matrix power normalization on optimization loss and gradient computation of deep architectures. Our findings show that GCP can improve Lipschitzness of optimization loss and achieve flatter local minima, while improving gradient predictiveness and functioning as a special pre-conditioner on gradients. Then, we explore the effect of post-normalization on GCP from the model optimization perspective, which encourages us to propose a simple yet effective normalization, namely DropCov. Based on above findings, we point out several merits of deep GCP that have not been recognized previously or fully explored, including faster convergence, stronger model robustness and better generalization across tasks. Extensive experimental results using both CNNs and vision transformers on diversified vision tasks provide strong support to our findings while verifying the effectiveness of our method.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2023.3321392DOI Listing

Publication Analysis

Top Keywords

deep learning
12
optimization perspective
12
global covariance
8
covariance pooling
8
learning optimization
8
vision tasks
8
optimization loss
8
gcp
7
deep
6
optimization
5

Similar Publications

The stomatal phenotype is a crucial microscopic characteristic of the leaf surface, and modulating the stomata of maize leaves can enhance photosynthetic carbon assimilation and water use efficiency, thereby playing a vital role in maize yield formation. The evolving imaging and image processing technologies offer effective tools for precise analysis of stomatal phenotypes. This study employed Jingnongke 728 and its parental inbred to capture stomatal images from various leaf positions and abaxial surfaces during key reproductive stages using rapid scanning electron microscopy.

View Article and Find Full Text PDF

With the rapid advancement of plant phenotyping research, understanding plant genetic information and growth trends has become crucial. Measuring seedling length is a key criterion for assessing seed viability, but traditional ruler-based methods are time-consuming and labor-intensive. To address these limitations, we propose an efficient deep learning approach to enhance plant seedling phenotyping analysis.

View Article and Find Full Text PDF

Hyperspectral images are rich in spectral and spatial information, providing a detailed and comprehensive description of objects, which makes hyperspectral image analysis technology essential in intelligent agriculture. With various corn seed varieties exhibiting significant internal structural differences, accurate classification is crucial for planting, monitoring, and consumption. However, due to the large volume and complex features of hyperspectral corn image data, existing methods often fall short in feature extraction and utilization, leading to low classification accuracy.

View Article and Find Full Text PDF

Physics-driven deep learning (PD-DL) methods have gained popularity for improved reconstruction of fast MRI scans. Though supervised learning has been used in early works, there has been a recent interest in unsupervised learning methods for training PD-DL. In this work, we take inspiration from statistical image processing and compressed sensing (CS), and propose a novel convex loss function as an alternative learning strategy.

View Article and Find Full Text PDF

AI-driven multi-omics integration for multi-scale predictive modeling of genotype-environment-phenotype relationships.

Comput Struct Biotechnol J

January 2025

Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, NY, USA.

Despite the wealth of single-cell multi-omics data, it remains challenging to predict the consequences of novel genetic and chemical perturbations in the human body. It requires knowledge of molecular interactions at all biological levels, encompassing disease models and humans. Current machine learning methods primarily establish statistical correlations between genotypes and phenotypes but struggle to identify physiologically significant causal factors, limiting their predictive power.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!