Towards a Deeper Understanding of Global Covariance Pooling in Deep Learning: An Optimization Perspective.

Qilong Wang Zhaolin Zhang Mingze Gao Jiangtao Xie Pengfei Zhu Peihua Li Wangmeng Zuo Qinghua Hu

IEEE Trans Pattern Anal Mach Intell

Published: December 2023

Global covariance pooling (GCP) as an effective alternative to global average pooling has shown good capacity to improve deep convolutional neural networks (CNNs) in a variety of vision tasks. Although promising performance, it is still an open problem on how GCP (especially its post-normalization) works in deep learning. In this paper, we make the effort towards understanding the effect of GCP on deep learning from an optimization perspective. Specifically, we first analyze behavior of GCP with matrix power normalization on optimization loss and gradient computation of deep architectures. Our findings show that GCP can improve Lipschitzness of optimization loss and achieve flatter local minima, while improving gradient predictiveness and functioning as a special pre-conditioner on gradients. Then, we explore the effect of post-normalization on GCP from the model optimization perspective, which encourages us to propose a simple yet effective normalization, namely DropCov. Based on above findings, we point out several merits of deep GCP that have not been recognized previously or fully explored, including faster convergence, stronger model robustness and better generalization across tasks. Extensive experimental results using both CNNs and vision transformers on diversified vision tasks provide strong support to our findings while verifying the effectiveness of our method.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TPAMI.2023.3321392	DOI Listing

Publication Analysis

Top Keywords

deep learning

optimization perspective

global covariance

covariance pooling

learning optimization

vision tasks

optimization loss

gcp

deep

optimization

Similar Publications

Analysis of stomatal characteristics of maize hybrids and their parental inbred lines during critical reproductive periods.

Front Plant Sci

January 2025

Beijing Key Lab of Digital Plant, Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China.

Changyu Zhang Yu Jin Jinglu Wang Ying Zhang Yanxin Zhao

The stomatal phenotype is a crucial microscopic characteristic of the leaf surface, and modulating the stomata of maize leaves can enhance photosynthetic carbon assimilation and water use efficiency, thereby playing a vital role in maize yield formation. The evolving imaging and image processing technologies offer effective tools for precise analysis of stomatal phenotypes. This study employed Jingnongke 728 and its parental inbred to capture stomatal images from various leaf positions and abaxial surfaces during key reproductive stages using rapid scanning electron microscopy.

View Article and Find Full Text PDF

Similar Publications

DFMA: an improved DeepLabv3+ based on FasterNet, multi-receptive field, and attention mechanism for high-throughput phenotyping of seedlings.

Front Plant Sci

January 2025

Institute of Crop Science, Huzhou Academy of Agriculture Sciences, Huzhou, China.

Liangquan Jia Tao Wang Xiangge Li Lu Gao Qiangguo Yu

With the rapid advancement of plant phenotyping research, understanding plant genetic information and growth trends has become crucial. Measuring seedling length is a key criterion for assessing seed viability, but traditional ruler-based methods are time-consuming and labor-intensive. To address these limitations, we propose an efficient deep learning approach to enhance plant seedling phenotyping analysis.

View Article and Find Full Text PDF

Similar Publications

SSATNet: Spectral-spatial attention transformer for hyperspectral corn image classification.

Front Plant Sci

January 2025

School of Information Engineering, Henan Institute of Science and Technology, Xinxiang, China.

Bin Wang Gongchao Chen Juan Wen Linfang Li Songlin Jin

Hyperspectral images are rich in spectral and spatial information, providing a detailed and comprehensive description of objects, which makes hyperspectral image analysis technology essential in intelligent agriculture. With various corn seed varieties exhibiting significant internal structural differences, accurate classification is crucial for planting, monitoring, and consumption. However, due to the large volume and complex features of hyperspectral corn image data, existing methods often fall short in feature extraction and utilization, leading to low classification accuracy.

View Article and Find Full Text PDF

Similar Publications

A CONVEX COMPRESSIBILITY-INSPIRED UNSUPERVISED LOSS FUNCTION FOR PHYSICS-DRIVEN DEEP LEARNING RECONSTRUCTION.

Proc IEEE Int Symp Biomed Imaging

May 2024

Electrical and Computer Engineering, University of Minnesota, Minneapolis, Minnesota, USA.

Yaşar Utku Alçalar Merve Gülle Mehmet Akçakaya

Physics-driven deep learning (PD-DL) methods have gained popularity for improved reconstruction of fast MRI scans. Though supervised learning has been used in early works, there has been a recent interest in unsupervised learning methods for training PD-DL. In this work, we take inspiration from statistical image processing and compressed sensing (CS), and propose a novel convex loss function as an alternative learning strategy.

View Article and Find Full Text PDF

Similar Publications

AI-driven multi-omics integration for multi-scale predictive modeling of genotype-environment-phenotype relationships.

Comput Struct Biotechnol J

January 2025

Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, NY, USA.

You Wu Lei Xie

Despite the wealth of single-cell multi-omics data, it remains challenging to predict the consequences of novel genetic and chemical perturbations in the human body. It requires knowledge of molecular interactions at all biological levels, encompassing disease models and humans. Current machine learning methods primarily establish statistical correlations between genotypes and phenotypes but struggle to identify physiologically significant causal factors, limiting their predictive power.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!