Global covariance pooling (GCP) as an effective alternative to global average pooling has shown good capacity to improve deep convolutional neural networks (CNNs) in a variety of vision tasks. Although promising performance, it is still an open problem on how GCP (especially its post-normalization) works in deep learning. In this paper, we make the effort towards understanding the effect of GCP on deep learning from an optimization perspective. Specifically, we first analyze behavior of GCP with matrix power normalization on optimization loss and gradient computation of deep architectures. Our findings show that GCP can improve Lipschitzness of optimization loss and achieve flatter local minima, while improving gradient predictiveness and functioning as a special pre-conditioner on gradients. Then, we explore the effect of post-normalization on GCP from the model optimization perspective, which encourages us to propose a simple yet effective normalization, namely DropCov. Based on above findings, we point out several merits of deep GCP that have not been recognized previously or fully explored, including faster convergence, stronger model robustness and better generalization across tasks. Extensive experimental results using both CNNs and vision transformers on diversified vision tasks provide strong support to our findings while verifying the effectiveness of our method.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TPAMI.2023.3321392 | DOI Listing |
Front Plant Sci
January 2025
Beijing Key Lab of Digital Plant, Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China.
The stomatal phenotype is a crucial microscopic characteristic of the leaf surface, and modulating the stomata of maize leaves can enhance photosynthetic carbon assimilation and water use efficiency, thereby playing a vital role in maize yield formation. The evolving imaging and image processing technologies offer effective tools for precise analysis of stomatal phenotypes. This study employed Jingnongke 728 and its parental inbred to capture stomatal images from various leaf positions and abaxial surfaces during key reproductive stages using rapid scanning electron microscopy.
View Article and Find Full Text PDFFront Plant Sci
January 2025
Institute of Crop Science, Huzhou Academy of Agriculture Sciences, Huzhou, China.
With the rapid advancement of plant phenotyping research, understanding plant genetic information and growth trends has become crucial. Measuring seedling length is a key criterion for assessing seed viability, but traditional ruler-based methods are time-consuming and labor-intensive. To address these limitations, we propose an efficient deep learning approach to enhance plant seedling phenotyping analysis.
View Article and Find Full Text PDFFront Plant Sci
January 2025
School of Information Engineering, Henan Institute of Science and Technology, Xinxiang, China.
Hyperspectral images are rich in spectral and spatial information, providing a detailed and comprehensive description of objects, which makes hyperspectral image analysis technology essential in intelligent agriculture. With various corn seed varieties exhibiting significant internal structural differences, accurate classification is crucial for planting, monitoring, and consumption. However, due to the large volume and complex features of hyperspectral corn image data, existing methods often fall short in feature extraction and utilization, leading to low classification accuracy.
View Article and Find Full Text PDFProc IEEE Int Symp Biomed Imaging
May 2024
Electrical and Computer Engineering, University of Minnesota, Minneapolis, Minnesota, USA.
Physics-driven deep learning (PD-DL) methods have gained popularity for improved reconstruction of fast MRI scans. Though supervised learning has been used in early works, there has been a recent interest in unsupervised learning methods for training PD-DL. In this work, we take inspiration from statistical image processing and compressed sensing (CS), and propose a novel convex loss function as an alternative learning strategy.
View Article and Find Full Text PDFComput Struct Biotechnol J
January 2025
Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, NY, USA.
Despite the wealth of single-cell multi-omics data, it remains challenging to predict the consequences of novel genetic and chemical perturbations in the human body. It requires knowledge of molecular interactions at all biological levels, encompassing disease models and humans. Current machine learning methods primarily establish statistical correlations between genotypes and phenotypes but struggle to identify physiologically significant causal factors, limiting their predictive power.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!