Current state-of-the-art visual recognition systems usually rely on the following pipeline: 1) pretraining a neural network on a large-scale data set (e.g., ImageNet) and 2) finetuning the network weights on a smaller, task-specific data set. Such a pipeline assumes that the sole weight adaptation is able to transfer the network capability from one domain to another domain based on a strong assumption that a fixed architecture is appropriate for all domains. However, each domain with a distinct recognition target may need different levels/paths of feature hierarchy, where some neurons may become redundant, and some others are reactivated to form new network structures. In this work, we prove that dynamically adapting network architectures tailored for each domain task along with weight finetuning benefits in both efficiency and effectiveness, compared to the existing image recognition pipeline that only tunes the weights regardless of the architecture. Our method can be easily generalized to an unsupervised paradigm by replacing supernet training with self-supervised learning in the source domain tasks and performing linear evaluation in the downstream tasks. This further improves the search efficiency of our method. Moreover, we also provide principled and empirical analysis to explain why our approach works by investigating the ineffectiveness of existing neural architecture search. We find that preserving the joint distribution of the network architecture and weights is of importance. This analysis not only benefits image recognition but also provides insights for crafting neural networks. Experiments on five representative image recognition tasks, such as person re-identification, age estimation, gender recognition, image classification, and unsupervised domain adaptation, demonstrate the effectiveness of our method.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TNNLS.2021.3070605 | DOI Listing |
Acc Chem Res
January 2025
Molecular Sensing and Imaging Center, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, China.
ConspectusIons are the crucial signaling components for living organisms. In cells, their transportation across pore-forming membrane proteins is vital for regulating physiological functions, such as generating ionic current signals in response to target molecule recognition. This ion transport is affected by confined interactions and local environments within the protein pore.
View Article and Find Full Text PDFAnal Chem
January 2025
State Key Laboratory of Chemical Resource Engineering, College of Chemistry, Beijing University of Chemical Technology, Beijing 100029, China.
The spontaneous aggregation of amyloid-β (Aβ) leads to neuronal cell death in the brain and causes the development of Alzheimer's disease (AD). The efficient detection of the aggregation state of Aβ holds significant promise for the early diagnosis and subsequent treatment of this neurodegenerative disorder. Currently, most of the fluorescent probes used for the detection of Aβ fibrils share similar recognition moieties, such as the ,-dimethylamino group, ,-diethylamino group, and piperidyl group.
View Article and Find Full Text PDFJ Dent Sci
January 2025
Division of Physiology, Department of Health Promotion, Kyushu Dental University, Kitakyushu, Japan.
Background/purpose: OpenAI's GPT-4V and Google's Gemini Pro, being Large Language Models (LLMs) equipped with image recognition capabilities, have the potential to be utilized in future medical diagnosis and treatment, ands serve as valuable educational support tools for students. This study compared and evaluated the image recognition capabilities of GPT-4V and Gemini Pro using questions from the Japanese National Dental Examination (JNDE) to investigate their potential as educational support tools.
Materials And Methods: We analyzed 160 questions from the 116th JNDE, administered in March 2023, using ChatGPT-4V, and Gemini Pro, which have image recognition functions.
Front Plant Sci
January 2025
College of Big Data, Yunnan Agricultural University, Kunming, China.
Introduction: Weeds are a major factor affecting crop yield and quality. Accurate identification and localization of crops and weeds are essential for achieving automated weed management in precision agriculture, especially given the challenges in recognition accuracy and real-time processing in complex field environments. To address this issue, this paper proposes an efficient crop-weed segmentation model based on an improved UNet architecture and attention mechanisms to enhance both recognition accuracy and processing speed.
View Article and Find Full Text PDFLaryngoscope
January 2025
Department of Otolaryngology/Head & Neck Surgery, University of North Carolina School of Medicine, Chapel Hill, North Carolina, U.S.A.
Objectives: Bimodal cochlear implant (CI) users vary in speech recognition outcomes. This variability may be influenced partly by the CI and contralateral hearing aid (HA) programming procedures, which can result in mismatches in latency and frequency. We assessed the performance of bimodal listeners when latency mismatches were corrected and analyzed how frequency mismatches influenced outcomes.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!