Understanding how feature learning affects generalization is among the foremost goals of modern deep learning theory. Here, we study how the ability to learn representations affects the generalization performance of a simple class of models: deep Bayesian linear neural networks trained on unstructured Gaussian data. By comparing deep random feature models to deep networks in which all layers are trained, we provide a detailed characterization of the interplay between width, depth, data density, and prior mismatch. We show that both models display samplewise double-descent behavior in the presence of label noise. Random feature models can also display modelwise double descent if there are narrow bottleneck layers, while deep networks do not show these divergences. Random feature models can have particular widths that are optimal for generalization at a given data density, while making neural networks as wide or as narrow as possible is always optimal. Moreover, we show that the leading-order correction to the kernel-limit learning curve cannot distinguish between random feature models and deep networks in which all layers are trained. Taken together, our findings begin to elucidate how architectural details affect generalization performance in this simple class of deep regression models.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1103/PhysRevE.105.064118 | DOI Listing |
Trials
January 2025
Internal Medicine (Rheumatology), Academic Hospital, Istanbul, Turkey.
Background: It was our impression that safety outcome trials were getting more frequent, raising ethical issues mainly related to patient autonomy. We and others had also proposed this autonomy would be best served if wording of the informed consents would be in the public domain.
Methods: Initially two observers and an arbiter tabulated the main aims of randomized controlled trials (RCTs) published in 1990-1991 vs.
Sci Rep
January 2025
Research Unit of Health Sciences and Technology, University of Oulu, Oulu, Finland.
Optical techniques, such as functional near-infrared spectroscopy (fNIRS), contain high potential for the development of non-invasive wearable systems for evaluating cerebral vascular condition in aging, due to their portability and ability to monitor real-time changes in cerebral hemodynamics. In this study, thirty-six healthy adults were measured by single channel fNIRS to explore differences between two age groups using machine learning (ML). The subjects, measured during functional magnetic resonance imaging (fMRI) at Oulu University Hospital, were divided into young (age ≤ 32) and elderly (age ≥ 57) groups.
View Article and Find Full Text PDFSci Rep
January 2025
Faculty of Engineering, Université de Moncton, Moncton, NB, E1A3E9, Canada.
Diabetes is a growing health concern in developing countries, causing considerable mortality rates. While machine learning (ML) approaches have been widely used to improve early detection and treatment, several studies have shown low classification accuracies due to overfitting, underfitting, and data noise. This research employs parallel and sequential ensemble ML approaches paired with feature selection techniques to boost classification accuracy.
View Article and Find Full Text PDFJ Shoulder Elbow Surg
January 2025
Department of Orthopedics and Trauma, Peking University People's Hospital, Beijing 100044, China; Key Laboratory of Trauma and Neural Regeneration (Peking University), Ministry of Education, Beijing 100044, China; National Center for Trauma Medicine, Peking University People's Hospital, Beijing 100044, China. Electronic address:
Objective: The bare area is defined as a transverse region within the trochlear notch, serving as an optimal entry point for olecranon osteotomy due to the absence of articular cartilage coverage. However, there is limited research on the morphology and location of the bare area, and there is a lack of intuitive visual description. Thus, the purpose of this study is to delineate anatomical features of the bare area and visualize its morphology and refine the olecranon osteotomy approach.
View Article and Find Full Text PDFInt J Med Inform
January 2025
Oulu Advanced Research on Service and Information Systems, University of Oulu, Linnanmaa campus, Pentti Kaiteran katu 1 90570 Oulu, Finland. Electronic address:
Background: Studies have demonstrated that interventions targeting weight loss and body mass index (BMI) reduction can be successful, although the specific factors that influence their effectiveness are still unclear. Behavior change support systems (BCSS) are an approach that aims to help users in their efforts to modify their behavior. A useful tool for assessing BCSS is the Persuasive Systems Design model (PSD), where different features and postulates can be employed.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!