A multivariate adaptive gradient algorithm with reduced tuning efforts.

Samer Saab Khaled Saab Shashi Phoha Minghui Zhu Asok Ray

Neural Netw

Department of Mechanical Engineering and Mathematics, The Pennsylvania State University, State College, PA, 16802, USA. Electronic address:

Published: August 2022

Large neural networks usually perform well for executing machine learning tasks. However, models that achieve state-of-the-art performance involve arbitrarily large number of parameters and therefore their training is very expensive. It is thus desired to implement methods with small per-iteration costs, fast convergence rates, and reduced tuning. This paper proposes a multivariate adaptive gradient descent method that meets the above attributes. The proposed method updates every element of the model parameters separately in a computationally efficient manner using an adaptive vector-form learning rate, resulting in low per-iteration cost. The adaptive learning rate computes the absolute difference of current and previous model parameters over the difference in subgradients of current and previous state estimates. In the deterministic setting, we show that the cost function value converges at a linear rate for smooth and strongly convex cost functions. Whereas in both the deterministic and stochastic setting, we show that the gradient converges in expectation at the order of O(1/k) for a non-convex cost function with Lipschitz continuous gradient. In addition, we show that after T iterates, the cost function of the last iterate scales as O(log(T)/T) for non-smooth strongly convex cost functions. Effectiveness of the proposed method is validated on convex functions, smooth non-convex function, non-smooth convex function, and four image classification data sets, whilst showing that its execution requires hardly any tuning unlike existing popular optimizers that entail relatively large tuning efforts. Our empirical results show that our proposed algorithm provides the best overall performance when comparing it to tuned state-of-the-art optimizers.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.neunet.2022.05.016	DOI Listing

Publication Analysis

Top Keywords

cost function

multivariate adaptive

adaptive gradient

reduced tuning

tuning efforts

proposed method

model parameters

learning rate

current previous

convex cost

Similar Publications

Statistical analysis plan for a pragmatic randomised controlled trial comparing enhanced acceptance and commitment therapy plus ( +) added to usual aftercare versus usual aftercare only, in patients living with or beyond cancer: SUrvivors' Rehabilitation Evaluation after CANcer (SURECAN) trial.

Trials

January 2025

London Centre for Primary Care, Wolfson Institute of Population Health, Queen Mary University of London, London, UK.

Clare Robinson Trudie Chalder Paul McCrone Olivier Quintin Evdoxia Gkaintatzi

Background: The aim of the SURECAN trial is to evaluate a person-centred intervention, based on Acceptance and Commitment Therapy (ACT Plus ( +)), for people who have completed treatment for cancer with curative intent, but are experiencing poor quality of life. We present the statistical analysis plan for assessing the effectiveness and cost-effectiveness of the intervention in improving quality of life 1 year post randomisation.

Methods And Design: SURECAN is a multi-centre, pragmatic, two-arm, partially clustered randomised controlled superiority trial comparing the effectiveness of ACT + added to usual care with usual aftercare.

View Article and Find Full Text PDF

Similar Publications

Maintaining operability at a high personal cost - a mixed method study on maternal healthcare workers' experiences during the COVID-19 pandemic.

BMC Health Serv Res

January 2025

Institute of Health and Care Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden.

Magnus Akerstrom Anna Wessberg Emina Hadžibajramović Sofie Graner Ylva Carlsson

Introduction: The COVID-19 pandemic forced leaders and employees in health care services to take difficult decisions to manage risks associated with employee health and the organizations' functioning. This study aims to identify the changes in employee working routines, job demands, and job resources within Swedish maternal healthcare during the COVID-19 pandemic, and how these changes affected workload and health.

Methods: Data were derived from the longitudinal COPE Staff study involving midwives and physicians within maternal healthcare.

View Article and Find Full Text PDF

Similar Publications

Unsupervised detection of novel SARS-CoV-2 mutations and lineages in wastewater samples using long-read sequencing.

BMC Genomics

January 2025

Department of Virology, Norwegian Institute of Public Health, Oslo, 0456, Norway.

Ignacio Garcia Rasmus K Riis Line V Moen Andreas Rohringer Elisabeth H Madslien

The COVID-19 pandemic has underscored the importance of virus surveillance in public health and wastewater-based epidemiology (WBE) has emerged as a non-invasive, cost-effective method for monitoring SARS-CoV-2 and its variants at the community level. Unfortunately, current variant surveillance methods depend heavily on updated genomic databases with data derived from clinical samples, which can become less sensitive and representative as clinical testing and sequencing efforts decline.In this paper, we introduce HERCULES (High-throughput Epidemiological Reconstruction and Clustering for Uncovering Lineages from Environmental SARS-CoV-2), an unsupervised method that uses long-read sequencing of a single 1 Kb fragment of the Spike gene.

View Article and Find Full Text PDF

Similar Publications

Fabrication of heteroatom-doped graphene-porous organic polymer hybrid materials via femtosecond laser writing and their application in VOCs sensing.

Sci Rep

January 2025

Materials Science and Engineering Program, College of Arts and Sciences, American University of Sharjah, POB 26666, Sharjah, United Arab Emirates.

Rasha Diab Ganjaboy Boltaev Mahmoud M Kaid Ahmad Fawad Hani M El-Kaderi

Graphene, a two-dimensional material featuring densely packed sp-hybridized carbon atoms arranged in a honeycomb lattice, has revolutionized material science. Laser-induced graphene (LIG) represents a breakthrough method for producing graphene from both commercial and natural precursors via direct laser writing, offering advantages such as simplicity, efficiency, and cost-effectiveness. This study demonstrates a novel approach to synthesize a composite material exclusively from a porous organic polymer (POP) by direct femtosecond laser writing on a compressed imide-linked porous organic polymer substrate.

View Article and Find Full Text PDF

Similar Publications

Ecofriendly remediation of cadmium, lead, and zinc using dead cells of Microcystis aeruginosa.

Sci Rep

January 2025

Hydrobiology Lab, National Institute of Oceanography and Fisheries (NIOF), Cairo, Egypt.

Mohamad S Abdelkarim Mohamed H H Ali Doaa A Kassem

The utilization of cyanobacteria toxin-producing blooms for metal ions adsorption has garnered significant attention over the last decade. This study investigates the efficacy of dead cells from Microcystis aeruginosa blooms, collected from agricultural drainage water reservoir, in removing of cadmium, lead, and zinc ions from aqueous solutions, and simultaneously addressing the mitigation of toxin-producing M. aeruginosa bloom.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!