We present DANTE, a novel method for training neural networks using the alternating minimization principle. DANTE provides an alternative perspective to the traditional gradient-based backpropagation techniques commonly used to train deep networks. It uses an adaptation of quasi-convexity to cast the training of a neural network as a bi-quasi-convex optimization problem. We show that, for neural network configurations with both differentiable (e.g., sigmoid) and non-differentiable (e.g., ReLU) activation functions, the alternations can be performed effectively in this formulation. DANTE can also be extended to networks with multiple hidden layers. In experiments on standard datasets, neural networks trained using the proposed method were found to be promising and competitive with traditional backpropagation techniques, both in the quality of the solution and in training speed.
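The alternating minimization idea can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the inner update rule, so the normalized gradient step, the rule of accepting only loss-reducing steps, the tiny sigmoid autoencoder, the toy data, and all names (`minimize_block`, `train`, and so on) are assumptions made for illustration. Each round updates the decoder weights with the encoder held fixed, then the encoder weights with the decoder held fixed.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def forward(W1, W2, x):
    """Single hidden layer: h = sigmoid(W1 x), y = sigmoid(W2 h)."""
    h = [sigmoid(sum(w * xi for w, xi in zip(row, x))) for row in W1]
    y = [sigmoid(sum(w * hi for w, hi in zip(row, h))) for row in W2]
    return h, y

def loss(W1, W2, data):
    """Mean squared reconstruction error over the data set."""
    total = 0.0
    for x in data:
        _, y = forward(W1, W2, x)
        total += sum((yi - xi) ** 2 for yi, xi in zip(y, x))
    return total / len(data)

def grads(W1, W2, data):
    """Analytic gradients of the loss with respect to W1 and W2."""
    g1 = [[0.0] * len(W1[0]) for _ in W1]
    g2 = [[0.0] * len(W2[0]) for _ in W2]
    n = len(data)
    for x in data:
        h, y = forward(W1, W2, x)
        d2 = [2.0 * (yi - xi) * yi * (1.0 - yi) / n for yi, xi in zip(y, x)]
        for i, di in enumerate(d2):
            for j, hj in enumerate(h):
                g2[i][j] += di * hj
        d1 = [sum(d2[i] * W2[i][j] for i in range(len(W2))) * h[j] * (1.0 - h[j])
              for j in range(len(h))]
        for j, dj in enumerate(d1):
            for k, xk in enumerate(x):
                g1[j][k] += dj * xk
    return g1, g2

def normalized_step(W, g, step):
    """Move W by a fixed distance `step` along the negative gradient direction."""
    norm = math.sqrt(sum(v * v for row in g for v in row))
    if norm == 0.0:
        return W
    return [[w - step * v / norm for w, v in zip(wr, gr)] for wr, gr in zip(W, g)]

def minimize_block(W1, W2, data, which, inner_steps=20, step=0.5):
    """Update one weight block with the other held fixed (assumed inner solver)."""
    for _ in range(inner_steps):
        g1, g2 = grads(W1, W2, data)
        current = loss(W1, W2, data)
        s = step
        while s > 1e-6:  # halve the step until the loss actually decreases
            if which == "decoder":
                cand1, cand2 = W1, normalized_step(W2, g2, s)
            else:
                cand1, cand2 = normalized_step(W1, g1, s), W2
            if loss(cand1, cand2, data) < current:
                W1, W2 = cand1, cand2
                break
            s *= 0.5
    return W1, W2

def train(data, hidden=2, rounds=10, seed=0):
    """Alternate between the two weight blocks and record the loss per round."""
    rng = random.Random(seed)
    dim = len(data[0])
    W1 = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(hidden)]
    W2 = [[rng.uniform(-0.5, 0.5) for _ in range(hidden)] for _ in range(dim)]
    history = [loss(W1, W2, data)]
    for _ in range(rounds):
        W1, W2 = minimize_block(W1, W2, data, "decoder")  # encoder fixed
        W1, W2 = minimize_block(W1, W2, data, "encoder")  # decoder fixed
        history.append(loss(W1, W2, data))
    return W1, W2, history

# Toy data, purely illustrative.
data = [[0.1, 0.9, 0.1], [0.9, 0.1, 0.9], [0.2, 0.8, 0.2], [0.8, 0.2, 0.8]]
W1, W2, history = train(data)
```

Because a step is kept only when it lowers the loss, the recorded `history` cannot increase from one round to the next. That acceptance rule is a convenience of this sketch, not a property claimed in the abstract.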

Source
http://dx.doi.org/10.1016/j.neunet.2020.07.026

Similar Publications

Enhancing beer authentication, quality, and control assessment using non-invasive spectroscopy through bottle and machine learning modeling.

J Food Sci

January 2025

Digital Agriculture, Food and Wine Research Group, School of Agriculture, Food and Ecosystem Science, Faculty of Science, The University of Melbourne, Melbourne, Victoria, Australia.

Fraud in alcoholic beverages through counterfeiting and adulteration is rising, significantly impacting companies economically. This study aimed to develop a method using near-infrared (NIR) spectroscopy (1596-2396 nm) through the bottle, along with machine learning (ML) modeling for beer authentication, quality traits, and control assessment. For this study, 25 commercial beers from different brands, styles, and three types of fermentation were used.

Predicting few disinfection byproducts in the water distribution systems using machine learning models.

Environ Sci Pollut Res Int

January 2025

Research Engineer I, Applied Research Center for Environment & Marine Studies, Research Institute, King Fahd University of Petroleum & Minerals, 31261, Dhahran, Saudi Arabia.

Concerns regarding disinfection byproducts (DBPs) in drinking water persist, with measurements in water treatment plants (WTPs) being relatively easier than those in water distribution systems (WDSs) due to accessibility challenges, especially during adverse weather conditions. Machine learning (ML) models offer improved predictions of DBPs in WDSs. This study developed multiple ML models to predict Trihalomethanes (THMs), Haloacetic Acids (HAAs), Dichloroacetonitrile (DCAN), and N-nitrosodimethylamine (NDMA) in WDSs using data collected over 13 years (2008-2020) from 113 water supply systems (WSS) in Ontario.

Background: Accurate differentiation between benign and malignant pancreatic lesions is critical for effective patient management. This study aimed to develop and validate a novel deep learning network using baseline computed tomography (CT) images to predict the classification of pancreatic lesions.

Methods: This retrospective study included 864 patients (422 men, 442 women) with confirmed histopathological results across three medical centers, forming a training cohort, internal testing cohort, and external validation cohort.

Semiconductor metal oxide (SMO) gas sensors are gaining prominence owing to their high sensitivity, rapid response, and cost-effectiveness. These sensors detect changes in resistance resulting from oxidation-reduction reactions with target gases, responding to a variety of gases simultaneously. However, their inherent limitations lie in selectivity.

This study aimed to develop and evaluate an efficient method to automatically segment T1- and T2-weighted brain magnetic resonance imaging (MRI) images. We specifically compared the segmentation performance of individual convolutional neural network (CNN) models against an ensemble approach to advance the accuracy of MRI-guided radiotherapy (RT) planning.
