Boosting-GNN: Boosting Algorithm for Graph Networks on Imbalanced Node Classification.

Shuhao Shi, Kai Qiao, Shuai Yang, Linyuan Wang, Jian Chen, Bin Yan

Front Neurorobot

Henan Key Laboratory of Imaging and Intelligence Processing, PLA Strategy Support Force Information Engineering University, Zhengzhou, China.

Published: November 2021

Graph neural networks (GNNs) have been widely used for graph data representation. However, existing research mostly assumes an ideal balanced dataset and rarely considers imbalanced ones. Traditional methods for handling imbalanced datasets, such as resampling, reweighting, and synthetic sample generation, are no longer applicable to GNNs. This study proposes an ensemble model called Boosting-GNN, which uses GNNs as the base classifiers during boosting. In Boosting-GNN, training samples that were misclassified by the previous classifiers are assigned higher weights, which yields higher classification accuracy and better reliability. In addition, transfer learning is used to reduce computational cost and increase fitting ability. Experimental results indicate that the proposed Boosting-GNN model outperforms graph convolutional network (GCN), GraphSAGE, graph attention network (GAT), simplifying graph convolutional networks (SGC), multi-scale graph convolution networks (N-GCN), and the most advanced reweighting and resampling methods on synthetic imbalanced datasets, with an average performance improvement of 4.5%.
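
The reweighting idea described in the abstract can be illustrated with one round of a multi-class AdaBoost update. This is a minimal sketch, not the authors' implementation: the abstract does not state the exact update rule, so the SAMME variant is assumed here, and the function name `samme_round` and the toy labels are hypothetical. The base classifier is left abstract; only its predictions are needed.

```python
import numpy as np

def samme_round(weights, y_true, y_pred, n_classes):
    """One boosting round (SAMME variant, assumed; not taken from the paper).

    Returns the classifier's vote weight `alpha` and the renormalized
    sample weights, with misclassified samples up-weighted.
    """
    miss = (y_pred != y_true)                      # True where the classifier was wrong
    err = np.sum(weights * miss) / np.sum(weights)  # weighted error rate
    err = np.clip(err, 1e-10, 1 - 1e-10)            # avoid log(0) and division by zero
    alpha = np.log((1 - err) / err) + np.log(n_classes - 1)
    new_w = weights * np.exp(alpha * miss)          # only misclassified samples grow
    return alpha, new_w / new_w.sum()

# Toy imbalanced example: three majority-class nodes, one minority-class node.
y_true = np.array([0, 0, 0, 1])
y_pred = np.array([0, 0, 0, 0])   # a classifier that ignores the minority class
weights = np.full(4, 0.25)        # uniform initial weights
alpha, new_w = samme_round(weights, y_true, y_pred, n_classes=2)
print(alpha, new_w)
```

In this toy case the single misclassified minority node ends up carrying half of the total weight, so the next base classifier is pushed to fit it. In a GNN setting, these weights would presumably scale the per-node training loss, though that detail is an assumption here as well.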

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8655128
DOI: http://dx.doi.org/10.3389/fnbot.2021.775688

Similar Publications

Adaptive Tip Selection for DAG-Shard-Based Federated Learning with High Concurrency and Fairness.

Sensors (Basel)

December 2024

School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China.

Ruiqi Xiao, Yun Cao, Bin Xia

To cope with the challenges posed by high-concurrency training tasks involving large models and big data, directed acyclic graphs (DAGs) and sharding have been proposed as alternatives to blockchain-based federated learning, with the aim of enhancing training concurrency. However, there is insufficient research on specific consensus designs and on the effects of varying shard sizes on federated learning. In this paper, we combine DAG and sharding by designing three tip selection consensus algorithms, and we propose an adaptive algorithm to improve training performance.

A hybrid unsupervised machine learning model with spectral clustering and semi-supervised support vector machine for credit risk assessment.

PLoS One

January 2025

College of Business, Southern University of Science and Technology, Shenzhen, China.

Tao Yu, Wei Huang, Xin Tang, Duosi Zheng

In credit risk assessment, unsupervised classification techniques can be introduced to reduce human resource expenses and expedite decision-making. Despite the efficacy of unsupervised learning methods in handling unlabeled datasets, their performance remains limited owing to challenges such as imbalanced data, local optima, and parameter adjustment complexities. Thus, this paper introduces a novel hybrid unsupervised classification method, named the two-stage hybrid system with spectral clustering and semi-supervised support vector machine (TSC-SVM), which effectively addresses the unsupervised imbalance problem in credit risk assessment by targeting global optimal solutions.

SProtFP: a machine learning-based method for functional classification of small ORFs in prokaryotes.

NAR Genom Bioinform

March 2025

National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi 110067, India.

Akshay Khanduja, Debasisa Mohanty

Small proteins (≤100 amino acids) play important roles across all life forms, ranging from unicellular bacteria to higher organisms. In this study, we have developed SProtFP, a machine learning-based method for functional annotation of prokaryotic small proteins into selected functional categories. SProtFP uses independent artificial neural networks (ANNs), trained on a combination of physicochemical descriptors, to classify small proteins into antitoxin type 2, bacteriocin, DNA-binding, metal-binding, ribosomal protein, RNA-binding, type 1 toxin, and type 2 toxin proteins.

GQEO: Nearest neighbor graph-based generalized quadrilateral element oversampling for class-imbalance problem.

Neural Netw

December 2024

College of Science, North China University of Science and Technology, Tangshan, 063210, China.

Qi Dai, Longhui Wang, Jing Zhang, Weiping Ding, Lifang Chen

The class imbalance problem is one of the main factors that degrade the performance of traditional classifiers. Oversampling techniques are the most common way to address it. They alleviate the impact of class imbalance on traditional machine learning by augmenting the feature representation of minority instances.

Unsupervised Learning for Machinery Adaptive Fault Detection Using Wide-Deep Convolutional Autoencoder with Kernelized Attention Mechanism.

Sensors (Basel)

December 2024

State Key Laboratory of Digital Manufacturing Equipment and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.

Hao Yan, Xiangfeng Si, Jianqiang Liang, Jian Duan, Tielin Shi

Applying deep learning to unsupervised bearing fault diagnosis in complex industrial environments is challenging. Traditional fault detection methods rely on labeled data, which is costly and labor-intensive to obtain. This paper proposes a novel unsupervised approach, WDCAE-LKA, which combines a wide kernel convolutional autoencoder (WDCAE) with a large kernel attention (LKA) mechanism to improve fault detection under unlabeled conditions. An adaptive threshold module based on a multi-layer perceptron (MLP) dynamically adjusts thresholds, boosting model robustness in imbalanced scenarios.
