Inconsistency among evaluation metrics in link prediction.

PNAS Nexus

CompleX Lab, School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China.

Published: November 2024

Link prediction is a paradigmatic and challenging problem in network science, which aims to predict missing links, future links, and temporal links based on known topology. Along with the increasing number of link prediction algorithms, a critical yet previously ignored risk is that the evaluation metrics for algorithm performance are usually chosen at will. This paper implements extensive experiments on hundreds of real networks and 26 well-known algorithms, revealing significant inconsistency among evaluation metrics, namely different metrics probably produce remarkably different rankings of algorithms. Therefore, we conclude that any single metric cannot comprehensively or credibly evaluate algorithm performance. In terms of information content, we suggest the usage of at least two metrics: one is the area under the receiver operating characteristic curve, and the other is one of the following three candidates, say the area under the precision-recall curve, the area under the precision curve, and the normalized discounted cumulative gain. When the data are imbalanced, say the number of negative samples significantly outweighs the number of positive samples, the area under the generalized Receiver Operating Characteristic curve should also be used. In addition, as we have proved the essential equivalence of threshold-dependent metrics, if in a link prediction task, some specific thresholds are meaningful, we can consider any one threshold-dependent metric with those thresholds. This work completes a missing part in the landscape of link prediction, and provides a starting point toward a well-accepted criterion or standard to select proper evaluation metrics for link prediction.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11574622PMC
http://dx.doi.org/10.1093/pnasnexus/pgae498DOI Listing

Publication Analysis

Top Keywords

link prediction
24
evaluation metrics
16
metrics link
12
inconsistency evaluation
8
algorithm performance
8
receiver operating
8
operating characteristic
8
characteristic curve
8
metrics
7
link
6

Similar Publications

Hypertension is a critical risk factor and cause of mortality in cardiovascular diseases, and it remains a global public health issue. Therefore, understanding its mechanisms is essential for treating and preventing hypertension. Gene expression data is an important source for obtaining hypertension biomarkers.

View Article and Find Full Text PDF

Background: Sepsis and acute respiratory distress syndrome (ARDS) are common inflammatory conditions in intensive care, with ARDS significantly increasing mortality in septic patients. PANoptosis, a newly discovered form of programmed cell death involving multiple cell death pathways, plays a critical role in inflammatory diseases. This study aims to elucidate the PANoptosis-related genes (PRGs) and their involvement in the progression of sepsis to ARDS.

View Article and Find Full Text PDF

Deep learning analyses of splicing variants identify the link of PCP4 with amyotrophic lateral sclerosis.

Brain

January 2025

State Key Laboratory of Cardiology and Medical Innovation Center, Shanghai East Hospital, Clinical Center for Brain and Spinal Cord Research, School of Medicine, Tongji University, 200331, Shanghai, China.

Amyotrophic lateral sclerosis (ALS) is a severe motor neuron disease, with most sporadic cases lacking clear genetic causes. Abnormal pre-mRNA splicing is a fundamental mechanism in neurodegenerative diseases. For example, TAR DNA-binding protein 43 (TDP-43) loss-of-function (LOF) causes widespread RNA mis-splicing events in ALS.

View Article and Find Full Text PDF

The Role of Artificial Intelligence and Emerging Technologies in Advancing Total Hip Arthroplasty.

J Pers Med

January 2025

Sezione di Chirurgia Protesica ad Indirizzo Robotico-Unità di Traumatologia dello Sport, Ortopedia e Traumatologia, Fondazione Poliambulanza, 25124 Brescia, Italy.

Total hip arthroplasty (THA) is a widely performed surgical procedure that has evolved significantly due to advancements in artificial intelligence (AI) and robotics. As demand for THA grows, reliable tools are essential to enhance diagnosis, preoperative planning, surgical precision, and postoperative rehabilitation. AI applications in orthopedic surgery offer innovative solutions, including automated hip osteoarthritis (OA) diagnosis, precise implant positioning, and personalized risk stratification, thereby improving patient outcomes.

View Article and Find Full Text PDF

Diabetic nephropathy (DN) affects about one-third of patients with diabetes and can lead to end-stage renal disease despite numerous trials aimed at improving diabetic management. Non-coding RNAs (ncRNAs) represent a new frontier in DN research, as increasing evidence suggests their involvement in the occurrence and progression of DN. A growing body of evidence suggests that long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) in DN signaling pathways might serve as novel biomarkers or therapeutic targets, although this remains to be fully explored.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!