Background: Strategies to improve the selection of appropriate target journals may reduce delays in disseminating research results. Machine learning is increasingly used in content-based recommender algorithms to guide journal submissions for academic articles.

Objective: We sought to evaluate the performance of open-source artificial intelligence in predicting the impact factor or Eigenfactor score tertile of the accepting journal from academic article abstracts.

Methods: PubMed-indexed articles published between 2016 and 2021 were identified with the Medical Subject Headings (MeSH) terms "ophthalmology," "radiology," and "neurology." Journals, titles, abstracts, author lists, and MeSH terms were collected. Journal impact factor and Eigenfactor scores were sourced from the 2020 Clarivate Journal Citation Report. The journals included in the study were allocated percentile ranks based on impact factor and Eigenfactor scores, compared with other journals that released publications in the same year. All abstracts were preprocessed, which included the removal of the abstract structure, and combined with titles, authors, and MeSH terms as a single input. The input data underwent preprocessing with the inbuilt ktrain Bidirectional Encoder Representations from Transformers (BERT) preprocessing library before analysis with BERT. Before use for logistic regression and XGBoost models, the input data underwent punctuation removal, negation detection, stemming, and conversion into a term frequency-inverse document frequency array. Following this preprocessing, data were randomly split into training and testing data sets with a 3:1 train:test ratio. Models were developed to predict whether a given article would be published in a first, second, or third tertile journal (0-33rd centile, 34th-66th centile, or 67th-100th centile), as ranked either by impact factor or Eigenfactor score. BERT, XGBoost, and logistic regression models were developed on the training data set before evaluation on the hold-out test data set. The primary outcome was overall classification accuracy for the best-performing model in the prediction of accepting journal impact factor tertile.
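The labelling and splitting steps described in the Methods can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the function names `assign_tertiles` and `split_3_to_1`, the tie-handling rule, the fixed seed, and all journal names and scores are assumptions made for the example.

```python
import random
from bisect import bisect_right


def assign_tertiles(scores):
    """Map each journal to tertile 1, 2, or 3 by its rank among all journals.

    Assumption: a journal's rank is the number of journals whose score is
    less than or equal to its own, so tied journals share a tertile.
    """
    ordered = sorted(scores.values())
    n = len(ordered)
    tertiles = {}
    for journal, score in scores.items():
        rank = bisect_right(ordered, score)          # value in 1..n
        tertiles[journal] = (3 * rank + n - 1) // n  # integer ceil(3 * rank / n)
    return tertiles


def split_3_to_1(items, seed=0):
    """Shuffle items and return (train, test) with roughly a 3:1 ratio."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_test = len(shuffled) // 4
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical impact factors for six made-up journals.
impact_factor = {"A": 0.5, "B": 1.2, "C": 2.1, "D": 3.4, "E": 5.0, "F": 9.8}
tertile_by_journal = assign_tertiles(impact_factor)

# Hypothetical (article_id, journal) pairs; each article inherits
# the tertile of the journal that published it.
articles = [(i, "ABCDEF"[i % 6]) for i in range(12)]
labelled = [(article_id, tertile_by_journal[j]) for article_id, j in articles]
train, test = split_3_to_1(labelled)
```

The same labelling would be repeated with Eigenfactor scores in place of impact factors to obtain the second prediction target.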

Results: There were 10,813 articles from 382 unique journals. The median impact factor and Eigenfactor score were 2.117 (IQR 1.102-2.622) and 0.00247 (IQR 0.00105-0.03), respectively. The BERT model achieved the highest impact factor tertile classification accuracy of 75.0%, followed by an accuracy of 71.6% for XGBoost and 65.4% for logistic regression. Similarly, BERT achieved the highest Eigenfactor score tertile classification accuracy of 73.6%, followed by an accuracy of 71.8% for XGBoost and 65.3% for logistic regression.
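Overall classification accuracy, the primary outcome reported above, is the fraction of test articles whose predicted tertile matches the actual tertile. A minimal sketch, with a hypothetical helper name and made-up labels that are not taken from the study:

```python
def classification_accuracy(y_true, y_pred):
    """Return the share of predictions that equal the true label."""
    if len(y_true) != len(y_pred):
        raise ValueError("label lists must have the same length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Hypothetical true and predicted tertiles for eight test articles.
true_tertiles = [1, 2, 3, 3, 2, 1, 1, 3]
predicted_tertiles = [1, 2, 3, 2, 2, 1, 3, 3]
accuracy = classification_accuracy(true_tertiles, predicted_tertiles)  # 6 of 8 correct
```

With three roughly equal classes, always guessing the same tertile would score about one third, which is a useful reference point when reading the reported accuracies.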

Conclusions: Open-source artificial intelligence can predict the impact factor and Eigenfactor score tertiles of accepting peer-reviewed journals. Further studies are required to examine the effect of such recommender systems on publication success and time to publication.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10031443
DOI: http://dx.doi.org/10.2196/42789

Similar Publications

Background: Predicting treated language improvement (TLI) and transfer to the untreated language (cross-language generalization, CLG) after speech-language therapy in bilingual individuals with poststroke aphasia is crucial for personalized treatment planning. This study evaluated machine learning models to predict TLI and CLG and identified the key predictive features (eg, patient severity, demographics, and treatment variables) aligning with clinical evidence.

Methods: Forty-eight Spanish-English bilingual individuals with poststroke aphasia received 20 sessions of semantic feature-based naming treatment in either their first or second language.

High-Density Lipoprotein Lipid and Protein Cargo and Cholesterol Efflux Capacity Before and After Bariatric Surgery.

Arterioscler Thromb Vasc Biol

January 2025

Department of Medicine, Leon H. Charney Division of Cardiology (S.Z., B.-X.L., A.C., M.F., E.A.F., S.P.H.).

Background: Cholesterol efflux capacity (CEC) of HDL (high-density lipoprotein) is inversely associated with incident cardiovascular events, independent of HDL cholesterol. Obesity is characterized by low HDL cholesterol and impaired HDL function, such as CEC. Bariatric surgery, including Roux-en-Y gastric bypass (RYGB) and sleeve gastrectomy (SG), broadly leads to improved cardiovascular outcomes, but impacts on risk factors differ by procedure, with greater improvements in weight loss, blood pressure, and glycemic control after RYGB, but greater improvements in HDL cholesterol and CEC levels after SG.

Lignosulfonate as a versatile regulator for the mediated synthesis of Ag@AgCl nanocubes.

Nanoscale

January 2025

State Key Laboratory of Biobased Fiber Manufacturing Technology, Tianjin Key Laboratory of Pulp and Paper, China Light Industry Key Laboratory of Papermaking and Biorefinery, Tianjin University of Science and Technology, No. 29, 13th Street, TEDA, Tianjin 300457, P. R. China.

The remarkable catalytic activity, optical properties, and electrochemical behavior of nanomaterials based on noble metals (NM) are profoundly influenced by their physical characteristics, including particle size, morphology, and crystal structure. Effective regulation of these parameters necessitates a refined methodology. Lignin, a natural aromatic compound abundant in hydroxyl, carbonyl, carboxyl, and sulfonic acid groups, has emerged as an eco-friendly surfactant, reducing agent, and dispersant, offering the potential to precisely control the particle size and morphology of NM-based nanomaterials.

Objective: Serum uric acid (SUA) may play positive roles in diseases associated with oxidative stress, such as osteoporosis (OP). Nevertheless, the specific impact of SUA levels on both bone mineral density (BMD) and the risk of OP remains uncertain. Considering such information crucial for clinicians when making decisions about urate-lowering therapy (ULT), we sought to fill this gap by conducting dose-response meta-analyses.

Gestational diabetes mellitus and subsequent cardiovascular disease in a period of rising diagnoses: Cohort study.

Acta Obstet Gynecol Scand

January 2025

Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Quebec, Canada.

Introduction: Evidence suggests that gestational diabetes mellitus (GDM) is associated with subsequent cardiovascular disease; however, it is unclear what impact changes in screening and diagnostic criteria have had on the association of GDM with long-term outcomes such as cardiovascular disease. The purpose of this study was to determine the association between GDM and subsequent cardiovascular disease during a period of rising gestational diabetes diagnosis in England. Specifically, associations were compared before and after 2008, when national guidelines supporting risk factor-based screening were introduced.
