Towards a unified search: Improving PubMed retrieval with full text.

J Biomed Inform

National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD 20894, USA. Electronic address:

Published: October 2022

Objective: A significant number of recent articles in PubMed have full text available in PubMed Central®, and the availability of full texts has been consistently growing. However, it is not currently possible for a user to simultaneously query the contents of both databases and receive a single integrated search result. In this study, we investigate how to score full text articles given a multitoken query and how to combine those full text article scores with scores originating from abstracts and achieve an overall improved retrieval performance.

Materials And Methods: For scoring full text articles, we propose a method to combine information coming from different sections by converting the traditionally used BM25 scores into log odds ratio scores which can be treated uniformly. We further propose a method that successfully combines scores from two heterogenous retrieval sources - full text articles and abstract only articles - by balancing the contributions of their respective scores through a probabilistic transformation. We use PubMed click data that consists of queries sampled from PubMed user logs along with a subset of retrieved and clicked documents to train the probabilistic functions and to evaluate retrieval effectiveness.

Results And Conclusions: Random ranking achieves 0.579 MAP score on our PubMed click data. BM25 ranking on PubMed abstracts improves the MAP by 10.6%. For full text documents, experiments confirm that BM25 section scores are of different value depending on the section type and are not directly comparable. Naïvely using the body text of articles along with abstract text degrades the overall quality of the search. The proposed log odds ratio scores normalize and combine the contributions of occurrences of query tokens in different sections. By including full text where available, we gain another 0.67%, or 7% relative improvement over abstract alone. We find an advantage in the more accurate estimate of the value of BM25 scores depending on the section from which they were produced. Taking the sum of top three section scores performs the best.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9561061PMC
http://dx.doi.org/10.1016/j.jbi.2022.104211DOI Listing

Publication Analysis

Top Keywords

full text
32
text articles
16
bm25 scores
12
text
10
scores
10
full
9
propose method
8
log odds
8
odds ratio
8
ratio scores
8

Similar Publications

Background: Health worker migration from Nigeria poses significant challenges to the Nigerian health care sector and has far-reaching implications for health care systems globally. Understanding the factors driving migration, its effects on health care delivery, and potential policy interventions is critical for addressing this complex issue.

Objective: This study aims to comprehensively examine the factors encouraging the emigration of Nigerian health workers, map out the effects of health worker migration on the Nigerian health system, document the loss of investment in health training and education resulting from migration, identify relevant policy initiatives addressing migration, determine the effects of Nigerian health worker migration on destination countries, and identify the benefits and demerits to Nigeria of health worker migration.

View Article and Find Full Text PDF

Background: Although the benefits of engaging in cardiac rehabilitation are well established, patient perceptions of the changes in their health-related quality of life are poorly documented. This systematic review synthesized qualitative studies on patients' perspectives of change in their health-related quality of life after attending cardiac rehabilitation.

Objective: To identify and synthesize the best available evidence on the perspective of patients living with heart disease about the changes in their health-related quality of life after attending cardiac rehabilitation.

View Article and Find Full Text PDF

Knee kinematics during gait in patients with discoid lateral meniscus: a systematic review.

Minerva Pediatr (Torino)

January 2025

Transalpine Center of Pediatric Sports Medicine and Surgery, University of Milano-Bicocca, Monza, Monza-Brianza, Italy.

Introduction: The aim of this study was to evaluate the difference between symptomatic discoid lateral meniscus (DLM) and healthy knees in terms of gait analysis.

Evidence Acquisition: A systematic review was conducted from the electronic databases PubMed/MEDLINE, EMBASE and Scopus. The review was performed on studies that reported data on kinematics, gait analysis, biomechanics in discoid lateral meniscus, before and after surgery.

View Article and Find Full Text PDF

To systematically evaluate the differences in the clinical efficacy of lumbar degenerative disorders (LDDs) treatment between oblique lumbar interbody fusion with percutaneous pedicle screw fixation (OLIF-PF), OLIF stand-alone (OLIF-SA), and OLIF with anterolateral screw fixation (OLIF-AF). A systematic search was conducted on both English and Chinese databases, wherein the literature was screened based on title, abstract, and full text. Literature that met the inclusion criteria was assessed for quality and relevant information was extracted.

View Article and Find Full Text PDF

Background: Double J Stent is one of the procedures frequently performed in the field of urology. Forgotten DJ Stent is a problem that can cause serious complications. This systematic review aims to explore complications and management of patients with forgotten double J stents.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!