Automatically created concept graphs using descriptive keywords in the medical domain.

Methods Inf Med

L3S Research Center, Leibniz University, 30167 Hannover, Germany.

Published: July 2008

Objectives: Besides keyword search, navigational search is an important means to find relevant information in digital object collections. Such navigation is often supported by categorization systems or thesauri, which provide a hierarchical view on a particular domain and allow for browsing digital collections. Existing categorization systems, however, require large and expensive efforts for the manual creation and maintenance. Our Semantic GrowBag algorithm fully automatically creates concept graphs, i.e. directed graphs similar to categorization systems but without strong subsumption semantics. This article sketches our algorithm and evaluates it for the medical domain.

Methods: Our Semantic GrowBag algorithm uses descriptive keywords and exploits higher-order co-occurrences between them to create concept graphs (so-called GrowBag graphs) from annotated object collections. In this study, we have automatically created more than 2000 GrowBag graphs based on the Medline data set to show the applicability of our algorithm in the medical domain. For the evaluation, we first compared our algorithm to a baseline algorithm that does not take higher-order co-occurrences into account, and then compared the resulting GrowBag graphs systematically against the manually crafted MeSH thesaurus.

Results: Our experiments revealed that the Semantic GrowBag approach essentially increases the number of relevant relationships in comparison to a baseline approach by about 50%. Furthermore, the identified relations usually correspond to and hardly ever contradict to relationships as stated by MeSH.

Conclusions: The Semantic GrowBag algorithm allows creating concept graphs fully automatically. While it does not systematically exploit specifics of a domain (such as the fundamental separation between 'drugs' and 'therapy' in MeSH), the resulting GrowBag graphs are nevertheless well-suited to support navigation in digital object collections. Moreover, they can also be used to help maintaining existing categorization systems based on the actual usage of categories.

Download full-text PDF

Source
http://dx.doi.org/10.3414/me0492DOI Listing

Publication Analysis

Top Keywords

concept graphs
16
categorization systems
16
semantic growbag
16
growbag graphs
16
object collections
12
growbag algorithm
12
graphs
9
automatically created
8
descriptive keywords
8
medical domain
8

Similar Publications

Unlabelled: Since the inception of transplantation, it has been crucial to ensure that organ or tissue donations are made with valid informed consent to avoid concerns about coercion or exploitation. This issue is particularly challenging when it comes to infants and younger children, insofar as they are unable to provide consent. Despite their vulnerability, infants' organs and tissues are considered valuable for biomedical purposes due to their size and unique properties.

View Article and Find Full Text PDF

Social groups represent a collective identity defined by a distinct consensus of concepts (e.g., ideas, values, and goals) whose structural relationship varies between groups.

View Article and Find Full Text PDF

Analysis of longitudinal social media for monitoring symptoms during a pandemic.

J Biomed Inform

January 2025

School of Public Health, Zhejiang University School of Medicine, Hangzhou 310058 China; Department of Medicine, Harvard Medical School, Boston, MA 02115, USA. Electronic address:

Objective: Current studies leveraging social media data for disease monitoring face challenges like noisy colloquial language and insufficient tracking of user disease progression in longitudinal data settings. This study aims to develop a pipeline for collecting, cleaning, and analyzing large-scale longitudinal social media data for disease monitoring, with a focus on COVID-19 pandemic.

Materials And Methods: This pipeline initiates by screening COVID-19 cases from tweets spanning February 1, 2020, to April 30, 2022.

View Article and Find Full Text PDF

Stroke is the main cause of disability among neurological diseases. There are questions of the accuracy of topical diagnosis and rehabilitation prognosis in clinical practice. Answers to these questions may be given by an approach to the study of the nervous system as a dynamic network consisting of a set of brain regions with anatomical and functional connections between them.

View Article and Find Full Text PDF

Background: The worldwide scarcity of nurses is a pressing concern, with the World Health Organization predicting a deficit of 5.9 million nurses globally by 2025. Notably, 89% of this shortage is expected to impact low- and middle-income countries.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!