Targeting stopwords for quality assurance of SNOMED-CT.

Int J Med Inform

School of Computer Science, University College Dublin, Belfield, Dublin 4, Ireland.

Published: November 2022

Objective: We assess the potential of exploiting stopwords in biomedical concept names to complete the logical definitions of concepts that are not sufficiently defined.

Methods: Concepts containing stopwords are selected from the Disorder hierarchy of Systematized NOmenclature of MEDicine (SNOMED-CT). SNOMED-CT consists of two types of concepts: Fully Defined (FD) concepts which are sufficiently defined and Partially Defined (PD) concepts which are not sufficiently defined. In this work, FD concepts containing stopwords are treated as a source of ground truth to complete the definitions of, lexically and semantically similar, PD concepts. FD and PD concepts are lexically and semantically analysed to create sample-sets. Mandatory attribute-relationships are calculated by using an intersection-set logic for each FD sample-set. PD sample-sets are audited against this mandatory attribute-relationship template to identify inconsistencies in modelling styles and potentially missing attribute-relationships.

Results: Lexical and semantic patterns around 11 stopwords were analysed. 26 sample-sets were extracted for the 11 stopwords. Mandatory attribute-relationships were identified for 24 of the 26 sample-sets. The method identified 62.5% - 72.22% of the PD concepts, containing the stopwords in and due to, to be inconsistent in their modelling style and potentially missing at least one attribute-relationship according to the created template.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ijmedinf.2022.104870DOI Listing

Publication Analysis

Top Keywords

concepts stopwords
12
concepts
9
defined concepts
8
concepts defined
8
lexically semantically
8
mandatory attribute-relationships
8
stopwords
6
targeting stopwords
4
stopwords quality
4
quality assurance
4

Similar Publications

Targeting stopwords for quality assurance of SNOMED-CT.

Int J Med Inform

November 2022

School of Computer Science, University College Dublin, Belfield, Dublin 4, Ireland.

Objective: We assess the potential of exploiting stopwords in biomedical concept names to complete the logical definitions of concepts that are not sufficiently defined.

Methods: Concepts containing stopwords are selected from the Disorder hierarchy of Systematized NOmenclature of MEDicine (SNOMED-CT). SNOMED-CT consists of two types of concepts: Fully Defined (FD) concepts which are sufficiently defined and Partially Defined (PD) concepts which are not sufficiently defined.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!